Test Dataset
Datasets | Description | Vector Dimensions | Number of Vectors | Number of Queries | Data Volume | Metric Type |
---|---|---|---|---|---|---|
Sift-128-euclidean | Collected based on Texmex and uses SIFT algorithm to get image feature vectors | 128 | 1,000,000 | 10,000 | 488 MB | L2 |
Gist-960-euclidean | Collected based on Texmex and uses GIST algorithm to get image feature vectors | 960 | 1,000,000 | 1,000 | 3.57 GB | L2 |
Glove-200-angular | Collected based on network text data and uses GloVe algorithm to get word vectors | 200 | 1,183,514 | 10,000 | 902 MB | IP |
Deep-image-96-angular | Trained GoogLeNet on ImageNet and extracted from the last neural network layer to get vectors | 96 | 9,990,000 | 10,000 | 3.57 GB | IP |
Glove-100-angular | Collected based on network text data and uses GloVe algorithm to get word vectors | 100 | 1,183,514 | 10,000 | 463 MB | IP |
Mnist-784-euclidean | Collected from MNIST database | 784 | 60,000 | 10,000 | 179 MB | L2 |
Table 22 Test Dataset
Updated about 1 year ago