Approximate nearest neighbor methods and vector models
https://www.slideshare.net/erikbern/approximate-nearest-neighbor-methods-and-vector-models-nyc-ml-meetup
Source: slides by a Spotify engineering lead
The overall idea: start with high-dimensional data, run dimension reduction down to 10-1000 dimensions, then do the work in that lower-dimensional space.
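The notes do not say which dimension reduction method is used, so the following is only a minimal sketch of the "reduce, then work in the small space" step, using Gaussian random projection as a stand-in. The function name `random_projection` and the sizes (500 input dimensions, 10 output dimensions) are assumptions made for illustration.

```python
import random

def random_projection(vectors, out_dim, seed=0):
    """Project each vector onto out_dim random Gaussian directions."""
    rng = random.Random(seed)
    in_dim = len(vectors[0])
    scale = 1.0 / out_dim ** 0.5
    # One random direction per output dimension.
    directions = [[rng.gauss(0.0, 1.0) * scale for _ in range(in_dim)]
                  for _ in range(out_dim)]
    return [[sum(d * x for d, x in zip(direction, vec))
             for direction in directions]
            for vec in vectors]

# Hypothetical data: 20 vectors with 500 dimensions each.
rng = random.Random(1)
high_dim = [[rng.random() for _ in range(500)] for _ in range(20)]
low_dim = random_projection(high_dim, out_dim=10)
print(len(low_dim), len(low_dim[0]))
```

In practice the low-dimensional vectors would more likely come from a learned model, and the nearest neighbor index is then built on those vectors.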
Building an Annoy index
Start with the point set and split it into two halves. Split each half again, recursively, until at most k items remain in each leaf. The result is a binary tree, which takes about n/k memory instead of n.
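The recursive splitting can be sketched as below. This is an illustration, not Annoy's actual implementation: each split samples two points and uses the hyperplane equidistant from them, and the dict-based node layout and the names `build_tree` and `leaves` are assumptions.

```python
import math
import random

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def build_tree(points, ids, k, rng):
    """Recursively split ids until each leaf holds at most k items."""
    if len(ids) <= k:
        return {"leaf": ids}
    # Sample two points; the split is the hyperplane equidistant from them.
    a, b = (points[i] for i in rng.sample(ids, 2))
    diff = [x - y for x, y in zip(a, b)]
    norm = math.sqrt(sum(d * d for d in diff))
    if norm == 0.0:
        # Identical sample points: fall back to an arbitrary halving.
        normal, offset = [0.0] * len(diff), 0.0
        mid = len(ids) // 2
        left, right = ids[:mid], ids[mid:]
    else:
        normal = [d / norm for d in diff]
        offset = -sum(n * (x + y) / 2.0 for n, x, y in zip(normal, a, b))
        left = [i for i in ids if dot(normal, points[i]) + offset <= 0]
        right = [i for i in ids if dot(normal, points[i]) + offset > 0]
    return {"normal": normal, "offset": offset,
            "left": build_tree(points, left, k, rng),
            "right": build_tree(points, right, k, rng)}

def leaves(node):
    """Yield the id list stored in every leaf of the tree."""
    if "leaf" in node:
        yield node["leaf"]
    else:
        yield from leaves(node["left"])
        yield from leaves(node["right"])

# Hypothetical data: 200 random points in 2 dimensions, k = 10.
rng = random.Random(0)
points = [[rng.random(), rng.random()] for _ in range(200)]
tree = build_tree(points, list(range(200)), k=10, rng=rng)
leaf_sizes = [len(leaf) for leaf in leaves(tree)]
print(len(leaf_sizes), max(leaf_sizes))
```

Every point ends up in exactly one leaf, and no leaf holds more than k items.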
Search
The closest point isn't necessarily in the same leaf of the binary tree as the query: two points that are really close may end up on different sides of a split. The fix is to go down both sides of a split when the query is close to it.
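A minimal sketch of "go both sides if it's close" for a single tree follows. It repeats a compact tree builder so that it runs on its own. The `threshold` parameter (the maximum distance from the splitting hyperplane at which the far side is also visited) and the function names are assumptions for illustration, not Annoy's API.

```python
import math
import random

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def build_tree(points, ids, k, rng):
    """Split by hyperplanes between two sampled points until leaves have <= k ids."""
    if len(ids) <= k:
        return {"leaf": ids}
    a, b = (points[i] for i in rng.sample(ids, 2))
    diff = [x - y for x, y in zip(a, b)]
    norm = math.sqrt(sum(d * d for d in diff))
    if norm == 0.0:  # identical sample points: arbitrary halving
        normal, offset = [0.0] * len(diff), 0.0
        left, right = ids[:len(ids) // 2], ids[len(ids) // 2:]
    else:
        normal = [d / norm for d in diff]  # unit normal: margins are distances
        offset = -sum(n * (x + y) / 2.0 for n, x, y in zip(normal, a, b))
        left = [i for i in ids if dot(normal, points[i]) + offset <= 0]
        right = [i for i in ids if dot(normal, points[i]) + offset > 0]
    return {"normal": normal, "offset": offset,
            "left": build_tree(points, left, k, rng),
            "right": build_tree(points, right, k, rng)}

def candidates(node, q, threshold, out):
    """Collect leaf ids, also descending the far side of any close split."""
    if "leaf" in node:
        out.extend(node["leaf"])
        return out
    # Signed distance from the query to the splitting hyperplane.
    m = dot(node["normal"], q) + node["offset"]
    near, far = ((node["right"], node["left"]) if m > 0
                 else (node["left"], node["right"]))
    candidates(near, q, threshold, out)
    if abs(m) < threshold:
        candidates(far, q, threshold, out)
    return out

rng = random.Random(0)
points = [[rng.random(), rng.random()] for _ in range(200)]
tree = build_tree(points, list(range(200)), k=10, rng=rng)
query = [0.5, 0.5]
strict = candidates(tree, query, 0.0, [])   # one leaf only
relaxed = candidates(tree, query, 0.1, [])  # also crosses nearby splits
print(len(strict), len(relaxed))
```

With a threshold of 0 the search visits a single leaf. Raising the threshold trades speed for a larger candidate set that is more likely to contain the true nearest neighbors.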
Tricks (query structure)
Use a priority queue to search all trees at the same time until k candidate items have been collected. Take the union of the candidates and remove duplicates, compute the distance for the remaining items, and return the nearest ones.
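The query steps above can be sketched with one priority queue shared by several trees. This is an illustrative sketch rather than Annoy's real code. The name `query_forest` and the node layout are assumptions, `search_k` plays the role of the "k items" stopping rule, and the block repeats a compact tree builder so that it runs on its own.

```python
import heapq
import itertools
import math
import random

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def build_tree(points, ids, k, rng):
    """Split by hyperplanes between two sampled points until leaves have <= k ids."""
    if len(ids) <= k:
        return {"leaf": ids}
    a, b = (points[i] for i in rng.sample(ids, 2))
    diff = [x - y for x, y in zip(a, b)]
    norm = math.sqrt(sum(d * d for d in diff))
    if norm == 0.0:  # identical sample points: arbitrary halving
        normal, offset = [0.0] * len(diff), 0.0
        left, right = ids[:len(ids) // 2], ids[len(ids) // 2:]
    else:
        normal = [d / norm for d in diff]
        offset = -sum(n * (x + y) / 2.0 for n, x, y in zip(normal, a, b))
        left = [i for i in ids if dot(normal, points[i]) + offset <= 0]
        right = [i for i in ids if dot(normal, points[i]) + offset > 0]
    return {"normal": normal, "offset": offset,
            "left": build_tree(points, left, k, rng),
            "right": build_tree(points, right, k, rng)}

def query_forest(trees, points, q, n, search_k):
    """Return the n nearest ids among roughly search_k candidates."""
    tie = itertools.count()  # tie-breaker so the heap never compares dicts
    # Max-heap on "how safely q lies on this side" (stored negated).
    heap = [(-math.inf, next(tie), root) for root in trees]
    heapq.heapify(heap)
    found = set()  # a set gives the union of all trees without duplicates
    while heap and len(found) < search_k:
        neg_bound, _, node = heapq.heappop(heap)
        if "leaf" in node:
            found.update(node["leaf"])
            continue
        m = dot(node["normal"], q) + node["offset"]
        bound = -neg_bound
        heapq.heappush(heap, (-min(bound, m), next(tie), node["right"]))
        heapq.heappush(heap, (-min(bound, -m), next(tie), node["left"]))
    # Exact distances are computed only for the remaining candidates.
    return sorted(found, key=lambda i: math.dist(points[i], q))[:n]

rng = random.Random(0)
points = [[rng.random() for _ in range(5)] for _ in range(500)]
trees = [build_tree(points, list(range(500)), 10, rng) for _ in range(8)]
q = [0.5] * 5
approx = query_forest(trees, points, q, n=5, search_k=100)
print(approx)
```

A larger `search_k` inspects more leaves and gets closer to the exact answer. Once it exceeds the number of points, the result matches a brute-force search.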
Installing Annoy and PyNNDescent (another approximate nearest neighbor library) with pip:
pip install --user annoy
pip install pynndescent
Reference
More information on this topic (Approximate nearest neighbor methods and vector models) can be found at https://velog.io/@jkl133/Approximate-nearest-neighbor-methods-and-vector-models
The text may be freely shared or copied, provided that this document's URL is kept as a reference URL.