Embeddings convert text (or images, audio) into arrays of numbers (vectors) that capture meaning. "King" and "Queen" have similar embeddings (both royalty). "King" and "Pizza" are far apart. Lets computers understand semantic similarity. Used for: search (find similar docs), recommendations (similar products), RAG (find relevant context), clustering (group similar items). Generated by models like OpenAI ada-002, Cohere. Store in vector databases.
Use embeddings for: semantic search (search by meaning not keywords), RAG systems (find relevant docs), recommendations (similar items), clustering/classification (group by meaning), or duplicate detection. Generate once, store in vector DB, search fast. Essential for any AI feature that needs to "find similar" items. OpenAI charges ~$0.0001 per 1K tokens to generate embeddings.
Ai Vocabulary
Convert words to numbers that capture meaning