Demystifying Text Embeddings: How Language Becomes Data