Soohoon, co-founder of Greptile, explores the challenges of developing AI for codebase semantic search, highlighting the complexities compared to searching textual data like books. Unlike natural language, code and queries often lack direct semantic similarity, making it difficult to efficiently match code snippets to queries. Experiments show that translating code into natural language descriptions enhances the performance of semantic searches, as these representations yield higher similarity scores with queries. Additionally, the granularity of chunking is crucial; smaller, more focused chunks, such as individual functions, improve search accuracy by reducing noise, whereas larger chunks, like entire files, introduce irrelevant information that diminishes retrieval quality.