any thoughts on embeddings for a codebase? should there be a vector index for each file (each with a generated summary), plus one giant list index on top of them? or is there a better way to design the indices? the goal is Q&A with an LLM on specific code as well as high-level questions, e.g., "how do files X and Y implement Z?"
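
To make the two-level layout in the question concrete, here is a rough sketch: each file gets its own small index (a summary vector plus chunk vectors), and a top-level pass over the summaries routes a query to files before searching their chunks. Everything in it is an assumption for illustration only: `embed` is a toy bag-of-words stand-in for a real embedding model, the summaries are hand-written rather than LLM-generated, and `FileIndex`, `retrieve`, and the sample files are hypothetical names, not any library's API.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass, field


def embed(text: str) -> Counter:
    # Toy stand-in for a real embedding model: a bag-of-words count vector.
    return Counter(re.findall(r"[a-z0-9_]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


@dataclass
class FileIndex:
    # Hypothetical per-file index: one summary vector plus one vector per chunk.
    path: str
    summary: str  # assumed to come from an LLM in practice; hand-written here
    chunks: list[str]
    summary_vec: Counter = field(init=False)
    chunk_vecs: list[Counter] = field(init=False)

    def __post_init__(self) -> None:
        self.summary_vec = embed(self.summary)
        self.chunk_vecs = [embed(c) for c in self.chunks]

    def top_chunks(self, query_vec: Counter, k: int = 1) -> list[str]:
        # Rank this file's chunks by similarity to the query.
        ranked = sorted(
            zip(self.chunks, self.chunk_vecs),
            key=lambda pair: cosine(query_vec, pair[1]),
            reverse=True,
        )
        return [chunk for chunk, _ in ranked[:k]]


def retrieve(query: str, files: list[FileIndex],
             top_files: int = 2, top_chunks: int = 1) -> list[tuple[str, str]]:
    # Level 1: route the query to files by summary similarity.
    # Level 2: search chunks only inside the routed files.
    q = embed(query)
    routed = sorted(files, key=lambda f: cosine(q, f.summary_vec), reverse=True)
    return [(f.path, chunk)
            for f in routed[:top_files]
            for chunk in f.top_chunks(q, top_chunks)]


# Made-up sample data, purely for illustration.
files = [
    FileIndex("auth.py", "login and session token handling",
              ["def login(user, password): check password hash",
               "def issue_token(user): create session token"]),
    FileIndex("db.py", "database connection pool and queries",
              ["def connect(url): open database connection",
               "def query(sql): run sql query"]),
    FileIndex("cli.py", "command line argument parsing",
              ["def main(argv): parse arguments"]),
]

hits = retrieve("how is the session token created", files, top_files=1)
print(hits)
```

For a cross-file question like "how do files X and Y implement Z?", raising `top_files` pulls chunks from several files into one context for the LLM. Whether summary-based routing beats a single flat vector index over all chunks probably depends on the codebase, so it may be worth comparing both.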