fix(ingest): validate single file existence against git reference #558
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
🚀 Description
This PR addresses a critical logic gap in single-file (blob) ingestion. Previously, the system checked for file existence only on the local filesystem of the shallow clone, potentially ignoring the specific branch or commit requested. This implementation enforces strict Git-based validation as noted in the codebase TODO.
🛠️ Changes Made
ingest_queryiningestion.pyto usegit rev-parse --verifyto confirm the requested subpath exists within the provided Git reference (commit, tag, or branch).ValueErrorthat informs the user if a file is not found in the requested Git state, preventing the system from falling back to incorrect local file versions.✅ Testing Performed
mainworks as expected.ValueErrorinstead of the version from the default branch.ruff,pydoclint,isort) pass locally.🔗 Related Issues
Closes #557