feat: Add vectorized summary field and migration tools

- Add 'summary' field to Chunk collection (vectorized with text2vec)
- Migrate from Dynamic index to HNSW + RQ for both Chunk and Summary
- Add LLM summarizer module (utils/llm_summarizer.py)
- Add migration scripts (migrate_add_summary.py, restore_*.py)
- Add summary generation utilities and progress tracking
- Add testing and cleaning tools (outils_test_and_cleaning/)
- Add comprehensive documentation (ANALYSE_*.md, guides)
- Remove obsolete files (linear_config.py, old test files)
- Update .gitignore to exclude backups and temp files

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
This commit is contained in:
2026-01-07 22:56:03 +01:00
parent feb215dae0
commit 636ad6206c
40 changed files with 11937 additions and 712 deletions

View File

@@ -0,0 +1,19 @@
@echo off
echo ========================================
echo REPRISE GENERATION RESUMES
echo ========================================
echo.
cd /d "%~dp0"
echo Chunks deja traites:
python -c "import json; p=json.load(open('summary_generation_progress.json')); print(f' -> {p[\"total_processed\"]} chunks traites')" 2>nul || echo -> Aucun chunk traite
echo.
echo Lancement de la generation...
echo (Ctrl+C pour arreter - progression sauvegardee)
echo.
python ..\..\utils\generate_all_summaries.py
pause