Add Library RAG project and cleanup root directory

- Add complete Library RAG application (Flask + MCP server)
  - PDF processing pipeline with OCR and LLM extraction
  - Weaviate vector database integration (BGE-M3 embeddings)
  - Flask web interface with search and document management
  - MCP server for Claude Desktop integration
  - Comprehensive test suite (134 tests)

- Clean up root directory
  - Remove obsolete documentation files
  - Remove backup and temporary files
  - Update autonomous agent configuration

- Update prompts
  - Enhance initializer bis prompt with better instructions

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
This commit is contained in:
2025-12-30 11:57:12 +01:00
parent 48470236da
commit d2f7165120
84 changed files with 26517 additions and 2 deletions

87
generations/library_rag/.gitignore vendored Normal file
View File

@@ -0,0 +1,87 @@
# Python
__pycache__/
*.py[cod]
*$py.class
*.so
.Python
build/
develop-eggs/
dist/
downloads/
eggs/
.eggs/
lib/
lib64/
parts/
sdist/
var/
wheels/
*.egg-info/
.installed.cfg
*.egg
# Virtual environments
venv/
ENV/
env/
.venv/
# IDE
.idea/
.vscode/
*.swp
*.swo
*~
# Environment variables
.env
.env.local
# Logs
*.log
logs/
# OS
.DS_Store
Thumbs.db
# Output files (large generated files)
output/*/images/
output/*/*.json
output/*/*.md
# Keep output folder structure
!output/.gitkeep
# Temporary files
*.tmp
*.bak
*.backup
temp_*.py
cleanup_*.py
# Type checking outputs
mypy_errors.txt
*_errors.txt
# Test PDFs (keep input/ folder but ignore PDFs)
input/*.pdf
# Node artifacts (not a Node.js project)
package-lock.json
# Linear backup files
.linear_project.json.backup
# PRPs directory (project request proposals - temporary)
PRPs/
# Markdown working directory (conversion scripts + large source files)
md/