Metadata-Version: 2.4
Name: viincci-rag
Version: 2.0.0
Summary: Universal multi-domain research system with RAG (Retrieval-Augmented Generation) capabilities
Author-email: MrViincciLeRoy <Viincci@proton.me>
Maintainer-email: MrViincciLeRoy <Viincci@proton.me>
License: MIT
Project-URL: Homepage, https://github.com/viincci/Viincci-RAG
Project-URL: Documentation, https://github.com/viincci/Viincci-RAG
Project-URL: Repository, https://github.com/viincci/Viincci-RAG
Project-URL: Bug Tracker, https://github.com/viincci/Viincci-RAG/issues
Keywords: research,rag,nlp,ai,machine-learning,information-retrieval,multi-domain
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Science/Research
Classifier: Intended Audience :: Education
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Operating System :: OS Independent
Requires-Python: >=3.9
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: requests>=2.31.0
Requires-Dist: beautifulsoup4>=4.12.0
Requires-Dist: lxml>=4.9.0
Requires-Dist: pandas>=2.0.0
Requires-Dist: torch>=2.0.0
Requires-Dist: transformers>=4.35.0
Requires-Dist: sentence-transformers>=2.2.0
Requires-Dist: faiss-cpu>=1.7.4
Requires-Dist: PyPDF2>=3.0.0
Requires-Dist: wikipedia-api>=0.6.0
Provides-Extra: dev
Requires-Dist: pytest>=7.4.0; extra == "dev"
Requires-Dist: pytest-cov>=4.1.0; extra == "dev"
Requires-Dist: pytest-mock>=3.11.0; extra == "dev"
Requires-Dist: flake8>=6.1.0; extra == "dev"
Requires-Dist: black>=23.0.0; extra == "dev"
Provides-Extra: cuda
Requires-Dist: torch>=2.0.0; extra == "cuda"
Provides-Extra: all
Requires-Dist: accelerate>=0.24.0; extra == "all"
Requires-Dist: bitsandbytes>=0.41.0; extra == "all"
Dynamic: license-file

# Viincci-RAG

> Universal multi-domain research system with RAG (Retrieval-Augmented Generation) capabilities

[![Python 3.9+](https://img.shields.io/badge/python-3.9+-blue.svg)](https://www.python.org/downloads/)
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
[![Status: Beta](https://img.shields.io/badge/Status-Beta-orange.svg)]()

## 🎓 Try It Now (Google Colab)

**No installation required!** Run these notebooks in your browser:

- [![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/MrViincciLeRoy/Viincci-RAG/blob/main/viincci_rag/examples/test_examples.ipynb) **Minimal Examples** — Safe mock mode + real SerpAPI integration
- [![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/MrViincciLeRoy/Viincci-RAG/blob/main/Test.ipynb) **Complete Testing** — All domains (poetry, medical, botany, art, carpentry)

## 🚀 Quick Start

### Installation

```bash
# Install from source
pip install -e .

# With development dependencies
pip install -e ".[dev]"

# All features
pip install -e ".[all]"
```

### Basic Usage

```python
from viincci_rag import ConfigManager, RAGSystem, UniversalResearchSpider

# Initialize configuration
config = ConfigManager(domain="botany")

# Create RAG system
rag = RAGSystem(config)
rag.load_llm()

# Create research spider
spider = UniversalResearchSpider(config)

# Or import all classes
from viincci_rag import *
```

## 📦 What's Included

| Component | Purpose |
|-----------|---------|
| `ConfigManager` | Configuration management with domain support |
| `RAGSystem` | Retrieval-Augmented Generation pipeline |
| `UniversalResearchSpider` | Multi-domain research and web scraping |
| `UniversalArticleGenerator` | Content generation for any domain |
| `SerpAPIMonitor` | API credit monitoring and management |
| `FloraDatabase` | Database operations and management |

## 🧪 Testing

```bash
# Run all tests
pytest tests/

# Run integration tests
pytest tests/test_integration.py -v

# With coverage report
pytest tests/ --cov=viincci_rag --cov-report=html
```

## 📚 Documentation

All documentation has been moved to the `docs/` folder:

- **[docs/DOCS.md](docs/DOCS.md)** - Documentation index and quick reference
- **[docs/MIGRATION.md](docs/MIGRATION.md)** - Complete migration guide  
- **[docs/RESTRUCTURING_SUMMARY.md](docs/RESTRUCTURING_SUMMARY.md)** - Architecture overview
- **[docs/BACKWARD_COMPATIBILITY_SHIMS.md](docs/BACKWARD_COMPATIBILITY_SHIMS.md)** - Technical details
- **[docs/package_structure.md](docs/package_structure.md)** - Packaging guide

## 🔄 Backward Compatibility

All old imports continue to work:

```python
# Old import (still works)
from V4 import ConfigManager, RAGSystem

# New import (recommended)
from viincci_rag import ConfigManager, RAGSystem

# Both are identical
```

## 📋 Project Structure

```
viincci_rag/
├── core/              # Core RAG modules with wrappers
├── database/          # Database adapters
├── utils/             # Utility functions
├── config/            # Configuration files
└── templates/         # Output templates

V4/                    # Original codebase (unchanged)
docs/                  # Documentation
tests/                 # Test suite
```

## 🤝 Contributing

Contributions welcome! Please:

1. Fork the repository
2. Create a feature branch
3. Make your changes
4. Run tests: `pytest tests/`
5. Submit a Pull Request

## 📄 License

MIT License - See [LICENSE](LICENSE) file for details

## 📞 Support

- 📧 Issues: [GitHub Issues](https://github.com/viincci/viincci-rag/issues)
- 📖 Documentation: [docs/](docs/)
- 💬 Discussions: [GitHub Discussions](https://github.com/viincci/viincci-rag/discussions)

---

**Version**: 4.0.0 | **Status**: Beta | **License**: MIT
```

## ✨ Features

- 🔬 **Multi-Domain Research**: Botany, medical, mathematics, carpentry, and more
- 🤖 **RAG System**: Retrieval-Augmented Generation for intelligent answers
- � **Multiple Database Backends**: SQLite, PostgreSQL, MongoDB, MySQL
- 🎯 **API Monitoring**: Built-in SerpAPI credit tracking
- ⚙️ **Fully Configurable**: Models, databases, content processing
- ✅ **Tested & Documented**: Comprehensive test suite and documentation
- 🔄 **Backward Compatible**: All old imports still work
# Clone the repository
git clone https://github.com/yourusername/viincci-rag.git
cd viincci-rag

# Install in development mode
pip install -e .

# Or install with all dependencies
pip install -e ".[all]"
```

### For Development

```bash
# Install with dev dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Or use the built-in test command
viincci-test
```

## 🔑 Setup

1. **Get a SerpAPI Key**: Sign up at [serpapi.com](https://serpapi.com/)

2. **Set Environment Variable**:
   ```bash
   export SERP_API_KEY='your_api_key_here'
   ```

3. **Verify Installation**:
   ```bash
   viincci-research --list-domains
   ```

## 📖 Usage

### Command Line Interface

#### Basic Research Article
```bash
viincci-research -q "Rosa rubiginosa" -d botany
```

#### Plain Text Output
```bash
viincci-research -q "diabetes" -d medical --format text
```

#### JSON Structured Output
```bash
viincci-research -q "Pythagorean theorem" -d mathematics --format json
```

#### Arts & Humanities Research
```bash
viincci-research -q "Impressionism" -d art_history
viincci-research -q "Shakespeare sonnets" -d literature
```

#### Creative Writing with RAG
```bash
# Generate a poem
viincci-research -q "Van Gogh" -d art_history --content-type poem --rag

# Generate an essay
viincci-research -q "Baroque music" -d music --content-type essay --rag
```

#### Check API Status
```bash
viincci-research --check-credits
```

### Python API

```python
from V4 import ConfigManager, UniversalResearchSpider, RAGSystem, UniversalArticleGenerator

# Initialize configuration for a domain
config = ConfigManager(domain="mathematics", verbose=True)

# Perform research
spider = UniversalResearchSpider(config)
sources = spider.research("Pythagorean theorem")

# Generate article with RAG
rag = RAGSystem(config)
texts = [s['text'] for s in sources]
metadata = [s['metadata'] for s in sources]
rag.build_index(texts, metadata)
rag.load_llm()

generator = UniversalArticleGenerator(config, rag_system=rag)
article = generator.generate_full_article("Pythagorean theorem", sources)

# Save article
with open("article.html", "w") as f:
    f.write(article)
```

### Domain Information

```bash
# List all domains
viincci-research --list-domains

# Get detailed domain info
viincci-research --domain-info medical
```

## 🏗️ Architecture

```
viincci-rag/
├── V4/                          # Main package
│   ├── __init__.py             # Package exports
│   ├── ConfigManager.py        # Configuration management
│   ├── Spider.py               # Web scraping & search
│   ├── RagSys.py              # RAG system implementation
│   ├── UniversalArticleGenerator.py  # Article generation
│   ├── ApiMonitor.py          # API credit monitoring
│   ├── FloraDatabase.py       # Database operations
│   ├── config/                # Configuration files
│   │   ├── domains.json       # Domain definitions
│   │   ├── ai_settings.json   # AI model settings
│   │   ├── api_monitor.json   # API monitoring config
│   │   └── ...
│   └── db/                    # Database directory
├── research_cli.py            # Command-line interface
├── test_v4.py                # Test suite
├── requirements.txt          # Dependencies
├── setup.py                  # Package setup
└── README.md                 # This file
```

## 🧪 Testing

Run the comprehensive test suite:

```bash
# Using pytest
pytest

# Using built-in test runner
viincci-test

# With verbose output
viincci-test --verbose

# Run specific test
pytest tests/test_config.py -v
```

## 📊 Configuration

All configuration is stored in `V4/config/` as JSON files:

- **domains.json**: Define research domains, sources, questions
- **ai_settings.json**: LLM and embedding model settings
- **api_monitor.json**: API usage thresholds and alerts
- **search_config.json**: Web scraping parameters
- **domain_reliability.json**: Source reliability scores

### Example: Add a New Domain

Edit `V4/config/domains.json`:

```json
{
  "your_domain": {
    "name": "Your Domain Name",
    "description": "Description of your domain",
    "primary_sources": ["university", "research_institute"],
    "questions": [
      "what are the key concepts",
      "what are the applications"
    ],
    "keywords": ["keyword1", "keyword2"]
  }
}
```

## 🔧 Advanced Features

### RAG System Customization

```python
from V4 import ConfigManager, RAGSystem

config = ConfigManager()

# Change LLM model
config.set_llm_model("LiquidAI/LFM-40B-MoE")

# Initialize RAG with custom settings
rag = RAGSystem(config)
rag.load_llm(device="cuda", load_in_8bit=True)

# Query with custom parameters
result = rag.query(
    "What are the benefits?",
    k=10,
    max_new_tokens=500,
    temperature=0.8
)
```

### API Cost Estimation

```python
from V4 import SerpAPIMonitor, ConfigManager

config = ConfigManager()
monitor = SerpAPIMonitor(config)

# Estimate research cost
estimate = monitor.estimate_research_cost("Plant name", questions=4)
monitor.print_estimate(estimate)

# Check if can afford
if estimate['can_afford']:
    # Proceed with research
    pass
```

## 📚 Documentation

For detailed documentation, visit the [Wiki](https://github.com/yourusername/viincci-rag/wiki).

### Key Topics

- [Installation Guide](https://github.com/yourusername/viincci-rag/wiki/Installation)
- [Configuration Reference](https://github.com/yourusername/viincci-rag/wiki/Configuration)
- [API Documentation](https://github.com/yourusername/viincci-rag/wiki/API)
- [Domain Creation Guide](https://github.com/yourusername/viincci-rag/wiki/Creating-Domains)
- [RAG System Guide](https://github.com/yourusername/viincci-rag/wiki/RAG-System)

## 🤝 Contributing

Contributions are welcome! Please read our [Contributing Guidelines](CONTRIBUTING.md) first.

1. Fork the repository
2. Create a feature branch (`git checkout -b feature/AmazingFeature`)
3. Commit your changes (`git commit -m 'Add some AmazingFeature'`)
4. Push to the branch (`git push origin feature/AmazingFeature`)
5. Open a Pull Request

## 📄 License

This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

## 🙏 Acknowledgments

- [SerpAPI](https://serpapi.com/) for search capabilities
- [Hugging Face](https://huggingface.co/) for transformers and models
- [FAISS](https://github.com/facebookresearch/faiss) for efficient similarity search
- [Wikimedia Commons](https://commons.wikimedia.org/) for images

## 📞 Support

- 📧 Email: Viincci@proton.me
- GitHub: [MrViincciLeRoy](https://github.com/MrViincciLeRoy)
- 🐛 Issues: [GitHub Issues](https://github.com/viincci/Viincci-RAG/issues)
- 💬 Discussions: [GitHub Discussions](https://github.com/viincci/Viincci-RAG/discussions)

## 🗺️ Roadmap

- [ ] Add more research domains
- [ ] Implement caching for search results
- [ ] Add web interface
- [ ] Support for more LLM providers
- [ ] Multilingual support
- [ ] Export to more formats (PDF, DOCX)
- [ ] Integration with reference managers

## 📈 Changelog

See [CHANGELOG.md](CHANGELOG.md) for version history.

---

**Made with ❤️ by the Viincci-RAG Team**
