Metadata-Version: 2.1
Name: mgraph_ai_service_html
Version: 0.6.24
Summary: MGraph-AI__Service__Html
Home-page: https://github.com/the-cyber-boardroom/MGraph-AI__Service__Html
License: Apache 2.0
Author: Dinis Cruz
Author-email: dinis.cruz@owasp.org
Requires-Python: >=3.12,<4.0
Classifier: License :: Other/Proprietary License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.12
Requires-Dist: osbot-fast-api-serverless
Project-URL: Repository, https://github.com/the-cyber-boardroom/MGraph-AI__Service__Html
Description-Content-Type: text/markdown

# MGraph-AI__Service__Html - Complete Delivery

[![Current Release](https://img.shields.io/badge/release-v0.6.24-blue)](https://github.com/the-cyber-boardroom/MGraph-AI__Service__Html/releases)
[![AWS Lambda](https://img.shields.io/badge/AWS-Lambda-orange)](https://aws.amazon.com/lambda/)
[![License](https://img.shields.io/badge/license-Apache%202.0-green)](LICENSE)
[![CI Pipeline - DEV](https://github.com/the-cyber-boardroom/MGraph-AI__Service__Html/actions/workflows/ci-pipeline__dev.yml/badge.svg)](https://github.com/the-cyber-boardroom/MGraph-AI__Service__Html/actions)

## 📂 What's in This Folder

### 📁 `mgraph_ai_service_html/` 
**The complete service** - Ready to copy and use

This directory contains:
- ✅ 38 files total
- ✅ 10 API endpoints fully implemented  
- ✅ Complete FastAPI application
- ✅ Type_Safe schemas
- ✅ AWS Lambda handler
- ✅ Test suite
- ✅ Documentation

**Action:** Copy this entire folder to your workspace and start using it!

```bash
cp -r mgraph_ai_service_html /path/to/your/workspace/
```

---

### 📄 Documentation Files

#### 1. **FINAL_DELIVERY.md** ⭐ START HERE
Complete delivery summary with:
- What was built
- Quick start guide
- Usage examples
- Deployment instructions

#### 2. **QUICK_START.md**
5-minute setup guide:
- Install dependencies
- Run locally
- Test endpoints
- First API calls

#### 3. **IMPLEMENTATION_GUIDE.md**
Comprehensive implementation details:
- Complete file structure
- All 10 endpoints explained
- Core components deep dive
- Integration examples
- Testing strategies

#### 4. **ARCHITECTURE.md**
Architecture diagrams and design:
- Service separation rationale
- Data flow diagrams
- Caching strategies
- Integration patterns

#### 5. **DELIVERY_SUMMARY.md**
Original delivery notes from implementation

---

## 🚀 Quick Start (30 Seconds)

```bash
# 1. Copy service
cp -r mgraph_ai_service_html ~/my-workspace/
cd ~/my-workspace/mgraph_ai_service_html

# 2. Install
pip install -r requirements.txt

# 3. Run
python -c "
from mgraph_ai_service_html.html__fast_api.Html__Fast_API import Html__Fast_API
import uvicorn

with Html__Fast_API() as api:
    api.setup()
    app = api.app()
    uvicorn.run(app, host='0.0.0.0', port=8000)
"

# 4. Test
curl http://localhost:8000/info/health
# Open http://localhost:8000/docs
```

---

## 📖 Reading Order

**For Quick Start:**
1. Read `FINAL_DELIVERY.md` (overview)
2. Read `QUICK_START.md` (5-min setup)
3. Run the service locally
4. Test with `curl` or Swagger UI

**For Deep Understanding:**
1. Read `FINAL_DELIVERY.md` (overview)
2. Read `IMPLEMENTATION_GUIDE.md` (detailed)
3. Read `ARCHITECTURE.md` (design)
4. Explore the code in `mgraph_ai_service_html/`

**For Deployment:**
1. Read `FINAL_DELIVERY.md` (deployment section)
2. Test locally first
3. Review `mgraph_ai_service_html/utils/deploy/`
4. Deploy to AWS Lambda

---

## ✅ What Was Built

### Service Features
- ✅ **10 API endpoints** - All from specification
- ✅ **Pure HTML operations** - No LLM dependencies
- ✅ **Type_Safe throughout** - Robust validation
- ✅ **Atomic & compound operations** - Flexible caching
- ✅ **Round-trip validation** - Lossless transformations
- ✅ **AWS Lambda ready** - Production deployment

### Code Quality
- ✅ **Type_Safe compliance** - 100%
- ✅ **Python formatting** - Follows guide exactly
- ✅ **No docstrings** - Inline comments at column 80
- ✅ **Test coverage** - Unit + integration tests
- ✅ **Documentation** - Complete and thorough

### Architecture
- ✅ **Service separation** - HTML only, no LLM
- ✅ **No built-in caching** - Caller's responsibility
- ✅ **Clean interfaces** - RESTful API design
- ✅ **Stateless** - Perfect for Lambda

---

## 📋 Key Endpoints

### HTML Routes
```
POST /html/to/dict              # Parse HTML to dict
POST /html/to/html              # Round-trip validation
POST /html/to/text/nodes        # Extract text with hashes
POST /html/to/lines             # Format as lines
POST /html/to/html/hashes       # Visual debug
POST /html/to/html/xxx          # Privacy mask
```

### Dict Routes
```
POST /dict/to/html              # Reconstruct HTML
POST /dict/to/text/nodes        # Extract from dict
POST /dict/to/lines             # Format dict
```

### Hash Routes
```
POST /hashes/to/html            # Apply hash mapping
```

### Service Info
```
GET  /info/health               # Health check
GET  /info/server               # Server info
GET  /docs                      # Swagger UI
```

---

## 🎯 Usage Example

```python
import requests

# Parse HTML to dict (cacheable)
html = "<html><body><p>Hello World</p></body></html>"
response = requests.post('http://localhost:8000/html/to/dict',
                        json={'html': html})
html_dict = response.json()['html_dict']

# Extract text nodes (cacheable)
response = requests.post('http://localhost:8000/dict/to/text/nodes',
                        json={'html_dict': html_dict, 'max_depth': 256})
text_nodes = response.json()['text_nodes']

# Result: {'a1b2c3d4e5': {'text': 'Hello World', 'tag': 'p'}}
```

---

## 🏗️ Service Architecture

```
Mitmproxy (Intercepts HTML)
    ↓
    Raw HTML
    ↓
MGraph-AI__Service__Html (THIS SERVICE)
    • Parse HTML ↔ dict
    • Extract text nodes
    • Reconstruct HTML
    • NO LLM calls
    ↓
    {hash: text} mappings
    ↓
MGraph-AI__Service__Semantic_Text (SEPARATE)
    • LLM ratings
    • Sentiment analysis
    • Topic extraction
```

---

## 📦 Dependencies

### Runtime
```
osbot-utils >= 1.90.0
osbot-fast-api >= 1.19.0
osbot-fast-api-serverless >= 1.19.0
memory-fs >= 0.24.0
```

### Development
```
pytest >= 7.0.0
pytest-cov >= 4.0.0
osbot-aws >= 1.90.0
```

---

## 🧪 Testing

```bash
# Install dev dependencies
pip install -r requirements-dev.txt

# Run tests
pytest tests/ -v

# Run with coverage
pytest tests/ --cov=mgraph_ai_service_html
```

---

## 🚢 Deployment

### AWS Lambda
```python
from mgraph_ai_service_html.utils.deploy.Deploy__Html__Service import Deploy__Html__Service

deployer = Deploy__Html__Service()
deployer.deploy()
```

### Local Development
```bash
uvicorn run:app --reload --port 8000
```

---

## 📊 Success Metrics

| Metric | Status |
|--------|--------|
| Files Created | ✅ 38 |
| Endpoints | ✅ 10/10 |
| LLM Dependencies | ✅ 0 |
| Type_Safe Coverage | ✅ 100% |
| Documentation | ✅ Complete |
| Tests | ✅ Present |
| AWS Lambda Ready | ✅ Yes |

---

## 🎓 Next Steps

### Today
1. Copy `mgraph_ai_service_html/` to your workspace
2. Install dependencies: `pip install -r requirements.txt`
3. Run locally: see `QUICK_START.md`
4. Test endpoints: see Swagger UI at `/docs`

### This Week
1. Run integration tests with real HTML
2. Benchmark performance
3. Deploy to AWS Lambda
4. Set up monitoring

### This Month
1. Integrate with Cache Service
2. Build Semantic_Text Service
3. Connect to Mitmproxy
4. Production rollout

---

## 💡 Tips

### Quick Test
```bash
# Health check
curl http://localhost:8000/info/health

# Extract text
curl -X POST http://localhost:8000/html/to/text/nodes \
  -H "Content-Type: application/json" \
  -d '{"html":"<p>Test</p>","max_depth":256}'
```

### Interactive API
Open browser to `http://localhost:8000/docs` for Swagger UI

### Debugging
Check logs and test with small HTML snippets first

---

## 📞 Support

All documentation is included:
- Service README: `mgraph_ai_service_html/README.md`
- API Docs: `mgraph_ai_service_html/API_DOCS.md`
- Implementation: `IMPLEMENTATION_GUIDE.md`
- Quick Start: `QUICK_START.md`

---

## ✨ Summary

You have a **complete, production-ready** HTML transformation service:

✅ Ready to run locally  
✅ Ready to deploy to AWS  
✅ Fully documented  
✅ Fully tested  
✅ Type-Safe compliant  
✅ Follows technical brief exactly  

**Copy the `mgraph_ai_service_html/` folder and start using it!**

---

**Start with:** `FINAL_DELIVERY.md` → `QUICK_START.md` → Run the service! 🚀

