Metadata-Version: 2.4
Name: wfw-ai
Version: 3.1.0
Summary: WANI — Local & Cloud AI inference. Slash commands, tools, zero bloat.
Keywords: llm,gguf,llama,ai,inference,local-ai,huggingface,ollama,wani,wfw-ai
Author: Zain Ali
Requires-Python: >=3.8
Description-Content-Type: text/markdown
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Environment :: Console
License-File: LICENSE
Requires-Dist: requests>=2.28.0
Requires-Dist: tqdm>=4.64.0
Requires-Dist: llama-cpp-python>=0.2.90 ; extra == "all"
Requires-Dist: huggingface-hub>=0.20.0 ; extra == "all"
Requires-Dist: openai>=1.0.0 ; extra == "all"
Requires-Dist: anthropic>=0.20.0 ; extra == "all"
Requires-Dist: openai>=1.0.0 ; extra == "cloud"
Requires-Dist: anthropic>=0.20.0 ; extra == "cloud"
Requires-Dist: huggingface-hub>=0.20.0 ; extra == "hf"
Requires-Dist: llama-cpp-python>=0.2.90 ; extra == "local"
Project-URL: Homepage, https://github.com/zainali/wfw-ai
Project-URL: Repository, https://github.com/zainali/wfw-ai
Provides-Extra: all
Provides-Extra: cloud
Provides-Extra: hf
Provides-Extra: local

# wfw-ai · WANI v3.1

Local and cloud AI inference. Slash commands. Zero bloat.

```bash
pip install wfw-ai
wani
```

## Features

- Local GGUF models via llama-cpp-python
- Cloud: Groq (free) · OpenAI · Anthropic · Gemini · HuggingFace · Ollama
- Auto hardware detection (Oppo A6x, Snapdragon, desktop)
- KV cache quantization (Q8): roughly halves KV-cache memory compared with f16
- Flash Attention + mmap
- HuggingFace direct download with resume
- Slash commands: `/model /download /connect /system /edit /run /read /write /stats /clear`
- Shell shortcut: `//ls -la`
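
The KV-cache saving listed above can be checked with a little arithmetic. The sketch below is not WANI's code. It assumes the usual cache-size formula (keys plus values, per layer, per KV head, per position), llama.cpp's `q8_0` layout of 34 bytes per block of 32 values, and a Llama 3.2 3B shape of 28 layers, 8 KV heads and head dimension 128. Treat those numbers as assumptions to verify against the model card; `kv_cache_bytes` is a hypothetical helper name.

```python
# Hypothetical helper for estimating KV-cache memory; not part of wfw-ai.

# Bytes per cached element for two common cache types.
# f16 stores 2 bytes per value; q8_0 stores 32 int8 values plus one
# 2-byte scale per block, i.e. 34 bytes per 32 values.
BYTES_PER_ELEMENT = {
    "f16": 2.0,
    "q8_0": 34 / 32,
}


def kv_cache_bytes(n_layers, n_kv_heads, head_dim, n_ctx, cache_type="f16"):
    """Return the estimated KV-cache size in bytes for one sequence."""
    elements = 2 * n_layers * n_kv_heads * head_dim * n_ctx  # 2 = keys + values
    return int(elements * BYTES_PER_ELEMENT[cache_type])


if __name__ == "__main__":
    # Assumed shape for Llama 3.2 3B at a 4096-token context.
    shape = dict(n_layers=28, n_kv_heads=8, head_dim=128, n_ctx=4096)
    for cache_type in ("f16", "q8_0"):
        mib = kv_cache_bytes(cache_type=cache_type, **shape) / 2**20
        print(f"{cache_type}: {mib:.0f} MiB")
```

Under these assumptions the cache shrinks from 448 MiB to 238 MiB, a saving of about 47%. The saving applies to the KV cache only, not to the model weights.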

## Install

```bash
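pip install wfw-ai                        # core (cloud only)
pip install "wfw-ai[cloud]"               # + OpenAI and Anthropic SDKs
pip install "wfw-ai[hf]"                  # + huggingface-hub
pip install "wfw-ai[local]"               # + local GGUF support (llama-cpp-python)
pip install "wfw-ai[all]"                 # everything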
```

## Usage

```bash
wani                                      # auto mode
wani --cloud groq --key gsk_...           # Groq cloud
wani --model mymodel.gguf                 # specific local model
wani --download llama-3.2-3b              # download from HuggingFace
wani --list                               # list models
wani --prompt "Hello" --cloud groq        # single prompt
```

## Slash Commands

| Command | Action |
|---------|--------|
| `/model` | list & load local models |
| `/download` | download from HuggingFace |
| `/connect` | manage cloud connections |
| `/system <text>` | set system prompt |
| `/edit <file>` | open file in editor |
| `/run <cmd>` | run shell command |
| `/read <file>` | load file into context |
| `/write <file>` | save response to file |
| `/stats` | hardware + session stats |
| `/clear` | clear history |
| `//cmd` | shell shortcut |

## Environment Variables

```bash
export GROQ_API_KEY=gsk_...
export HF_TOKEN=hf_...
export ANTHROPIC_API_KEY=sk-ant-...
export GEMINI_API_KEY=...
export OPENAI_API_KEY=sk-...
```
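
To see which of these keys are visible to a process before starting a session, a small check like the one below may help. It is a sketch, not part of wfw-ai: the variable names come from the list above, while the provider labels and the `configured_providers` helper are illustrative assumptions.

```python
import os

# Variable names are the ones listed above; the provider labels are
# illustrative and not taken from wfw-ai's source.
PROVIDER_ENV_VARS = {
    "groq": "GROQ_API_KEY",
    "huggingface": "HF_TOKEN",
    "anthropic": "ANTHROPIC_API_KEY",
    "gemini": "GEMINI_API_KEY",
    "openai": "OPENAI_API_KEY",
}


def configured_providers(environ=None):
    """Return provider labels whose key variable is set and non-empty."""
    env = os.environ if environ is None else environ
    return [
        name
        for name, var in PROVIDER_ENV_VARS.items()
        if env.get(var, "").strip()
    ]


if __name__ == "__main__":
    found = configured_providers()
    print("configured:", ", ".join(found) if found else "none")
```

Passing a plain dict instead of `os.environ` makes the helper easy to test without touching the real environment.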

## Oppo A6x / 4GB Devices

These devices are auto-detected, and low-memory settings are applied: 4 threads, Q8 KV cache, mmap on, Flash Attention on.
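
For readers who want to reproduce similar settings directly in llama-cpp-python, the sketch below maps each item to a `Llama()` constructor argument. It is an approximation under stated assumptions, not WANI's actual code: the context size is not given in this README and is assumed, `low_memory_kwargs` and `load_model` are hypothetical helper names, and the constant `8` is the ggml enum value for `q8_0` (exposed as `llama_cpp.GGML_TYPE_Q8_0`).

```python
# A sketch of comparable llama-cpp-python settings; WANI's real values
# and code may differ.

GGML_TYPE_Q8_0 = 8  # ggml enum value for q8_0 (llama_cpp.GGML_TYPE_Q8_0)


def low_memory_kwargs(model_path, n_ctx=2048):
    """Build Llama() keyword arguments matching the settings listed above."""
    return {
        "model_path": model_path,
        "n_ctx": n_ctx,            # assumed context size, not stated in this README
        "n_threads": 4,            # 4 threads
        "type_k": GGML_TYPE_Q8_0,  # Q8 KV cache (keys)
        "type_v": GGML_TYPE_Q8_0,  # Q8 KV cache (values)
        "use_mmap": True,          # mmap on
        "flash_attn": True,        # Flash Attention on
    }


def load_model(model_path):
    """Load a GGUF model with these settings (needs llama-cpp-python)."""
    from llama_cpp import Llama  # lazy import; only needed when loading

    return Llama(**low_memory_kwargs(model_path))
```

With the `local` extra installed, `load_model("mymodel.gguf")` should return a `Llama` object that can be called as `llm("Hello", max_tokens=16)`.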

Recommended models:
- `wani --download llama-3.2-1b` — 700MB
- `wani --download llama-3.2-3b` — 1.8GB ← best for 4GB

---
by Zain Ali · MIT License

