Metadata-Version: 2.4
Name: dittomation
Version: 1.0.0
Summary: Android automation made simple - record, replay, and control Android devices
Author: DittoMation Team
License: MIT
Project-URL: Homepage, https://github.com/OmPrakashSingh1704/DittoMation
Project-URL: Repository, https://github.com/OmPrakashSingh1704/DittoMation
Project-URL: Documentation, https://github.com/OmPrakashSingh1704/DittoMation#readme
Project-URL: Changelog, https://github.com/OmPrakashSingh1704/DittoMation/blob/main/CHANGELOG.md
Project-URL: Issues, https://github.com/OmPrakashSingh1704/DittoMation/issues
Keywords: android,automation,adb,testing,mobile
Classifier: Development Status :: 4 - Beta
Classifier: Environment :: Console
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Programming Language :: Python :: 3.13
Classifier: Topic :: Software Development :: Testing
Classifier: Topic :: System :: Hardware :: Hardware Drivers
Classifier: Typing :: Typed
Requires-Python: >=3.8
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: click>=8.0.0
Provides-Extra: dev
Requires-Dist: pytest>=7.0.0; extra == "dev"
Requires-Dist: pytest-cov>=4.0.0; extra == "dev"
Requires-Dist: black>=23.0.0; extra == "dev"
Requires-Dist: ruff>=0.1.0; extra == "dev"
Requires-Dist: mypy>=1.0.0; extra == "dev"
Requires-Dist: pre-commit>=3.0.0; extra == "dev"
Requires-Dist: bandit>=1.7.0; extra == "dev"
Provides-Extra: cloud
Requires-Dist: boto3>=1.26.0; extra == "cloud"
Requires-Dist: google-cloud-storage>=2.0.0; extra == "cloud"
Requires-Dist: requests>=2.28.0; extra == "cloud"
Provides-Extra: aws
Requires-Dist: boto3>=1.26.0; extra == "aws"
Requires-Dist: requests>=2.28.0; extra == "aws"
Provides-Extra: firebase
Requires-Dist: google-cloud-storage>=2.0.0; extra == "firebase"
Provides-Extra: all
Requires-Dist: boto3>=1.26.0; extra == "all"
Requires-Dist: google-cloud-storage>=2.0.0; extra == "all"
Requires-Dist: requests>=2.28.0; extra == "all"
Requires-Dist: pyyaml>=6.0.0; extra == "all"
Dynamic: license-file

# DittoMation

[![PyPI version](https://img.shields.io/pypi/v/dittomation.svg)](https://pypi.org/project/dittomation/)
[![Python versions](https://img.shields.io/pypi/pyversions/dittomation.svg)](https://pypi.org/project/dittomation/)
[![CI](https://github.com/OmPrakashSingh1704/DittoMation/actions/workflows/ci.yml/badge.svg)](https://github.com/OmPrakashSingh1704/DittoMation/actions/workflows/ci.yml)
[![codecov](https://codecov.io/gh/OmPrakashSingh1704/DittoMation/branch/main/graph/badge.svg)](https://codecov.io/gh/OmPrakashSingh1704/DittoMation)
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)

Android automation framework that records touch interactions, maps them to UI elements, and replays them using smart element location. Supports natural language commands for intuitive automation.

## Features

- **Record** touch gestures (tap, swipe, long press, scroll, pinch) from Android device
- **Replay** recorded workflows with smart element location
- **Natural Language** command execution - describe actions in plain English
- **Smart Locators** - fallback chain (resource-id → content-desc → text → xpath → coordinates)
- **Intent-based App Launching** - reliable app opening via Android intents
- **Variables & Expressions** - dynamic scripts with `{{variable}}` syntax and safe expression evaluation
- **Control Flow** - if/else conditions, for/while/until loops for complex automation

## Requirements

- Python 3.8+
- Android SDK with ADB (Android Debug Bridge)
- Android device/emulator with USB debugging enabled

## Installation

### From PyPI (Recommended)

```bash
pip install dittomation
```

With optional cloud provider support:

```bash
pip install dittomation[aws]      # AWS Device Farm
pip install dittomation[firebase] # Firebase Test Lab
pip install dittomation[all]      # All extras
```

### From Source

1. Clone the repository:

```bash
git clone https://github.com/OmPrakashSingh1704/DittoMation.git
cd DittoMation
```

2. Install the package:

```bash
pip install -e .
```

For development:

```bash
pip install -e .[dev]
```

### Verify Installation

Ensure ADB is accessible:

```bash
adb devices  # Should show your connected device
```

## Project Structure

```
DittoMation/
├── recorder/
│   ├── adb_wrapper.py       # ADB command utilities
│   ├── event_listener.py    # Touch event capture via getevent
│   ├── ui_dumper.py         # UI hierarchy capture & parsing
│   ├── element_matcher.py   # Coordinate to element mapping
│   ├── gesture_classifier.py # Gesture recognition (tap/swipe/etc)
│   ├── workflow.py          # Workflow storage & management
│   ├── main.py              # Recording CLI
│   └── interactive_recorder.py # Manual step-by-step recording
├── replayer/
│   ├── locator.py           # Smart element location
│   ├── executor.py          # Gesture execution via ADB
│   ├── main.py              # Replay CLI
│   ├── text_runner.py       # Plain text command execution
│   └── nl_runner.py         # Natural language execution
├── docs/
│   └── approach.md          # Technical approach documentation
├── output/                  # Generated workflow files
└── README.md
```

## Usage

### Natural Language Runner (Recommended)

Execute Android actions using natural language:

```bash
# Single command
python replayer/nl_runner.py "Open YouTube, search for 'Mr. Beast', play latest video"

# Interactive mode
python replayer/nl_runner.py --interactive

# From file
python replayer/nl_runner.py --file instructions.txt
```

**Supported Commands:**
- `open [app]` - Open an app (Clock, YouTube, Settings, etc.)
- `tap [element]` - Tap on element by text/description
- `long press [element]` - Long press on element
- `swipe up/down/left/right` - Swipe gestures
- `scroll up/down` - Scroll gestures
- `type "text"` - Input text
- `search for "query"` - Search within current app
- `back` / `home` - Navigation
- `wait [seconds]` - Pause execution
- `call [number]` - Make a phone call
- `set alarm for 8:00` - Set an alarm
- `go to [url]` - Open URL in browser
- `play first/latest video` - Play video results
- `copy last number I called` - Copy to clipboard
- `search for that number you copied` - Use clipboard

### Recording Workflows

```bash
# Interactive recording (recommended for emulators)
python recorder/interactive_recorder.py --output my_workflow.json

# Automated recording via getevent (physical devices)
python recorder/main.py --output my_workflow.json
```

### Replaying Workflows

```bash
# Replay a recorded workflow
python replayer/main.py --workflow my_workflow.json

# With custom delay between steps
python replayer/main.py --workflow my_workflow.json --delay 1000
```

### Scripted Automation with Variables

Run automation scripts with variables and control flow:

```bash
# Basic script execution
ditto run script.json

# With command-line variables
ditto run login.json --var username=myuser --var password=secret

# With variables file
ditto run script.json --vars-file config.json --verbose
```

Example script with variables and conditions (`login.json`):

```json
{
  "name": "smart_login",
  "variables": {
    "username": "testuser",
    "max_retries": 3
  },
  "steps": [
    {"action": "open", "app": "MyApp"},
    {
      "action": "if",
      "expr": "element_exists(text='Welcome')",
      "then_steps": [
        {"action": "log", "message": "Already logged in"}
      ],
      "else_steps": [
        {"action": "tap", "text": "Login"},
        {"action": "type", "value": "{{username}}"},
        {"action": "tap", "text": "Submit"}
      ]
    }
  ]
}
```

See [Variables and Control Flow Guide](docs/variables-and-control-flow.md) for complete documentation.

### Text-based Commands

```bash
# Simple text commands
python replayer/text_runner.py "tap Phone; tap Contacts; swipe down"

# From file
python replayer/text_runner.py commands.txt
```

## Examples

### Open YouTube and Play a Video
```bash
python replayer/nl_runner.py "Open YouTube, search for 'cooking tutorial', play first video"
```

### Make a Phone Call
```bash
python replayer/nl_runner.py "Call 1234567890, wait 30, end call"
```

### Set an Alarm
```bash
python replayer/nl_runner.py "Open clock, set alarm for 7:30 am"
```

### Copy and Search
```bash
python replayer/nl_runner.py "Open phone, copy last number I called, go home, open youtube, search for that number you copied"
```

## How It Works

See [docs/approach.md](docs/approach.md) for detailed technical documentation.

## Troubleshooting

### ADB not found
Set the `ANDROID_HOME` environment variable or add ADB to your PATH.

### UI dump timeout
The device may be busy. The tool will retry automatically. You can also:
```bash
adb shell pkill -f uiautomator
```

### Text input drops characters
This can happen on slow devices. The tool types in chunks with delays to mitigate this.

### App not found
Use the exact app name as it appears on the device, or use one of the supported intent-based apps (clock, settings, youtube, etc.).

## Contributing

Contributions are welcome! Please see [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines.

## License

This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
