Metadata-Version: 2.4
Name: emotionlinmult
Version: 0.1.0
Summary: EmotionLinMulT: Transformer-based Emotion Recognition and Estimation.
Project-URL: Documentation, https://github.com/fodorad/emotionlinmult#readme
Project-URL: Issues, https://github.com/fodorad/emotionlinmult/issues
Project-URL: Source, https://github.com/fodorad/emotionlinmult
Author-email: fodorad <foauaai@inf.elte.hu>
License-Expression: MIT
License-File: LICENSE
Keywords: audio,emotion,intensity,multimodal,preprocess,text,video
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Requires-Python: >=3.11
Requires-Dist: decord
Requires-Dist: lightning
Requires-Dist: linmult
Requires-Dist: matplotlib
Requires-Dist: opencv-contrib-python
Requires-Dist: opencv-python
Requires-Dist: pillow
Requires-Dist: prettytable
Requires-Dist: pydub
Requires-Dist: pyyaml
Requires-Dist: scipy
Requires-Dist: seaborn
Requires-Dist: toml
Requires-Dist: torch==2.5.1
Requires-Dist: torchaudio==2.5.1
Requires-Dist: tqdm
Provides-Extra: dev
Requires-Dist: exordium; extra == 'dev'
Requires-Dist: matplotlib; extra == 'dev'
Requires-Dist: seaborn; extra == 'dev'
Requires-Dist: tqdm; extra == 'dev'
Description-Content-Type: text/markdown

# EmotionLinMulT

[![License](https://img.shields.io/badge/license-MIT-yellow.svg)](LICENSE)
[![python](https://img.shields.io/badge/Python-3.11-3776AB.svg?style=flat&logo=python&logoColor=white)](https://www.python.org)
[![pytorch](https://img.shields.io/badge/PyTorch-2.4.1-EE4C2C.svg?style=flat&logo=pytorch)](https://pytorch.org)

LinMulT is trained for categorical emotion recognition and emotion intensity estimation tasks on the RAVDESS dataset.

# Setup

### Install package from PyPI for inference

```
pip install emotionlinmult
```

### Install package for training

```
git clone https://github.com/fodorad/EmotionLinMulT
cd EmotionLinMulT
pip install -e .[all]
pip install -U -r requirements.txt
```

#### Supported extras definitions:

| extras tag | description                                                                               |
| ---------- | ----------------------------------------------------------------------------------------- |
| train      | dependencies for feature extraction, training the model from scratch and visualization    |
| all        | extends the 'train' dependencies for development. currently it is the same as 'train' tag |

# Related projects

### exordium

Collection of preprocessing functions and deep learning methods. This repository contains revised codes for fine landmark detection (including face, eye region, iris and pupil landmarks), head pose estimation, and eye feature calculation.

* code: https://github.com/fodorad/exordium

### (2022) LinMulT

General-purpose Multimodal Transformer with Linear Complexity Attention Mechanism. This base model is further modified and trained for various tasks and datasets.

* code: https://github.com/fodorad/LinMulT

### (2022) PersonalityLinMulT for personality trait and sentiment estimation

LinMulT is trained for Big Five personality trait estimation using the First Impressions V2 dataset and sentiment estimation using the MOSI and MOSEI datasets.

* paper: Multimodal Sentiment and Personality Perception Under Speech: A Comparison of Transformer-based Architectures ([pdf](https://proceedings.mlr.press/v173/fodor22a/fodor22a.pf), [website](https://proceedings.mlr.press/v173/fodor22a.html))
* code: https://github.com/fodorad/PersonalityLinMulT

### (2023) BlinkLinMulT

LinMulT is trained for blink presence detection and eye state recognition tasks.
Our results demonstrate comparable or superior performance compared to state-of-the-art models on 2 tasks, using 7 public benchmark databases.

* paper: BlinkLinMulT: Transformer-based Eye Blink Detection ([pdf](https://adamfodor.com/pdf/2023_Fodor_Adam_MDPI_BlinkLinMulT.pdf), [website](https://www.mdpi.com/2313-433X/9/10/196))
* code: https://github.com/fodorad/BlinkLinMulT

# Citation - BibTex

If you found our research helpful or influential please consider citing:

### (2023) BlinkLinMulT for blink presence detection and eye state recognition

```
@Article{fodor2023blinklinmult,
  title = {BlinkLinMulT: Transformer-Based Eye Blink Detection},
  author = {Fodor, Ádám and Fenech, Kristian and Lőrincz, András},
  journal = {Journal of Imaging},
  volume = {9},
  year = {2023},
  number = {10},
  article-number = {196},
  url = {https://www.mdpi.com/2313-433X/9/10/196},
  PubMedID = {37888303},
  ISSN = {2313-433X},
  DOI = {10.3390/jimaging9100196}
}
```

### (2022) LinMulT for personality trait and sentiment estimation

```
@InProceedings{pmlr-v173-fodor22a,
  title = {Multimodal Sentiment and Personality Perception Under Speech: A Comparison of Transformer-based Architectures},
  author = {Fodor, {\'A}d{\'a}m and Saboundji, Rachid R. and Jacques Junior, Julio C. S. and Escalera, Sergio and Gallardo-Pujol, David and L{\H{o}}rincz, Andr{\'a}s},
  booktitle = {Understanding Social Behavior in Dyadic and Small Group Interactions},
  pages = {218--241},
  year = {2022},
  editor = {Palmero, Cristina and Jacques Junior, Julio C. S. and Clapés, Albert and Guyon, Isabelle and Tu, Wei-Wei and Moeslund, Thomas B. and Escalera, Sergio},
  volume = {173},
  series = {Proceedings of Machine Learning Research},
  month = {16 Oct},
  publisher = {PMLR},
  pdf = {https://proceedings.mlr.press/v173/fodor22a/fodor22a.pdf},
  url = {https://proceedings.mlr.press/v173/fodor22a.html}
}
```

# Contact

* Ádám Fodor (foauaai@inf.elte.hu)
