GitPedia
CAMeL-Lab

CAMeL-Lab/camel_tools

A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.

25 Releases
Latest: 1w ago
camel-tools v1.6.0v1.6.0Latest
owoowo·1w ago·June 8, 2026
GitHub

📋 Changes

  • Supported Python version are 3.11-3.14.
  • Updated dependencies.
  • Updated documentation.
  • Bug fixes for `BERTUnfactoredDisambiguator`.
  • Deprecated `pretrained_cache` in `BERTUnfactoredDisambiguator`.
  • Speed improvements for `simple_word_tokenize`.
camel-tools v1.5.7v1.5.7
owoowo·9mo ago·September 17, 2025
GitHub

Fixed a couple dependency issues.

camel-tools v1.5.6v1.5.6
owoowo·1y ago·April 15, 2025
GitHub

Fixed errors in `camel_morphology` and `camel_diac`.

camel-tools v1.5.5v1.5.5
owoowo·1y ago·August 23, 2024
GitHub

Added Python 3.12 support.

camel-tools v1.5.4v1.5.4
owoowo·1y ago·August 22, 2024
GitHub

Fixed doc generation issues.

camel-tools v1.5.3v1.5.3
owoowo·1y ago·August 22, 2024
GitHub

Added Python 3.11 support.

camel-tools v1.5.2v1.5.2
owoowo·3y ago·March 17, 2023
GitHub

Fixes doc build issues.

camel-tools v1.5.1v1.5.1
owoowo·3y ago·March 17, 2023
GitHub

📋 Changes

  • Fixed an issue where pip tries to install camel-kenlm on Windows.
camel-tools v1.5.0v1.5.0
owoowo·3y ago·March 17, 2023
GitHub

📋 Changes

  • Improved BERT disambiguation accuracy.
  • Added support for Python 3.10.
camel-tools v1.4.1v1.4.1
owoowo·4y ago·May 30, 2022
GitHub

📋 Changes

  • Fixed an issue importing Unfactored BERT Disambiguator.
camel-tools v1.4.0v1.4.0
owoowo·4y ago·May 24, 2022
GitHub

📋 Changes

  • Added Unfactored BERT disambiguator component.
  • Bug fixes
camel-tools v1.3.1v1.3.1
owoowo·4y ago·March 29, 2022
GitHub

📋 Changes

  • Fixed issue with downloading package catalogue on Google Colab.
camel-tools v1.3.0v1.3.0
owoowo·4y ago·March 29, 2022
GitHub

📋 Changes

  • Removed support for Python 3.6 (only 3.7-3.9 are now supported).
  • Implemented a new package manager for fine-grained installation of datasets.
  • Fixed GPU support for NER and Sentiment Analysis components.
  • Added emoji charsets.
  • [simple_word_tokenize](https://camel-tools.readthedocs.io/en/latest/api/tokenizers/word.html) now splits emojis correctly and can optionally split digits.
camel-tools v1.2.0v1.2.0
owoowo·4y ago·August 12, 2021
GitHub

📋 Changes

  • Updated documentation and added more examples
  • Morphology improvements and bug fixes
camel-tools v1.1.0v1.1.0
owoowo·5y ago·November 30, 2020
GitHub

This release adds the `camel_data` command line tool for simplifying downloading data sets.

camel-tools v1.0.1v1.0.1
owoowo·5y ago·November 15, 2020
GitHub

This update fixes installation issues caused by kenlm dependency.

camel-tools v1.0.0v1.0.0
owoowo·5y ago·November 10, 2020
GitHub

First official release of CAMeL Tools. See [this post](https://camel-lab.github.io/camel_tools_updates/2020/09/08/camel-tools-release-v1.0.0.html) for more information on this release.

camel-tools v0.4.dev5v0.4.dev5Pre-release
owoowo·6y ago·June 21, 2020
GitHub

This patch fixes errors in the `almor-msa` builtin database. It also provides a new parameter to `camel_tools.tokenizers.morphological.MorphologicalTokenizer` called `diac` that determines wether output tokens are diacritized or not.

camel-tools v0.4.dev4v0.4.dev4Pre-release
owoowo·6y ago·January 18, 2020
GitHub

📋 Changes

  • Fixed issue where diacritics and other marks broke tokenization.
  • Fixed escaping of NOAN replacements in backoff analyses.
  • Updated import of Mapping abstract base class for Python 3.
camel-tools v0.4.dev3v0.4.dev3Pre-release
owoowo·6y ago·December 16, 2019
GitHub

📋 Changes

  • Fixed handling of sun letters in CALIMA Star.
  • Added ANY keyword as values to certain features in CALIMA Star's reinflector.
camel-tools v0.4.dev2v0.4.dev2Pre-release
owoowo·6y ago·September 12, 2019
GitHub

Fixed a bug in CALIMA Star reinflector that prevented certain POS variants from being generated.

camel-tools v0.4.dev1v0.4.dev1Pre-release
owoowo·7y ago·May 16, 2019
GitHub

📋 Changes

  • Fixed an issue in CalimaStarReinflector that prevented it from generating analyses.
  • Fixed missing defines in the almor-msa database.
camel-tools v0.4.dev0v0.4.dev0Pre-release
owoowo·7y ago·May 9, 2019
GitHub

📋 Changes

  • SimpleDisambiguator has been changed to a more fully featured MLEDisambiguator.
  • Added word-boundary and morphological tokenizers.
  • Added text normalization utilities.
  • Analyzer APIs and Disambiguator APIs have been changed to be more general and output more descriptive named tuple objects.
  • Almor-msa database now includes extensions.
  • CALIMA Star Analyzer now has in-built caching mode.
  • CharMapper objects are now callable (no need to use `map_string()` method).
camel-tools v0.3.dev0v0.3.dev0Pre-release
owoowo·7y ago·February 26, 2019
GitHub

📋 Changes

  • Implemented a simple disambiguation function using pos-lex frequencies.
  • `camel_calima_star` has a new option to use simple disambiguation in analysis mode.
  • `CalimaStarAnalyzer` now has a new method `analyze_words()` to analyze a list of words.
camel-tools v0.2.dev0v0.2.dev0Pre-release
owoowo·7y ago·December 18, 2018
GitHub