Ngram Collection
Repositories tagged with "ngram"
ngram2vec
zhezhaoa
"Four word embedding models implemented in Python. Supporting arbitrary context features"
albert_pytorch
lonePatient
"A Lite Bert For Self-Supervised Learning Language Representations"
ngrrram
wintermute-cell
"A TUI tool to help you type faster and learn new layouts. Includes a free cat."
ngram-type
ranelpadon
"Touch typing trainer using N-grams as data source, with options to customize the auto-generated lessons and specify the minimum typing performance needed. There are sound/color effects as well."
daguan_2019_rank9
lonePatient
"datagrand 2019 information extraction competition rank9"
colibri-core
proycon
"Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e. patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller``, which allows you to build, view, manipulate and query pattern models."
refinr
ChrisMuir
"Cluster and merge similar string values: an R implementation of Open Refine clustering algorithms"
ngram-language-model
joshualoehr
"Python implementation of an N-gram language model with Laplace smoothing and sentence generation."
stringdistance
vickumar1981
"A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more."
n-gram
words
"Get n-grams from text"
llm_corpus_quality
jiangnanboy
"Chinese corpus cleaning and quality evaluation for large model pre-training"
ngram
wrathematics
"Fast n-Gram Tokenization"
suggest
suggest-go
"Top-k Approximate String Matching."
SRILM
BitSpeech
"Mirror of SRILM"