Package: NUSS 0.1.0

Oskar Kosch

NUSS: Mixed N-Grams and Unigram Sequence Segmentation

Segmentation of short text sequences - like hashtags - into the separated words sequence, done with the use of dictionary, which may be built on custom corpus of texts. Unigram dictionary is used to find most probable sequence, and n-grams approach is used to determine possible segmentation given the text corpus.

Authors:Oskar Kosch [aut, cre]

NUSS_0.1.0.tar.gz
NUSS_0.1.0.zip(r-4.5)NUSS_0.1.0.zip(r-4.4)NUSS_0.1.0.zip(r-4.3)
NUSS_0.1.0.tgz(r-4.5-x86_64)NUSS_0.1.0.tgz(r-4.5-arm64)NUSS_0.1.0.tgz(r-4.4-x86_64)NUSS_0.1.0.tgz(r-4.4-arm64)NUSS_0.1.0.tgz(r-4.3-x86_64)NUSS_0.1.0.tgz(r-4.3-arm64)
NUSS_0.1.0.tar.gz(r-4.5-noble)NUSS_0.1.0.tar.gz(r-4.4-noble)
NUSS_0.1.0.tgz(r-4.4-emscripten)NUSS_0.1.0.tgz(r-4.3-emscripten)
NUSS.pdf |NUSS.html
NUSS/json (API)

# Install 'NUSS' in R:
install.packages('NUSS', repos = c('https://theogrost.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/theogrost/nuss/issues

Uses libs:
  • c++– GNU Standard C++ Library v3
Datasets:

On CRAN:

Conda:

cpp

3.00 score 8 scripts 132 downloads 6 exports 46 dependencies

Last updated 8 months agofrom:2e104423fa. Checks:11 OK, 1 ERROR. Indexed: yes.

TargetResultLatest binary
Doc / VignettesOKMar 26 2025
R-4.5-win-x86_64OKMar 26 2025
R-4.5-mac-x86_64OKMar 26 2025
R-4.5-mac-aarch64OKMar 26 2025
R-4.5-linux-x86_64ERRORMar 26 2025
R-4.4-win-x86_64OKMar 26 2025
R-4.4-mac-x86_64OKMar 26 2025
R-4.4-mac-aarch64OKMar 26 2025
R-4.4-linux-x86_64OKMar 26 2025
R-4.3-win-x86_64OKMar 26 2025
R-4.3-mac-x86_64OKMar 26 2025
R-4.3-mac-aarch64OKMar 26 2025

Exports:igreplngrams_dictionaryngrams_segmentationnussunigram_dictionaryunigram_sequence_segmentation

Dependencies:BHclicpp11data.tabledigestdplyrdttenglishfansifloatgenericsgluelatticelexiconlgrlifecyclemagrittrMatrixMatrixExtramgsubmlapiNLPpillarpkgconfigpurrrqdapRegexR6RcppRcppArmadilloRhpcBLASctlrlangrsparseslamstringistringrsyuzhettext2vectextcleantextshapetibbletidyrtidyselectutf8vctrswithrzoo