-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
explosion spaCy Language-support Discussions
Sort by:
Latest activity
Categories, most helpful, and community links
Categories
Community links
๐ Language Support Discussions
Discuss the language data and training models for new languages
Pinned to Language Support
-
๐ Adding models for new languages master thread
enhancementFeature requests and improvements lang / allGlobal language data new languageAdding support for new languages to spaCy.
Discussions
-
You must be logged in to vote ๐ Ancient Greek language
feat / lemmatizerFeature: Rule-based and lookup lemmatization lang / grcAncient Greek language data and models new languageAdding support for new languages to spaCy. -
You must be logged in to vote ๐ Help on building Akkadian language model from scratch
feat / tokenizerFeature: Tokenizer feat / morphologizerFeature: Morphologizer new languageAdding support for new languages to spaCy. -
You must be logged in to vote ๐ Using Udify with spacy versus spacy's built-in transformer capabilities for custom language (Akkadian)
feat / transformerFeature: Transformer new languageAdding support for new languages to spaCy. -
You must be logged in to vote ๐ WordNet for English Transformer models
lang / enEnglish language data and models modelsIssues related to the statistical models -
You must be logged in to vote ๐ Hungarian language
lang / huHungarian language data and models -
You must be logged in to vote ๐ "|" not set as is_punct
lang / daDanish language data and models lang / nbNorwegian (Bokmรฅl) language data and models -
You must be logged in to vote ๐ Procedure on adding alpha support for Maltese
new languageAdding support for new languages to spaCy. -
You must be logged in to vote ๐ Custom tokenization based on the sentence structure
feat / tokenizerFeature: Tokenizer -
You must be logged in to vote ๐ Hindi Language support
lang / hiHindi language data and models v2spaCy v2.x -
You must be logged in to vote ๐ Spanish lemmatizer doesn't work for future tense verbs
lang / esSpanish language data and models feat / lemmatizerFeature: Rule-based and lookup lemmatization -
You must be logged in to vote ๐ Custom NER for other languages.
trainingTraining and updating models feat / nerFeature: Named Entity Recognizer -
You must be logged in to vote ๐ Add a custom language to spacy
enhancementFeature requests and improvements -
You must be logged in to vote ๐ Ukrainian model proposal
enhancementFeature requests and improvements lang / ukUkrainian language data and models new languageAdding support for new languages to spaCy. -
You must be logged in to vote ๐ Lemmatization is not working for Chinese language
lang / zhChinese language data and models feat / lemmatizerFeature: Rule-based and lookup lemmatization -
You must be logged in to vote ๐ Addition of "entity_ruler" in spacy 3.2 - Portuguese
lang / ptPortuguese language data and models feat / matcherFeature: Token, phrase and dependency matcher -
You must be logged in to vote ๐ Does spacy_hunspell support multiple languages?
third-partyThird-party packages and services -
You must be logged in to vote ๐ List of definition token.lemma, token.dep abbrev used in doc/token
docsDocumentation and website feat / docFeature: Doc, Span and Token objects -
You must be logged in to vote ๐ French model : tense of a verb is removed in version 3.x.
modelsIssues related to the statistical models lang / frFrench language data and models feat / morphologyFeature: Morphology and MorphAnalysis -
You must be logged in to vote ๐ Lemmatization for Indonesian Language support
lang / idIndonesian language data and models feat / lemmatizerFeature: Rule-based and lookup lemmatization -
You must be logged in to vote ๐ How to train lemmatizer? Are lookup tables required?
feat / lemmatizerFeature: Rule-based and lookup lemmatization -
You must be logged in to vote ๐ Wrapping independently trained Pytorch model with Thinc
๐ฎ thincspaCy's machine learning library Thinc -
You must be logged in to vote ๐ French and Italian noun chunks, contributors are welcomed!
lang / itItalian language data and models lang / frFrench language data and models -
You must be logged in to vote ๐ Training data for English language models
lang / enEnglish language data and models -
You must be logged in to vote ๐ German lemmatizer confused by capitalization
lang / deGerman language data and models feat / lemmatizerFeature: Rule-based and lookup lemmatization