You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
If you have a look at all the attributes that spaCy generates for their tokens then you can imagine that some of these features can be useful for machine learning pipelines. To name a few:
is_oov: is the token part of the vocabulary/does it have a vector?
is_stop: is the token a stopword?
lemma_: what is the lemma of the token
pos/tag coarse/fine-grained part of speech information
If you have a look at all the attributes that spaCy generates for their tokens then you can imagine that some of these features can be useful for machine learning pipelines. To name a few:
is_oov
: is the token part of the vocabulary/does it have a vector?is_stop
: is the token a stopword?lemma_
: what is the lemma of the tokenpos
/tag
coarse/fine-grained part of speech informationThese can all have a discrete representation and could be added in general to a Rasa pipeline.
The text was updated successfully, but these errors were encountered: