You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In the example above, entities are extracted by character-based matching. However, in some cases, character-based matching may not be suitable (e.g., the element symbol `na` for sodium matches name). In such cases, token-based matching can be used as follows:
39
-
40
-
```py
41
-
import spacy
42
-
from spacy.tokens import DocBin
43
-
from spacy_partial_tagger.tokenizer import CharacterTokenizer
44
-
45
-
text ="Selegiline - induced postural hypotension in Parkinson's disease: a longitudinal study on the effects of drug withdrawal."
You could download `config.cfg`[here](https://github.com/tech-sketch/spacy-partial-tagger/blob/main/config.cfg).
85
-
Or you could setup your own. This library would train models through spaCy. If you are not familiar with spaCy's config file format, please check the [documentation](https://spacy.io/usage/training#config).
44
+
This library is implemented as [a trainable component](https://spacy.io/usage/layers-architectures#components) in spaCy,
45
+
so you could control the training setting via spaCy's configuration system.
46
+
We provide you the default configuration file [here](https://github.com/tech-sketch/spacy-partial-tagger/blob/main/config.cfg).
47
+
Or you could setup your own. If you are not familiar with spaCy's config file format, please check the [documentation](https://spacy.io/usage/training#config).
86
48
87
49
Don't forget to replace `/path/to/train.spacy` and `/path/to/dev.spacy` with your own.
0 commit comments