Releases: TimSchopf/KeyphraseVectorizers
v0.0.13
v0.0.12
Add spacy.Language as valid argument for 'spacy_pipeline'
This release allows an object returned by spacy.load to be reused
across many different KeyphraseVectorizer objects. It includes PR #19.
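A minimal sketch of what this reuse might look like, assuming the `keyphrase_vectorizers` package, spaCy, and the `en_core_web_sm` model are installed. The function name `build_vectorizers` and the model name are illustrative choices, not part of the release notes.

```python
def build_vectorizers(model_name="en_core_web_sm"):
    """Load one spaCy pipeline and share it between two vectorizers.

    Imports are deferred so that defining this function does not require
    the third-party packages; calling it does.
    """
    import spacy
    from keyphrase_vectorizers import KeyphraseCountVectorizer

    # Load the pipeline once ...
    nlp = spacy.load(model_name)

    # ... and pass the same spacy.Language object to several vectorizers
    # instead of letting each one load its own copy from a model name.
    default_vectorizer = KeyphraseCountVectorizer(spacy_pipeline=nlp)
    no_stopword_vectorizer = KeyphraseCountVectorizer(
        spacy_pipeline=nlp,
        stop_words=None,  # assumed to disable stop word removal
    )
    return default_vectorizer, no_stopword_vectorizer
```

Sharing one loaded pipeline this way should avoid repeated model loading when several vectorizers with different settings are needed in the same process.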
Custom POS-tagger feature added
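As a rough illustration of what a custom tagger could look like: the sketch below assumes the vectorizer accepts a callable that takes a list of strings in a parameter named `raw_documents` and returns a flat list of (word, POS tag) tuples. The lookup table `TOY_TAGS` and its tags are made up for the example; a real tagger would wrap an actual tagging model.

```python
# Hypothetical lookup table; unknown words fall back to "NN".
TOY_TAGS = {
    "the": "DT",
    "a": "DT",
    "deep": "JJ",
    "supervised": "JJ",
    "is": "VB",
    "works": "VB",
}


def custom_pos_tagger(raw_documents):
    """Return (word, POS tag) tuples for all whitespace tokens in all documents."""
    tagged = []
    for document in raw_documents:
        for token in document.split():
            word = token.strip(".,;:!?")  # drop surrounding punctuation
            if word:
                tagged.append((word, TOY_TAGS.get(word.lower(), "NN")))
    return tagged


# Assumed usage (requires the keyphrase_vectorizers package):
# vectorizer = KeyphraseCountVectorizer(custom_pos_tagger=custom_pos_tagger)
```

Whatever tagger is plugged in, the POS pattern used for keyphrase extraction would need to match the tag set that the tagger emits.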
Higher compatibility with available spaCy pipelines
Added 'stop_words=None' option
Add stopwords download automation
v0.0.7 Signed-off-by: Tim Schopf <tim.schopf@t-online.de>
Change "multiprocessing" parameter to "workers" parameter
Added min_df and max_df parameters, added support for documents longer than 1,000,000 characters, and limited the maximum keyphrase length to 8 words to prevent memory issues
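The effect of document-frequency thresholds and a length cap can be sketched in plain Python. This is not the library's implementation; it assumes the scikit-learn convention in which an integer threshold is an absolute document count and a float is a proportion of documents, and it simply drops keyphrases longer than the cap, which may differ from what the library does. The function name and the `max_words` parameter are hypothetical.

```python
from collections import Counter


def filter_by_document_frequency(doc_keyphrases, min_df=1, max_df=1.0, max_words=8):
    """Keep keyphrases whose document frequency lies within [min_df, max_df].

    doc_keyphrases: one list of keyphrase strings per document.
    """
    n_docs = len(doc_keyphrases)

    def to_count(threshold):
        # int -> absolute number of documents, float -> share of documents
        return threshold if isinstance(threshold, int) else threshold * n_docs

    low, high = to_count(min_df), to_count(max_df)

    # Count each keyphrase at most once per document.
    document_frequency = Counter()
    for phrases in doc_keyphrases:
        document_frequency.update(set(phrases))

    return sorted(
        phrase
        for phrase, count in document_frequency.items()
        if low <= count <= high and len(phrase.split()) <= max_words
    )


docs = [
    ["machine learning", "neural network"],
    ["machine learning", "data"],
    ["machine learning", "neural network", "graph"],
]
# "machine learning" occurs in all 3 documents (above max_df=0.9),
# "data" and "graph" occur in only 1 (below min_df=2).
print(filter_by_document_frequency(docs, min_df=2, max_df=0.9))  # ['neural network']
```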
Update SciPy requirements
Increased efficiency of spaCy pipeline for POS tagging
v0.0.4, increased efficiency of spaCy pipeline for POS tagging + adde…