This module is used to skip Publications, which are already in Aleph.
Note
The module is using fuzzy lookup, see name_to_vector() and compare_names().
Convert name to the ASCII vector.
Example
>>> name_to_vector("ing. Franta Putšálek")
['putsalek', 'franta', 'ing']
Parameters: | name (str) – Name which will be vectorized. |
---|---|
Returns: | list – Vector created from name. |
Compare two names in complicated, but more error prone way.
Algorithm is using vector comparison.
Example
>>> compare_names("Franta Putšálek", "ing. Franta Putšálek")
100.0
>>> compare_names("F. Putšálek", "ing. Franta Putšálek")
50.0
Parameters: |
|
---|---|
Returns: | float – Percentage of the similarity. |
Filter publications based at data from Aleph.
Parameters: | publication (obj) – Publication instance. |
---|---|
Returns: | obj/None – None if the publication was found in Aleph or publication if not. |