Person Name Normalisation

Unifying person names in different notations

different sources write person names in different notations:

Firstname Secondname Lastname
Lastname, Firstname Secondname

also extracted are:

academic degrees (e.g. 'Dr.', 'Ph.D.')
name prefixes (e.g. 'van ter', 'von', 'De')

included: german, french, italian, dutch

missing: spanish, portuguese

missing: double Lastnames in Spanish

Installation

pip install personnamenorm

Usage

import personnamenorm as pnn
nameobj = pnn.namenorm('Dr. Dipl. Firstname Secondname von und zu Lastname')

results in

nameobj.name <dict>
{
    'raw': 'Dr. Dipl. Firstname von und zu Lastname',
    'firstname': ['Firstname','Secondname'],
    'lastname': ['Lastname'],
    'title': ['Dr.','Dipl.'],
    'prefix': ['von und zu']
}

nameobj.fullname <str>
'von und zu Lastname, Firstname Secondname'

nameobj.fullname_abbrev <str>
'von und zu Lastname, F S'

more examples can be found in this file on github.

Debug-mode

by default debug mode is off.

activating the debug mode

nameobj = pnn.namenorm(<str>, True)

returns additional information as logging message.

used annotation dictionary
annotated input string as list of tuples

Logging

logging is implemented

writes to std-out if logging IS NOT enabled before
writes to the existing logging handler if other logging IS enabled before

Test

see folder 'tests' on github.

python test_personnamenorm.py

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
NaiveBayes_on_names		NaiveBayes_on_names
personnamenorm		personnamenorm
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
coding.py		coding.py
setup.py		setup.py
test_personnamenorm.py		test_personnamenorm.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Person Name Normalisation

Unifying person names in different notations

Installation

Usage

results in

Debug-mode

Logging

Test

About

Releases 3

Packages

Contributors 2

Languages

License

klauslippert/person-name-normalisation

Folders and files

Latest commit

History

Repository files navigation

Person Name Normalisation

Unifying person names in different notations

Installation

Usage

results in

Debug-mode

Logging

Test

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 3

Packages 0

Contributors 2

Languages

Packages