reconciler.main
The main module, containing the functions intended for end-users.
reconcile()
Reconcile a DataFrame column
This is the main function of this package, it takes in a Pandas Series, that is, a column of a DataFrame, and sends it for reconciliation. In order to return more confident results, the parameter type_id corresponds to the type of item you're trying to reconcile against, that is, in case of a Wikidata item, it is the item's 'instance of' property. There is also a top_res argument, to filter the retrieved matches, this can be either an int, corresponding to the number of matches you want to retrieve for each reconciled item, or 'all', to return all matches. The property_mapping argument is an optional argument to denote particular triples to reconcile against, so you could, for example, reconcile against items of a particular type, that have a specific property equals to some specific value. The reconciliation_endpoint argument corresponds to the reconciliation service you're trying to access, if no value is given, it will default to the Wikidata reconciliation endpoint. See https://reconciliation-api.github.io/testbench/ for a list of available endpoints.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
column_to_reconcile |
Series
|
A pandas Series corresponding to the column to be reconciled. |
required |
type_id |
str
|
The item type to reconcile against, in case of a wikidata item, it corresponds to the item's 'instance of' QID. |
None
|
top_res |
int or str
|
The maximum number of matches to return for each reconciled item, defaults to one. To retrieve all matches, set it to 'all'. |
1
|
property_mapping |
dict
|
Property-column mapping of the items you want to reconcile against. For example, {"P17": df['country']} to reconcile against items that have the property country equals to the values in the column country. This is optional and defaults to None. |
None
|
reconciliation_endpoint |
str
|
The reconciliation endpoint, defaults to the Wikidata reconciliation endpoint. |
'https://wikidata.reconci.link/en/api'
|
Returns:
Name | Type | Description |
---|---|---|
DataFrame | A Pandas DataFrame with the reconciled results. |
Raises:
Type | Description |
---|---|
ValueError
|
top_res argument must be one of either 'all' or an integer. |
Source code in reconciler/main.py
6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 |
|