=> to build a common structure for the source data
I. Heterogeneity
1. RICentities
Top 20 (in value) only-partner countries
Quantifying the issue
2. Mirror flow discrepancies
Quantifying the issue
II. Variability in time
number of reporting entities through time
number of flows through time
III. Reducing complexity
Group disaggregation: direct method
Group disaggregation: mirror method
"City/part of" entities
group 'city/part of' flows by reporting, year, expimp, country_part_of
sum the flow values
create a new flow to the partner "country_part_of"
delete the original flows
delete generated flows when duplicates of existing ones from source
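The five steps above can be sketched as follows. This is a minimal illustration, not the RICardo implementation: the function name `aggregate_city_part_of` and the dictionary keys (`reporting`, `partner`, `year`, `expimp`, `flow`, `country_part_of`, `quality_tag`) are assumptions about how a flow might be represented. Only the tag value `city_part_of_partner_aggregation` comes from the slides.

```python
from collections import defaultdict


def aggregate_city_part_of(flows):
    """Aggregate 'city/part of' flows onto their parent country.

    `flows` is a list of dicts. A flow whose partner is a city or a part
    of a country carries a non-empty "country_part_of" value
    (hypothetical field layout).
    """
    # Original 'city/part of' flows are dropped; all other flows are kept.
    kept = [f for f in flows if not f.get("country_part_of")]

    # Flows that already exist in the source, keyed the same way as the grouping.
    existing = {
        (f["reporting"], f["year"], f["expimp"], f["partner"]) for f in kept
    }

    # Group by reporting, year, expimp, country_part_of and sum the values.
    totals = defaultdict(float)
    for f in flows:
        parent = f.get("country_part_of")
        if parent:
            totals[(f["reporting"], f["year"], f["expimp"], parent)] += f["flow"]

    # Create one new flow per group, unless the source already has that flow.
    for key, value in totals.items():
        if key in existing:
            continue  # a generated flow never overrides a source flow
        reporting, year, expimp, partner = key
        kept.append({
            "reporting": reporting,
            "partner": partner,
            "year": year,
            "expimp": expimp,
            "flow": value,
            "quality_tag": "city_part_of_partner_aggregation",
        })
    return kept
```

Checking for duplicates before appending means a flow reported directly with the parent country always takes precedence over a value summed from its parts.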
Quality tags
Generated flows are tagged to indicate the method used:
group_desaggregation_direct_diffYear
group_desaggregation_mirror_diffYear
city_part_of_partner_aggregation
Colonial areas
Correlates of War dataset
To disaggregate colonial areas
for each colonial area
manually define the composition
by selecting the possible colonies listed in the COW dataset
variations in composition through time will be handled automatically
transform the colonial areas into groups
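One way the automatic handling of composition through time could work is sketched below. It assumes that each manually selected colony can be paired with one or more dependency periods derived from the COW dataset. The function name `composition_in_year`, the `(start_year, end_year)` tuple layout, and the placeholder colony names are all hypothetical; the actual COW file layout may differ.

```python
def composition_in_year(candidates, dependency_periods, year):
    """Return the members of a colonial area for a given year.

    candidates: colonies manually selected for this colonial area.
    dependency_periods: maps a colony to a list of (start_year, end_year)
        tuples, inclusive, during which it is considered a dependency
        (assumed to be derived from the COW dataset).
    """
    return sorted(
        colony
        for colony in candidates
        if any(
            start <= year <= end
            for start, end in dependency_periods.get(colony, [])
        )
    )


# Placeholder data, not historical facts.
candidates = ["Colony A", "Colony B", "Colony C"]
dependency_periods = {
    "Colony A": [(1850, 1900)],
    "Colony B": [(1880, 1938)],
}

members_1890 = composition_in_year(candidates, dependency_periods, 1890)
```

The manual step is then limited to listing the candidates once per colonial area. The yearly membership follows from the periods, and the resulting list can be treated like any other group.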
IV. Quality measures
V. Network analysis
RICardo World Trade Web, 1834-1938
Department of Economic History, Lund University, research seminar - 10/10/2018
Paul Girard @paulanomalie
Béatrice Dedinger