wiki:PopulationGeneration
Last modified 8 years ago Last modified on 23/11/2009 12:27:22

Population Generation

Useful information for the use of the population generation program.


Input files

There are two types of input files: Village level and National level data.


National Level data files

ageInRange.dat

Maps the 6-bin based "Age of Referent" distribution to the 14-bin based ages distribution (village data such as pAge.dat).

For the "age of referent" the following bins are available:

15 <= X <= 24
25 <= X <= 29
30 <= X <= 49
50 <= X <= 59
60 <= X <= 74
75 <= X

Considering the first range (15 <= X <= 24), it contains 2 bins (15-19 and 20-24) of the 14-bin distribution. The file ageInRange.dat thus contains the probability associated with each "sub-bin". Thus for the Auvergne example file:

20.5910 0.4090
1 1
4 0.2446 0.2582 0.3030 0.1943
2 0.4746 0.5254
3 0.4467 0.3598 0.193
5 0.3886 0.3161 0.2487 0.0466 0.0000

The first column specifies the number of "sub-bins" that each of the bins contains. The first row for example contains 2 sub-bins (15-19 and 20-24). The second row specifies that the distribution contains only 1 sub-bin (25-29). The third row specified that the bin third bin (30 <= X <= 49) contains 4 sub-bins: 30-34, 35-39, 40-44, 45-49.

ageMotCh.dat

Probability of being of age X given the number of children. Multicolumn table containing a probability distribution. Given an individual I with X number of children, each row in the table represents the probability of falling in one of the different age bins.

Example from Auvergne data:

AgeBin ranges X=1X=2X=3X=4X=5X>=6
10..14 0.0518400.008210.001820.000600.000410.00012
15..19 0.217260.103140.057400.036210.027770.01474
20..240.415650.381290.265870.198360.168070.12134
25..290.216420.371100.425760.388990.359540.31410
30..340.081340.119350.217950.307730.335420.37814
35..390.016830.016420.030180.065430.101550.15728
40..440.000640.000460.001010.002650.006930.01353
>45 0.000020.000020.000020.000040.000320.00075

ecartAge.dat

Distribution of the difference of age in a couple.

Probability distribution of difference of ages between partners. Given two partners A and B, P is the probability that Partner A age is +-X years of partner B

Field name Field type Comments/Description
X Posiive or Negative Integer Age difference
P Integer Probability

Example (from Auvergne population):

-160.001700
-150.000600
-140.001100
-130.001000
-120.001300
-110.001500
-100.002400
-90.002900

monopProb.dat

Probability of being head of a monoparental HH according to age


Village Level data files

refAge.dat

distribution of age for the head of HH

For an individual A, distributions for the probability of falling in an age class.

In this file two different distributions are present, the first regarding HH with size=1 and the second for all the other sizes.

The 6 first entries correspond to the probability of a person living with other people to fall into a specific age class (there are 6 age classes defined).

The last 6 entries correspond to the probability single person to fall into a specific age class (there are 6 age classes)

hhSize.dat

Distribution of the Household size

Assuming that there are N different types of households, the file contains the total number of households of each of the N types.

In the Auvergne example, it is assumed that N=6 and the hhSize.dat file contains:

980
1292
600
456
148
44

pAge.dat

Probability of an individual falling into one of the pre-defined age bins. The defined age-bins for this and NAge are 20 in total (thus, these files have 20 records) representing the ages:

0-4
5-9
10-14
15-19
20-24
25-29
30-34
35-39
40-44
45-49
50-54
55-59
60-64
65-69
70-74
75-79
80-84
85-89
90-94
>95

NAge.dat

Similarly to pAge.dat, but NAge.dat contains the actual amount of people falling into each range of age. The number of bins are also 20, using the same classification as in pAge.dat

p_child.dat

Probability of living with parents for the different 14 age bins. The file contains one column of data containing the probability values for each bin. An example of bins and their corresponding probabilities from Auvergne:

15-190.067477442
20-240.0329541
25-290.009415457
30-340.003138486
35-390.001569243
40-440.003138486
45-490
50-540
55-590.001569243
60-640
65-690
70-740
75-790
>800

Note: The file contains only the second column (the probabilities).

p_monoparental.dat

Probability of being head of a monoparental household according to age.

DataMun.dat

Some general data about the municipality

Field name Field type Comments/Description
Population size Integer -
Number of households Integer -

Output File

The output of the code is:

  1. population.dat: 100 realization of the generation. Different realizations are seprated by the number "1000". It is used for data analysis.

The structure of the file is file contains one household for each line with the data:

HH type HHsize age Range of Head Age of each other persons
0168
0118
2285 85
2286 86
363434 2 9 9 0
0181
0166
223437
  1. population_1real.dat: 1 single realization… to be used directly in the model.

Constants values used

Household types
Correspond to the code used by StartPopulation when it generates a population
  • 0: single
  • 1: monoparenthal family
  • 2: couple without children
  • 3: couple with children
  • 4: complex household (other households)