Population Generation
Useful information for the use of the population generation program.
Table of Contents
Input files
There are two types of input files: Village level and National level data.
National Level data files
ageInRange.dat
Maps the 6bin based "Age of Referent" distribution to the 14bin based ages distribution (village data such as pAge.dat).
For the "age of referent" the following bins are available:
15 <= X <= 24 
25 <= X <= 29 
30 <= X <= 49 
50 <= X <= 59 
60 <= X <= 74 
75 <= X 
Considering the first range (15 <= X <= 24), it contains 2 bins (1519 and 2024) of the 14bin distribution. The file ageInRange.dat thus contains the probability associated with each "subbin". Thus for the Auvergne example file:
2  0.5910  0.4090  
1  1  
4  0.2446  0.2582  0.3030  0.1943  
2  0.4746  0.5254  
3  0.4467  0.3598  0.193  
5  0.3886  0.3161  0.2487  0.0466  0.0000 
The first column specifies the number of "subbins" that each of the bins contains. The first row for example contains 2 subbins (1519 and 2024). The second row specifies that the distribution contains only 1 subbin (2529). The third row specified that the bin third bin (30 <= X <= 49) contains 4 subbins: 3034, 3539, 4044, 4549.
ageMotCh.dat
Probability of being of age X given the number of children. Multicolumn table containing a probability distribution. Given an individual I with X number of children, each row in the table represents the probability of falling in one of the different age bins.
Example from Auvergne data:
AgeBin ranges  X=1  X=2  X=3  X=4  X=5  X>=6 
10..14  0.051840  0.00821  0.00182  0.00060  0.00041  0.00012 
15..19  0.21726  0.10314  0.05740  0.03621  0.02777  0.01474 
20..24  0.41565  0.38129  0.26587  0.19836  0.16807  0.12134 
25..29  0.21642  0.37110  0.42576  0.38899  0.35954  0.31410 
30..34  0.08134  0.11935  0.21795  0.30773  0.33542  0.37814 
35..39  0.01683  0.01642  0.03018  0.06543  0.10155  0.15728 
40..44  0.00064  0.00046  0.00101  0.00265  0.00693  0.01353 
>45  0.00002  0.00002  0.00002  0.00004  0.00032  0.00075 
ecartAge.dat
Distribution of the difference of age in a couple.
Probability distribution of difference of ages between partners. Given two partners A and B, P is the probability that Partner A age is +X years of partner B
Field name Field type Comments/Description X Posiive or Negative Integer Age difference P Integer Probability
Example (from Auvergne population):
16  0.001700 
15  0.000600 
14  0.001100 
13  0.001000 
12  0.001300 
11  0.001500 
10  0.002400 
9  0.002900 
monopProb.dat
Probability of being head of a monoparental HH according to age
Village Level data files
refAge.dat
distribution of age for the head of HH
For an individual A, distributions for the probability of falling in an age class.
In this file two different distributions are present, the first regarding HH with size=1 and the second for all the other sizes.
The 6 first entries correspond to the probability of a person living with other people to fall into a specific age class (there are 6 age classes defined).
The last 6 entries correspond to the probability single person to fall into a specific age class (there are 6 age classes)
hhSize.dat
Distribution of the Household size
Assuming that there are N different types of households, the file contains the total number of households of each of the N types.
In the Auvergne example, it is assumed that N=6 and the hhSize.dat file contains:
980 
1292 
600 
456 
148 
44 
pAge.dat
Probability of an individual falling into one of the predefined age bins. The defined agebins for this and NAge are 20 in total (thus, these files have 20 records) representing the ages:
04 
59 
1014 
1519 
2024 
2529 
3034 
3539 
4044 
4549 
5054 
5559 
6064 
6569 
7074 
7579 
8084 
8589 
9094 
>95 
NAge.dat
Similarly to pAge.dat, but NAge.dat contains the actual amount of people falling into each range of age. The number of bins are also 20, using the same classification as in pAge.dat
p_child.dat
Probability of living with parents for the different 14 age bins. The file contains one column of data containing the probability values for each bin. An example of bins and their corresponding probabilities from Auvergne:
1519  0.067477442 
2024  0.0329541 
2529  0.009415457 
3034  0.003138486 
3539  0.001569243 
4044  0.003138486 
4549  0 
5054  0 
5559  0.001569243 
6064  0 
6569  0 
7074  0 
7579  0 
>80  0 
Note: The file contains only the second column (the probabilities).
p_monoparental.dat
Probability of being head of a monoparental household according to age.
DataMun.dat
Some general data about the municipality
Field name  Field type  Comments/Description 
Population size  Integer   
Number of households  Integer   
Output File
The output of the code is:
 population.dat: 100 realization of the generation. Different realizations are seprated by the number "1000". It is used for data analysis.
The structure of the file is file contains one household for each line with the data:
HH type HHsize age Range of Head Age of each other persons 0 1 68 0 1 18 2 2 85 85 2 2 86 86 3 6 34 34 2 9 9 0 0 1 81 0 1 66 2 2 34 37
 population_1real.dat: 1 single realizationâ€¦ to be used directly in the model.
Constants values used
 Household types

Correspond to the code used by StartPopulation when it generates a population
 0: single
 1: monoparenthal family
 2: couple without children
 3: couple with children
 4: complex household (other households)