DEFINITION
The cross-sectional
study, also called the prevalence study or naturalistic sampling, has the objective of determination of prevalence of risk
factors and prevalence of disease at a point in time (calendar time or an event like birth or death). Disease and exposure are ascertained simultaneously. A cross-sectional study can be descriptive or analytic
or both. It may be done once or may be repeated. Individual-based studies collect
information on individuals. Group-based (ecologic) studies collect aggregate information about groups of individuals. Cross-sectional
studies are used in community diagnosis, preliminary study of disease etiology, assessment of health status, disease surveillance,
public health planning, and program evaluation. Cross-sectional studies have the advantages of simplicity, and rapid execution
to provide rapid answers. Their disadvantages are: inability to study etiology because the time sequence between exposure
and outcome is unknown, inability to study diseases with low prevalence, high respondent bias, poor documentation of confounding
factors, and over-representation of diseases of long duration.
DESIGN AND DATA
COLLECTION
The study may
be based on the whole population or a sample. It may be based on individual sampling units or groups of individuals. The study
sample is divided into 4 groups: a = exposed cases, b = unexposed cases, c = exposed noncases, and d = unexposed noncases.
The total sample size is n = a + b + c + d; n is the only quantity fixed before data collection. The marginal totals are n1
= a+b, n0 = b+d, m1 = a+b, and m0
= c+d. None of the marginal totals is fixed. Sampling methods can be simple random
sampling, cluster sampling, systematic sampling, and multi-stage sampling. Sample size is determined using specific formulas.
Cases are identified from clinical examinations, interviews, or clinical records. Data is collected by clinical examination,
questionnaires, personal interview, and review of clinical records.
STATISTICAL PARAMETERS
The following
descriptive statistics can be computed from a cross-sectional study: mean, standard deviation, median, percentile, quartiles,
ratios, proportions, prevalence of the risk factor, n1/n, and the prevalence of the disease, m1/n. The
following analytic statistics can be computed: correlation coefficient, regression coefficient, odds ratio, and rate difference.
The prevalence difference is computed as p1 – p0 = a/n1 - b/n0. The prevalence
ratio is computed as p1/p0 = (a/n1) / (b/n0). The prevalence odds ratio is computed
as POR = {p1(1 - p1)} / { p0(1 - p0)}.
ECOLOGIC DESIGN
Ecological
studies, exploratory or analytic, study aggregate and not individual information. Groups commonly used are schools, factories,
and countries. Exposure is measured as an overall group index. Outcome is measured as rates, proportions, and means. The correlation and regression coefficients are used as effect measures. The advantages
of ecological studies are: low cost, convenience, easy analysis, and interpretation. They have several weaknesses. They generate
but cannot test hypotheses. They cannot be used in definitive etiological research. They suffer from ecological fallacy (relation
at the aggregate is not true at the individual level). They lack data to control for confounding. Data is often inaccurate
or incomplete. Collinearity is a common problem.
HEALTH SURVEYS
Surveys
involve more subjects than the usual epidemiological sample are used for measurement of health and disease, assessment of
needs, assessment service utilization and care. They may be population or sample surveys. Planning of surveys includes: literature
survey, stating objectives, identifying and prioritizing the problem, formulating a hypothesis, defining the population, defining
the sampling frame, determining sample size and sampling method, training study personnel, considering logistics (approvals,
manpower, materials and equipment. finance, transport, communication, and accommodation), preparing and pre-testing the study
questionnaire. Surveys may be cross sectional or longitudinal. The household is the usual sampling unit. Sampling may be simple
random sampling, systematic sampling, stratified sampling, cluster sampling, or multistage sampling. Existing data may be
used or new data may be collected using a questionnaire (postal, telephone, diaries, and interview), physical examinations,
direct observation, and laboratory investigations. Structure and contents of the
survey report is determined by potential readers. The report is used to communicate information and also apply for funding.
|