 |
"A place for everything, everything in its place."
— Benjamin Franklin |
Before You Begin
|
While data collection can
be rewarding, much work is required to be sure it is
done well so that the data you collect truly
reflect the impact of your program. To ensure that this
process is organized and credible, do the following
before you begin. Click on each for more.
-
-
-
-
-
-
-
|
|
| 1.
Identify Existing Data Source(s)
Before you decide how to collect the data, you will
need to decide where the data are going to come from
and whether you must obtain new information, or whether
you can use existing data (information that others have
collected, usually for other purposes). Sometimes, state
or local survey data or data that are part of another
project will include the information you need. In other
instances, program files or other records, public or
private, may contain the information.
Some data, like participants' attitudes toward the
program, will come directly from your program participants.
Other data, like the proportion of minority youth in
the community who smoke before and after the program,
may be available from state surveys, or other existing
data sources.
When appropriate, there are advantages to using existing
data rather than going to the expense and energy of
collecting new information. To determine whether it
is even possible to use existing data, consider each
of the following:
- Can the existing data be obtained for my evaluation
purposes?
- How well can the existing data answer my evaluation
questions?
- How well do the existing data represent my target
population?
- How available are the existing data?
If the data can be obtained, can answer your
evaluation questions, can represent your target population,
and are available, consider yourself fortunate. Now
you must decide whether it will be better to use these
existing data or collect new data. To make this decision,
ask yourself these three questions:1
- What data sources, existing or new, are likely to
provide the most accurate information for this target
population (see Determine Credibility
of Existing Data)?
- What data sources will allow the least costly and
most rapid data collection?
- Will the collection of information pose an undue
burden on the existing sources?
The answers to these three questions must be balanced.
Clearly, accuracy is desirable, but at what cost in
terms of money and time? And, how much burden are the
existing sources willing to tolerate? For example, suppose
the data you wanted could be obtained from files. Due
to privacy, only designated people may be able to go
through the files, and the task of data collection might
add greatly to the burden upon these individuals. Conversely,
if it were possible to use outside data collectors,
the presence of these persons as they are going through
the files may be disruptive to the setting in which
the files are located.
Top
|
|
2. Determine Credibility of Existing Data
Tip:
It's especially important to consider how credible
each source will be to the persons to whom you
intend to distribute the results of your evaluation
(e.g., the funders). |
Having the most accurate data sources possible is the most important
factor for any successful evaluation. But, what makes one source more
credible than another? Sometimes it's difficult to know. For example,
carefully maintained records may be more accurate than the memory of an
individual. On the other hand, overworked professionals may not attend to
detail when recording information, especially if the record systems were
not intended to provide evaluation data. Furthermore, different persons
may record differently, creating inconsistencies in the record system.
There are two ways to assess the accuracy of data: Reliability and Validity.
Data Reliability is a measure of the degree to which
the data can be reproduced, or replicated. For example, suppose two different
members of the evaluation team reviewed the congressional record to determine
a legislator's record of support for smoke-free environment legislation. If
they both produced the same counts regarding instances of support, the data
have reliability, specifically inter-rater reliability.
Data Validity is a measure of the degree to which the
data actually measure what they are intended to measure. If the records
of a vote indicate that a legislator voted against the smoke-free environment
legislation, and, when asked, she confirms that she voted against it, the
records have been validated. Conversely, if medical records indicate that
12 percent of pregnant Latina teens smoke, but 18 percent report smoking
when interviewed, the medical records data have not been validated.
Other Credibility Factors
In addition to validity and reliability, several other
factors contribute to the credibility of a data source.
One such factor is whether or not those who collected
the data have political, or other interests, that might
be likely to influence the data collection methods or
results.
Another factor is the reputation of the organization collecting the
information among tobacco prevention and control advocates. For example,
data from the Centers for Disease Control and Prevention (CDC) or the National
Institutes of Health are viewed as highly credible. Efforts are made to see
that their data are accurate, and both organizations are well known. In
contrast, data from medical records can be suspect as they are rarely validated,
and often collected for treatment purposes.
Top
|
|
3. Determine the Best Method for Collecting Data
|
Tip:
Using mixed methods increases the cross-checks
on different subsets of findings and generates
increased stakeholder confidence in the overall
findings. In addition, combining methods provides
a way to triangulate findings, which maximizes
the strengths and minimizes the limitations of
each method. Using mixed methods enables you to
validate your findings, enhance reliability, and
build a more thorough evaluation for improving
program effectiveness.2 |
Once you have determined whether your data will come
from new or existing sources, it is important to select
the data collection methods that will be the most appropriate
for achieving your evaluation objectives. If you are
using existing data sources, the main questions will
be "What information do I select from the source?"
and "How do I record it for your evaluation purposes?"
When you are collecting new data, there will be many
more decisions to make. The first of these is whether
to use qualitative methods, quantitative methods, or
both. In the Evaluation Planning section, we
introduced these two types of data collection. Let's
review.
|
Qualitative
methods are open-ended and allow the evaluator unlimited scope for
probing the feelings, beliefs, and impressions of the people participating
in the evaluation, and to do so without prejudicing participants with the
evaluator's own opinions. They also allow the evaluator to judge the intensity
of people's preference for one item or another. Such methods include:
individual interviews
observation
focus groups
|
Quantitative
methods are ways of gathering objective data that can be expressed in
numbers (e.g., a count of the people with whom a program had contact or
the percentage of change in a particular behavior by the target population).
Unlike the results produced by qualitative methods, when correctly gathered,
the results produced by quantitative methods can be used to draw conclusions
about the target population. Such methods include surveys using:
respondent-completed instruments (e.g., questionnaires)
administered by direct distribution or through the mail, web, or e-mail
-
interviewer-completed instruments
administered face-to-face or through telephone
surveys
|
|
Choosing the method best suited for your audience and objective(s) requires
an understanding of what each method can reveal about your program. Here are some
examples of when each method might be used in a program to promote smoke-free
environments.
| If... |
Then... |
Example |
| .you want to learn the full range of reasons for program
participants' choices, and the topic is not sensitive |
.conduct focus groups |
How participants decided which public information session to attend. |
| .you want to learn the full range of reasons for program participants'
choices and either (1) the topic may be considered sensitive, or (2) it is difficult to
gather program participants in one location. |
.conduct qualitative individual interviews |
Two examples include:-
Whether participants plan to vote
for or against a smoke-free environment, and why.
-
Participants are spread out over
a large rural area.
|
| .you already know the range of potential responses, and you want to
determine which response is the most common in the target population. |
.administer a questionnaire |
At which of the following locations they
would be most likely to attend a public information session;
or, how likely they would be to attend a public information
session-from not at all likely to very likely |
| .you want to conduct the survey quickly and the people in your target
population all have telephones and internet access. |
.administer a questionnaire by telephone or the web |
You want to determine whether legislators plan to support the Clean
Indoor Air bill coming up for a vote in two days. |
| .you want to conduct a survey, but some of the people
in your target population do not have telephones. |
.administer a questionnaire by mail, or by a face-to-face survey done
by going door-to-door, or to a public place (e.g., shopping mall) where members of the
population will be present |
You want to determine the number of persons in a predominantly Hispanic
low income community who would be willing to attend a smoking cessation program conducted
in Spanish. |
| .you want to conduct a survey, but some of the people in your target
population have low literacy or have difficulty reading (e.g., uneducated, visually
impaired, older) |
.administer a questionnaire by telephone or face-to-face
by going door-to-door or to a public place (e.g., intersection) where members of the
population will be present |
You want to determine the effectiveness of your program in reaching older
adults with diabetes, many of whom are visually-impaired. |
| .you want to conduct a survey and the topic may be considered sensitive |
.administer a questionnaire by telephone, or a
respondent-completed questionnaire to ensure anonymity |
Asking underage minority teens about smoking habits. |
Tip:
While the method you choose to collect data depends on the source, it
also depends on the evaluation type and design (see Evaluation Planning:
Evaluation Type and Design). For example, formative evaluations often
call for qualitative information, collected in the form of conversations.
On the other hand, outcome evaluations usually call for quantitative
information, collected through surveys conducted before and after exposure
to the program. |
In choosing a data collection method, the CDC recommends
that you consider the following:3
-
The purpose of the evaluation
Which method seems most appropriate for your purpose and the questions that you want to answer?
-
The users of the evaluation
Will the method allow you to gather information that can be analyzed and presented in
a way that will be seen as credible by your intended audience? Will they want standardized
quantitative information from a data source such as the Adult Tobacco Survey, or descriptive,
narrative information from "real people", or both?
-
The respondents from whom you will collect the data
Where and how can respondents best be reached? What is culturally appropriate? For example,
is conducting a phone interview or a more personal, face-to-face interview more appropriate
for certain population groups?
-
The resources available (time, money, volunteers, travel expenses,
supplies)
Which method(s) can you afford and manage well? What is feasible? Will your evaluation be
completed in time for the next legislative session or prior to the end of the school year?
Consider your own abilities and time. Do you have an evaluation background or will you have
to hire an evaluator? Do program funds and relevant policies allow you to hire external
evaluators?
-
The degree of intrusiveness-interruptions to the program or participants
Will the method disrupt the program or be seen as intrusive by the respondents? Also consider
issues of confidentiality if the information that you are seeking is sensitive.
-
Type of information
Do you want representative information that applies to all participants (standardized
information such as that from a survey, structured interview, or observation checklist
that will be comparable nationally and across states)? Or, do you want to examine the
range and diversity of experiences, or tell an in-depth story of particular people or
programs (e.g., descriptive data as from a case study)?
-
The advantages and disadvantages of each method
What are the key strengths and weaknesses in each?
Consider issues such as time and respondent burden,
cost, necessary infrastructure, access to sites
and records, and overall level of complexity. What
is the most appropriate for your evaluation needs?
Top
|
|
4. Select or Create Data Collection Instruments
|
An important part of data collection is using the same
data collection instrument to get information from all
of your information sources. If you are gathering information
from files or other records, rather than from people,
the instrument may be a form designed so that you can
record all of the information needed from each file.
This is called a data abstraction form. In
the case of interviews and focus groups, the instrument
may be a general outline of topics to be covered. This
is known as an interview/focus group guide.
In the case of surveys, the instrument is usually composed
of a series of carefully-worded questions or statements
for the respondent to answer. Often the answers are
selected from answer choices that are provided on the
instrument.
Creating a good data collection instrument can be difficult and time-consuming.
For that reason, before you create a new instrument you should see if a suitable
instrument already exists.
|
Whether you decide to use an existing instrument or
to develop your own, the instrument you use should meet
the following criteria.4
It should include questions that can be used to measure the concepts addressed
or affected by your program (e.g., knowledge of tobacco prevention methods).
It should be appropriate for your participants in terms of age or developmental
level, language, and ease of use. Questions should be written in simple and
easy-to-understand language. These characteristics can be checked by conducting focus
groups of participants or pilot testing the instruments.
It should respect and reflect the participants' cultural backgrounds. The
definitions, concepts, and items in the instrument should be relevant to the
participants' community and experience.
-
It should be possible to complete
in a reasonable timeframe. Again, pilot testing
can reveal these issues.
Creating Your Own Instrument
If you choose to create your own instrument, there are many evaluation, assessment,
and testing web sites and sources that can help you get started. Here are just a few.
Ensuring the Effectiveness of Instruments and Procedures
Even after you have selected or created the instruments,
you are not yet ready to begin collecting data. Before
you can actually begin collecting evaluation information,
you must pilot test your instruments
and procedures. This means trying out the instruments
on a small number of people from your target population,
or from a group of participants from a similar project
offered by your program or a program with a similar
purpose.
The pilot test will determine whether the instruments:
Obtain all the information you need for the evaluation,
Are of the correct reading level for the target population,
Are understood by the target population,
Are not too lengthy or difficult for the respondents,
Are culturally appropriate for the target population, and
Are effective overall.
In most cases, you will need to make at least some changes the instruments
as a result of the pilot test. If you do, it is better to have learned on a small
number of people than to have wasted your evaluation resources collecting data with
instruments that were not appropriate. Occasionally, you will not need to make any
changes. If you use a sub-sample of your target population for the pretest and the
instruments do not have to be changed, you can combine the data from this sub-sample
into the actual evaluation data.
The kinds of information that can be obtained from a pilot test include:
How long it takes to complete interviews or fill out each
instrument
Whether each of the questions was clear
Whether the instruments can be completed by participants
without assistance from staff
Whether the necessary information can be collected in the
established time frame
Whether instruments and procedures are culturally appropriate
Whether the notification procedures (letters, informed consent,
and the like) are easily implemented and executed
For more about pilot testing your data collection instruments,
see chapter 7
of The Program Manager's Guide to Evaluation.
Top
|
|
5. Determine Instrument Validity and Reliability
The instrument you choose to collect data must be able
to collect the information you need to answer your questions
about your program. Just as with data, the instrument
must be valid and reliable.
Instrument Reliability is a measure of the degree to which an
individual's responses are reproducible, or consistent, both over time and within the
instrument. For example, suppose a question at the beginning of a data collection
instrument asked, "Do you support increased tobacco taxes?" and 67% of respondents said,
"Yes". Suppose that another question, toward the middle of the same instrument asked,
"Are you in favor of increasing taxes on tobacco?" and 66% of respondents said, "Yes".
This would suggest that the data were reliable. On the other hand, suppose data from a
statewide web-based survey during the first week of April indicated that 65% of restaurant
owners had a high level of knowledge about clean indoor air ordinances. If data from the
same survey, conducted during the second week of April, indicated that only 35% of restaurant
owners had a high level of knowledge about these ordinances, the data would not reliable
(i.e., reproducible).
Instrument Validity is a measure of the degree to which the instrument
actually measures what it is intended to measure. If the instrument is supposed to measure
knowledge about tobacco prevention and control, then we would expect people who attended a
tobacco use prevention course to get higher scores than people who did not attend such a
course. If they do, this is evidence that the instrument is valid. If they do not, this
suggests that the instrument is not valid.
There are three main types of validity: face validity,
content validity, and construct validity.
-
Face validity
is the degree to which the instrument appears
to measure what it is intended to measure. Face
validity is important for good rapport between interviewer
(questioner) and respondent. If the interviewer
informs the respondent that the survey is about
smoking around children, but the respondent believes
it is about something else, the respondent may distrust
the evaluator's intent and may refuse to answer
or may not answer truthfully. Assess face validity
through pilot tests (e.g., focus groups or personal
interviews with a subgroup of the target population)
and by having subject-matter experts review the
questionnaire.
-
Content validity
is the degree to which all relevant aspects of the
topic being addressed are covered by the survey
instrument. Assess content validity by having subject-matter
experts review the content of the instrument.
-
Construct validity
is the degree to which the survey instrument accurately
measures what it is supposed to measure. The easiest
way to establish construct validity is to compare
the results obtained using your instrument with
results from a similar instrument that has already
been shown to be valid. You can do this by administering
both instruments to a group of people who are similar
to, or selected from, the target population. If
the results of your instrument correlate with those
of the existing instrument, it demonstrates the
validity of your instrument.
Top
|
|
6: Determine How Much Data to Collect
How much data you collect will depend upon the balance between your needs and
the resources available. If you need to be able to draw conclusions about a number
of sub-populations (e.g., various racial or ethnic groups) you will need more
information than if you are only interested in the population as a whole. On the
other hand, how many of the race/ethnic subpopulations you can draw conclusions about
may be limited by the availability of funds, or staff who can speak the language.
There are two ways to think about the quantity of information you will need to collect:
How many questions do you need to ask?
The questions you ask should be restricted to those
that are necessary to answer your evaluation questions
with an acceptable level of detail. While it can be
exciting to get lots of information, this places a burden
upon the people who are providing the answers. In general,
the more questions you ask, the less people will be
willing to take the time to provide complete information.
If you want a good response from your participants,
keep it short and simple. In addition, long interviews
and questionnaires also take longer for data processing
and analysis. This will add to the cost of your evaluation.
For more about how to write survey questions see Basics
of Developing Questionnaires at http://www.mapnp.org/library/evaluatn/questnrs.htm
How many people do you need information from?
How many people you need to ask (i.e., the sample size) depends, again, on the level
of detail you are interested in, as well as the types of comparisons you want to make.
You will also need to determine the level at which you want to be able to draw conclusions
(e.g., state, county, region, congressional district). Counties often appreciate and want
county-level estimates; however, this usually means larger sample sizes and more expense.
Your study must have a certain minimum quantity of data to
detect a specified change produced by your program.
In general, detecting small amounts of change requires
larger sample sizes. For example, detecting a 5% increase
would require a larger sample size than detecting a
10% increase. If you use tobacco data sources such as
the Youth Tobacco Survey, the sample size has already
been determined.3
If you are designing your own evaluation tool, you
will need the help of a statistician to determine an
adequate sample size. The three pieces of
information the statistician will want to know are:
What is your outcome measure? In particular, are you
interested in averages (like average knowledge about the adverse effects of tobacco) or in
the percentages of people in designated categories (like the percent who support Smoke-free
Environments)?
How much does this outcome measure vary? For example,
are most people pretty similar in their level of knowledge, or does it vary a lot from person
to person? [NOTE: If your outcome measure has been used before, the statistician will LOVE
you if you are able to find a value called the "standard deviation" of the measure.]
How much difference do you want to be able to detect? If
your program increases average knowledge by 1 point, is that enough for you to care about or
does it need to be at least 5 points?
Top
|
|
7. Establish Procedures for Collecting Data
Once you decide what type of instrument you will use
to collect evaluation information, you must establish
a set of procedures to ensure that this information
will be collected in a consistent and systematic manner,
especially if you are going to be making comparisons
among groups. For this reason, standard data collection
procedures should be established, and data collectors
should be trained in using these procedures.
Everyone involved in collecting evaluation information
must be trained in these procedures:
-
What instrument will be used to collect the information? Each
data collector should be trained in the use of the instruments. If there is more than one instrument,
then the order in which they are to be used should be made clear.
-
When will you collect the information? The timeframe
during which the data are to be collected must be clearly specified. There is some information
that may need to be collected before the program starts and other information that needs to be
collected at the end of the program. Having the timeframe spelled out will ensure that the
information is collected as scheduled.
-
Where will you collect the information?
As discussed earlier, you will need to determine
the sources from which the information will be collected.
In some instances you may be using program records,
while in other instances you may be relying on participants'
coming to a specific location to complete the survey
instrument or to participate in a group discussion
about their experiences. When the data collected
are sensitive, the location is especially important,
and should allow privacy. You will need to determine
where you will collect the information and convey
this to program participants.
-
Who will collect the information? This responsibility must be
clearly specified or you will risk having some information collection activities fall through the
cracks. In some situations you will need to be sure that information collectors meet certain criteria.
For example, they may need to be familiar with the culture or the language of the individuals they
are interviewing or observing. If the questionnaire is being administered by interviewers (for example,
residents hired and trained to conduct interviews), those persons must be properly trained to
administer the questionnaire. Training will ensure that the interviewers are familiar
with the survey instrument.
Top
---------------
1. Source: U.S. Department of Housing
and Urban Development, Office of Policy Development
and Research. (1997). How do you get the information
you need for your evaluation? In A guide to evaluating
crime control of programs in public housing. Washington,
DC: Prepared for the U.S. Department of Housing and
Urban Development by KRA Corporation (pp. 6.1-6.15).
2. Source: Bond, S.L., Boyd, S.E., &
Rapp, K.A. (1997). Taking stock: A practical guide
to evaluating your own programs. Chapel Hill, NC:
Horizon Research.
[Referenced in: Centers for Disease Control and Prevention.
(2001). Gather credible evidence. In Introduction
to program evaluation for comprehensive tobacco control
programs (pp. 49-56). Atlanta, GA: the Author.
3. Source: Centers for Disease Control
and Prevention. (2001). Gather credible evidence. In
Introduction to program evaluation for comprehensive
tobacco control programs (pp. 49-56). Atlanta,
GA: the Author.
4. Source: Child Outcomes Research and
Evaluation Team. (n.d.). How do you get the information
you need for your evaluation? In The program manager's
guide to evaluation. Retrieved April 12, 2004 from
the Administration for Children and Families web site.
|
|
 |