Author: Marshall D. Sundberg

Department of Plant Biology

502 Life Sciences

Louisiana State University

Baton Rouge, LA 70803-1705

ph (504) 388-8563 FAX (504) 388-8429

btmarsh@lsuvm.sncc.lsu.edu or sundberg@life.jsc.nasa.gov

Title: Hypothesis testing and the scientific method

Keywords: hypothesis testing, sampling, replication, descriptive statistics

Credits: This activity was developed with support of NSF Grant BBS-9155920 to MDS.

Abstract: Cooperative group learning is used in this non-threatening introduction to hypothesis testing and the scientific method. Students are introduced to the problem of which automobile (or clothing, bookbags, umbrellas, etc.) dealership might be most profitable at a site near campus. An initial survey and vote by the class determines the hypothesis to be tested - the most common make. Further discussion generates a list of possible ways to test the hypothesis. Sampling is encouraged as the method of choice which leads to a discussion of anticipated problems and limitations of various sampling techniques. When a concensus on approach is reached, the class is divided into research teams and given a specified time period to collect data and return to pool data by the end of the class period. The homework assignment is to calculate descriptive statistics for the four most frequently sampled models in the pooled data. Results and limitations are discussed at the beginning of the following period. Finally, the procedures used in this activity provide a framework for introducing the technique of concept mapping as both a research and study skill.

FAIR USAGE

This exercise may be freely copied and disseminated for all non-commercial educational activities provided that appropriate credit is given to the author, this source (BIOLAB BBS), and NSF grant USE-9156094. Its use is explictly permitted in laboratory manuals and compiled class handouts sold at or below the cost of printing or duplication.

INSTRUCTOR'S GUIDE

BACKGROUND INFORMATION

Investigative laboratories will probably be very different for your students from those in any previous science course most of them have taken. It may be very frustrating for them, at first, because they will not have specific directions to follow or lists of terms and concepts to learn. Instead, you will have to force them to become self-reliant in making observations, forming questions, proposing hypotheses, in particular null hypotheses, designing tests of their hypotheses and analyzing and interpreting the results of their tests. THEY ARE EXPECTED TO DO THE WORK. IT IS YOUR JOB TO GUIDE THEM, BUT NOT TO TELL THEM WHAT TO DO!!

OBJECTIVES

There are two primary objectives of this activity. First is to introduce and reinforce the concept of hypothesis testing as an extension of common sense problem solving. The second is to introduce the technique of concept mapping. There are three secondary objectives. First, is to introduce sampling techniques and the concept of replication. Second is to introduce basic descriptive statistics as a means of summarizing data. Third is to develop a sense of teamwork and cooperation among small research teams of students.

CONCEPTS AND RELATED CONCEPTS ACCURACY

When making measurements or collecting data one hopes to obtain estimates which approach the true value of the unknown. An accurate measurement is one which does this. It is important to note, however, that accuracy is not the same thing as precision (see PRECISION). An accurate measurement is not necessarily precise.

BIAS

A bias is a preconception which can influence your ability to make observations or to interpret data. Try to recognize what your biases might be when designing an experiment, then design your experiment in such a way as to avoid as many biases as possible.

CONCEPT MAPPING

Concept mapping is a special learning tool based on the supposition that knowing is a process of making mental maps of ideas and relationships. This tool allows students to visually record the relationships between concepts. For example, the concept of a wooden pencil can be mapped as follows:

The task for the student is first to identify the key concepts (wood, "lead", graphite, clay, etc.), and the way these concepts are related (called the proposition, which in this case includes the verbs made of, composed of, surrounding, etc.) Long or wordy propositions are hard to remember and often unclear. Most of your time will be spent trying to decide on just the right word that best describes the relationship between the concepts. The lines that are broken, and point across the map are called cross-links. Cross-links represent insight. Because the concepts are represented visually, students can see all of the concepts simultaneously. This creates new opportunities for generating new knowledge. Such cross-links represent a new depth of understanding. Look for cross-links once you have completed a C-map, since what you might think is an insight could very well be a solid relationship.

Concept mapping is hierarchical. This means that the most general concept (the superordinate), is placed at the top of the map. All other concepts that make up or are related to the superordinate are placed below. Ideally, all C-maps (concept map(s)) are grounded with examples at its bottom-most level (cedar, carpenters glue). As a rule, concepts are placed in solid lined figures and examples are placed in broken line figures.

Interestingly enough, concept maps are based on the personal knowledge of individual students. The implications of this are important. Different students reading the same material will have different concept maps. The differences can be slight or radical, but none could be considered wrong. After all, they represent the way that a person understands the given concepts a the given time. On the other hand, these personal maps do change over time, especially when the person obtains new information.

How to make a C-map:

1. From the reading, extract the key concepts.

2. Rank order these concepts from general to specific.

3. Relate each of these concepts in a branching pattern with short propositions. Be sure to start with the superordinate at the top and end with examples at the bottom.

4. Look for crosslink insights.

5. Be sure to include examples - i.e. go through reading and underline the key words, then list this on separate slips of paper.

How to evaluate a C-map: Ask yourself the following questions:

1. Is there a valid hierarchy?

2. Are there valid relationships?

3. Are there valid examples?

4. Are the concepts in a branching pattern or is it stringy?

5. Are there opportunities for crosslinks?

CONTROL

When designing an experiment you want something to compare your results against - this is the control. For instance, if you are interested in the effect of a particular color of light on the rate of photosynthesis, you would design an experiment using particular colored light filters. As controls you would used white light (all colors combined) and complete dark (no light). By comparing your experimental results against the controls, you would be able to determine if a particular color had any effect. When designing your experiment, be sure to keep all variables constant, except the one you are interested in examining. For instance, in an effect of light quality (color) experiment, all treatments, including the control, would be run at the same temperature, with the same amount of nutrients, etc.

ERROR

This term has a common usage which is equally applicable in biology -- making a mistake. In science, however, there are other, more specific definitions. When analyzing the results of an experiment, we are interested in the experimental error - that error due to the specific design of the experiment or the equipment used. For instance, if we are using a thermometer to record temperatures during an experiment, and the instrument was out of calibration, our results will be consistently in error. Another example of experimental error would arise if different members of a group make successive readings of an instrument and they do not read consistently. For instance, one person may misread the meter by not viewing it direct on while another person may round off readings. A third person may interpolate (estimate a fraction between two numbers on the scale) to one additional significant figure, while a fourth may interpolate two additional significant figures. When the data from this group is pooled, there will be an inherent experimental error due to the different instrument reading techniques of the individual group members. Additional errors commonly encountered in biology have to do with statistical testing.

We use statistical tests to help us determine if observed results are sufficiently close to the expected results to be accepted or sufficiently different to be rejected. A type I error is when we reject a null hypothesis (see HYPOTHESIS) when it is true. The probability of making a type I error is the level of significance of a statistical test. A type II error is when we accept a null hypothesis when, in fact, the alternative is true. We can avoid a type I error by making our statistical test more rigorous - but of course, that increases the likelihood of making a type II error!

EXPERIMENTAL DESIGN

Experimental design deals with selection of variables to be studied and the choice of a sampling program. It does not deal with experimental techniques used to gather data. The most commonly used experimental design is the two-sample comparison. To do this you select two situations in which all conditions but one are the same. One situation, usually more "normal" serves as the control and is the basis for comparison. The other situation is the experimental in which you vary the factor of interest. By comparing data obtained from the experimental situation with the control, you can make some conclusions about the effect of the variable you altered on the organism being studied. Be careful, though, to consider other possible explanations. For instance, you may do an experiment growing plants under two different lighting conditions. The control may be normal daylight, and the experimental may have additional supplemental lighting from a floodlight. Plants in the latter situation may grow more rapidly. This may seem to support the hypothesis that the greater the amount of light, the faster a plant will grow. The supplemental floodlight certainly will provide more light intensity - but it also provides heat. Is it the light, the heat, or both that made the experimental plants grow faster?

HYPOTHESIS

An hypothesis is a tentative statement or assumption which is made in order to be tested. To formulate a hypothesis is to make a testable prediction about the relationship between variables. A hypothesis is usually stated before any sensible investigation or experiment is performed because the hypothesis provides guidance to an investigator about the data to collect. A hypothesis is an expression of what the investigator thinks will be the effect of the manipulated (independent) variable on the responding (dependent) variable. A workable hypothesis is stated in such a way that , upon testing, its credibility can be established or refuted. Hypotheses can usually be formed as an "if...then" statement.

Because it is not possible to prove a hypothesis scientifically, in the same way a mathematician can prove a theorem, scientists frequently phrase their hypothesis as a null hypothesis (H:O), in opposition to an alternative hypothesis (H:A). A null hypothesis is simply a statement of "no difference" between the experimental and control. If there is a difference, we must reject the null hypothesis and accept the alternative. The concept of hypothesis testing is basic to all of science; it is also the most misunderstood by the public. The word "prove" should not exist in the scientist's vocabulary, an infinite number of examples are needed to prove a hypothesis. No matter how much evidence we gather to support a specific hypothesis, we can never be certain that the same data would not equally support any number of unknown alternative hypotheses. On the other hand, only one piece of evidence is necessary to disprove, and thus reject, a null hypothesis. If we demonstrate that the null hypothesis is invalid, then the alternative must be true - although, in fact, this simply means we must rexamine the alternative to define and test additional null hypotheses - ever refining our understanding of the concept or process involved.

PRECISION

Results which are predictable and repeatable are precise, but precise results may or may not be accurate. For instance, if your measuring instrument is not calibrated properly you may get exactly the same result on ten successive measurements, which would be extremely precise, but your results would be inaccurate to the degree that the instrument was out of calibration. (see ACCURACY).

RANDOM

Random means without definite aim, direction, intention, or method; of equal probabilities. In other words, without bias. Random selection is a key assumption of statistical analysis, and as such, is critically important in designing scientific experiments and analyzing data. If one cannot be reasonably certain that samples were obtained in a randomized manner, the results obtained will be questionable.

In designing experiments, treatments should be randomized. For instance, if three pots of plants serve as a control and an additional three plants receive a fertilizer treatment, the three treatment pots should not all be next to the window while the control plants are lined up away from the window. To do so biases the experiment because the treatment plants would receive more light in addition to fertilizer. If the pots were set out in a random manner, however, it would be just as likely for a control plant as an experimental plant to be next to the window.

In sampling, it is important to decide where, how much and how many samples should be made ahead of time so as not to bias samples, eg. subconsciously choosing to sample where the largest bacterial colonies are located on a Petri dish. One way to do this is to lay out grid coordinates and then choose sample coordinates with a random numbers table.

In a table of random numbers, each integer from 0-9 has an equal and independent chance of occurring anywhere in the table. Likewise each two digit number 00-99 is randomly distributed as are three digit numbers 000-999, etc. To use the table, decide ahead of time: 1) what direction you want to read (either horizontally, vertically or diagonally, 2) how many digits are appropriate, and 3) where you want to begin in the table (the table should be entered randomly so as not to begin at the same point everytime you need to sample). For example, if you lay out grid coordinates 15 X 15, you want to use 2 digit numbers and use the first 2-digit number between 00 and 15 as your first coordinate and the second number between 00 and 15 as your second coordinate.

SAMPLING

A scientist can rarely collect all of the data about which she wants to draw conclusions. For example, it may be of interest to draw conclusions about the body weight of all 18 year old males in the United States. The only way to make statements about body weight of these men, with 100 % confidence is to weigh each individual - an impossible task. Instead, only some of the total number of 18 year old males are weighed and we infer from the results the total weights of all the individuals of interest. The men who are actually weighed are a statistical sample of the population.

The key to having a sample accurately represent the population is to obtain a random sample. Random sampling implies that each individual in the population has an equal chance of being selects as part of the sample, that is, there is no bias for or against any individuals being sampled. If samples are taken at random from a population, valid conclusions may be drawn about that population from a small sample - with a known chance of error. We can control the amount of error by varying the size of the sample. In general, the smaller the sample, the larger the chance of error; the larger the sample, the smaller the chance of error.

A random number table is useful in obtaining random samples (see RANDOM and APPENDIX 1). Decide ahead of time how you will use the table, eg. read the last three digits down a row or the first three digits across line, etc., then enter the table at a random point and begin to sample. A randomly selected number could represent an individual in a row of plantings, number of feet from a starting point, number of measurements taken of a dimension, etc.

A single measurement is not adequate to draw conclusions about a population. This is because it is not possible to know how reliably a character was measured. Repeated measurements may vary greatly, especially if made by different people. Therefore a series of repeated measurements, or replicate measurements, should be taken. From the collection of replicates, the mean and standard deviation will provide an estimate for the population as a whole. There are techniques to determine how many replicates are needed to achieve a certain level of reliability. As a general rule, three is a minimum number.

STATISTICS

There are three reasons statistics are important to scientists: first, they allow data to be quantitatively described and summarized; second, they allow generalized conclusions to be drawn based on relatively small sets of data; third, the differences and relationships between sets of data can be objectively analyzed.

VARIABLE

A variable is any factor in a situation that may change or vary. Investigators in science and other disciplines try to determine what variables influence the behavior of a system by manipulating one variable, called the independent variable and measuring its effect on another variable, called the dependent variable. As this is done, all other variables are held constant. If there is a change in only one variable and an effect is produced on another variable, then you can conclude that the effect has been brought about by the changes in the manipulated (independent) variable. If more than one variable changes, there can be no certainly at all about which of the changing variables causes the effect on the responding variable.

In a scientific investigation, measurements of the variables are made; however, you, the investigator, must decide how to measure each variable. An operational definition of a variable is a definition you determine for the purpose of measuring the variable during an investigation. Thus, different operational definitions of the same variable may be used by different investigators. For instance, in a growth rate experiment, you may decide to measure length of an organ at daily intervals while in another group, they may decide to measure the same dimension at six hour intervals. In many cases it is important that you choose an appropriate operational variable in order to observe a phenomenon. If the time interval you choose is too long, for example, you may miss an important stage of a processes you were looking for.

FACTUAL BACKGROUND

Descriptive Statistics

Living things are, by their nature, variable; a single individual, population, community etc. will not be exactly the same as any other. In order to describe any group of living things, statistics, descriptive measures derived from sample data, must be computed. One of the most common descriptive statistics is the mean, or average. If represents a datum (e.g. the height of a plant in cm) the mean of a sample of several plants is where (X bar) is the symbol of the mean, (sum of X's) is the total of all the plants' heights in the sample and (sample size) is the number of plants sampled.

e.g. The heights of five plants were 10.1, 11.4, 11.7, 12.1 and 13.3 cm respectively. Thus the average height of the plants in this sample was 11.72 cm. If these 5 plants were randomly selected from a larger group of plants, we may assume that the average for the larger population is also approximated by 11.72 cm.

As a general rule, the mean should not be considered more accurate than 1 significant figure beyond the accuracy of the original data. In the above example each plant was measured to within 1/10 cm (0.1). Therefore, the mean may be rounded to the nearest 1/100 (0.01).

Calculating a mean only gives a partial description of the data - the average value. It is usually necessary to also describe how much variability there is around the mean. The following two sets of data have the same mean: 1, 6, 11, 16, 21, and 10, 11, 11, 11, 12. You will probably agree that the mean is not enough to meaningfully describe both sets; some measure of variability is also necessary. Two measures of variability are commonly used, standard deviation(s) and standard error (S.E.). In each case the first step is to calculate the sum of squares (SS) - the sum of squared deviations from the mean. SS = _(x-)2 For the plant height data above, SS = (10.1 - 11.72)2 + (11.4 - 11.72)2 +(11.7 - 11.72)2 + (12.1 - 11.72)2 + (13.3 - 11.72)2 = 5.37 cm2. A simpler way to calculate SS is to use the formula SS = _X2 - (_X)2/n. Again using the data above:

_X = 10.1 + 11.4 + 11.7 + 12.1 + 13.3 = 58.6 cm

_X2 = (10.1)2 + (11.4)2 + (11.7)2 +(12.1)2 + (13.3)2 = 692.16 cm2

n = 5

SS = 692.16 cm2 - (58.6 cm)2/5

= 692.16 cm2 - 686.79 cm2

= 5.37 cm2

Standard deviation = _(SS/DF) where DF (degrees of freedom) = n - 1

In our example s = _(5.37 cm2/4) = 1.16 cm

The standard error (S.E.) is calculated by the formula: S.E. = s/_n, which for our example is 1.16 cm/_5 = 0.52 cm. Our data may now be accurately expressed either by stating

the mean and standard deviation ( = 11.72 cm, s = 1.16 cm) or by the mean and standard error

( = 11.72 _ 0.52 cm).

As you can see by the equation for standard error, the magnitude of this variation is

inversely proportional to the sample size. That is, the larger the sample size, the more precise the estimate of the population mean (i.e. large samples generally give better results than small samples!)

Comparing Two Means

It is frequently of interest to compare the means of two samples to draw conclusions about similarities or differences. For instance, are the results of a particular experimental treatment significantly different from the control? In some cases the difference may be very large and obvious, but in other cases the means and variances may be quite similar and an objective method is required to determine the degree of difference or similarity.

Student's t-test is commonly used to compare two means where the null hypothesis, Ho:1 = 2, is that the means are the same.

The t-statistic is calculated as: where the numerator is the absolute value of the mean of experiment 1 less the mean of experiment 2. The denominator is the standard error of the difference between the means and is calculated as: where n1 and n2 are the two sample sizes and sp2 is: where SS is the sum of squares and DF is the degrees of freedom for the two data sets.

The value obtained for "t" is now compared to the table of critical values of Students'-t. Values smaller than the critical value indicate a high probability that the null hypothesis, is supported. If the calculated t-value is greater than or equal to the critical value, then the null hypothesis is rejected and the alternate hypothesis, is accepted. The appropriate degrees of freedom, DF, is DF1 + DF2 (DF1 = N1-1; DF2 = N2-1; DF = N1 + N2-2).

Critical Values of Students'-t

DF

_ = 0.10

_ = 0.05

DF

_ = 0.10

_ = 0.05

1

6.31

12.71

11

1.80

2.20

2

2.92

4.31

12

1.78

2.18

3

2.35

3.18

13

1.77

2.16

4

2.13

2.78

14

1.76

2.14

5

2.01

2.57

15

1.75

2.13

6

1.94

2.45

16

1.75

2.12

7

1.89

2.36

17

1.74

2.11

8

1.86

2.31

18

1.73

2.10

9

1.83

2.26

19

1.73

2.09

10

1.81

2.23

20

1.72

2.09

(Note: About, the level of significance. Rejecting the null hypothesis, when it is in fact true, is called a Type I error. The probability of making a Type I error is called the level of significance; _ = 0.05 means the probability of a false rejection is only 5%. Unfortunately, by reducing the probability of making a Type I error, you are increasing the chance of accepting the null hypothesis when it is not true- a Type II error!).

Randomly Assorted Digits

00-04

05-09

10-14

15-19

20-24

25-29

30-34

35-39

40-44

45-49

00

54463

22662

56905

70639

79365

67382

29085

23248

47058

08186

01

15389

85205

11885

39226

22490

90669

96325

56891

60933

26927

02

85941

40756

82414

02015

13858

78060

16269

62773

10389

15345

03

61149

69440

11286

88218

58928

03638

52862

72095

33451

77455

04

05219

81619

10651

67079

92511

59888

84502

20002

83463

75577

05

41417

89326

87719

92294

46614

50948

64886

05217

97365

30976

06

28357

94070

20652

35774

16249

75019

21145

62481

47286

67305

07

17783

00015

10806

83091

91530

36466

39981

62660

49177

75779

08

40950

84820

29881

85966

62800

70629

84740

96488

77379

90279

09

82995

64157

66164

41180

10089

73417

78258

32940

88629

37231

10

96754

17676

55659

44015

47361

37032

86679

22885

53249

27083

11

34357

88040

53364

71726

45690

66334

60332

48888

90600

71113

12

06318

37403

49927

57715

50423

67372

63116

73947

21505

87018

13

62111

52820

07243

79931

89292

84767

85693

54440

22278

11551

14

47534

09243

67879

00554

23410

12740

02540

73443

32949

13491

15

98614

75993

84460

62846

59844

14922

48730

70158

48167

34770

16

24856

03648

44898

09351

98795

18644

39465

53342

90638

44105

17

96887

12479

80621

66223

86085

78285

02432

21434

42846

94771

18

90801

21472

42815

77408

37390

76766

52615

41943

30268

18106

19

55165

89522

83666

36028

28420

70219

81369

89872

47366

41067

20

75884

03033

84318

95108

75305

64620

91318

87894

45375

85439

21

16777

38804

56882

42859

21460

43910

01175

03164

81378

40620

22

46230

43877

80207

88844

89380

32992

94380

60137

98656

59637

23

42902

66892

46134

01432

94710

23474

20423

19774

60609

43449

24

81007

00333

39693

28039

10154

95425

2230

97017

31782

41550

25

68089

01122

51111

72373

06902

74676

96199

87236

41273

21546

26

20411

67081

89950

16944

93054

87687

96693

58680

77045

33848

27

58212

13160

06468

15718

82627

76999

05999

12696

96739

69700

28

70577

42866

24969

61210

76049

67699

42054

05437

93758

03283

29

94522

75628

71659

62038

79643

79169

44741

32404

39038

13163

30

42626

86819

85651

88678

17401

03252

99547

48769

17918

62880

31

16051

33769

57194

16752

54450

19031

58580

55781

54132

60631

32

08244

27647

33851

44705

94211

46716

11738

17805

95374

72655

33

59497

04392

09419

89964

51211

04894

72882

82171

21896

83864

34

97155

13428

40293

00998

58434

01412

69124

39435

59085

8285

35

89409

66162

95769

47420

20792

61527

20441

61203

11859

41567

36

45476

8882

56109

96597

25930

66790

65706

66669

53634

22557

37

89300

68799

50741

30329

11685

23166

05400

65578

48708

03887

38

50051

95137

91631

66315

97428

12275

24816

45153

71710

33258

39

361753

85178

31310

89642

98364

02306

24617

09609

83942

22716

40

79152

53829

77250

20190

56535

18760

69942

77448

33278

48805

41

44560

38750

82635

56540

64900

42912

13953

79149

18711

68618

42

38328

83378

63369

71381

39564

05615

42451

64559

97507

65747

43

46939

38689

58625

08324

30459

85863

20781

09284

23666

91777

44

83544

86141

15707

96256

23068

04372

08467

89496

93848

55376

45

91621

00881

04900

54224

46177

55309

17852

27491

89415

23466

46

91869

67126

04151

03795

59077

01848

12630

98375

52068

60427

47

55751

62515

21108

80830

02263

29303

37204

96926

30506

97111

48

85156

87689

95493

88842

00664

55017

55539

17771

69448

01117

49

07521

56898

12236

60277

39402

62315

12239

07105

11844

44517

PROCEDURE

Prelaboratory.

Start out with a hypothetical scenario that will force students to think about solving problems and introduce them to sampling and some basic statistics at the same time. For instance, relate a scenario similar to the following. You have to supplement your meager GTA income and are thinking about opening up a car dealership (shoe store, clothing shop, sunglass shop, etc.) catering to the college community. You want some data to determine what company (lines, brands, items, etc.) you should carry - and you need this information by the end of the hour in order to secure your small business loan.

To get a rough idea you could survey the students in your recitation - make a list of how many drive: Chevys, Fords, Hundais, Hondas, Nissans, Toyotas, etc. Ask the class to make a suggestion, based on the list, of what would be the best choice. Their consensus response is the class hypothesis.

Now, how to test the hypothesis? Allow groups to brainstorm among themselves for about 5 minutes to generate a list of possibilities. Then go group to group, listing possible suggestions on the board. The constricting factors are the time and the fact that you're keying in on the university community. A variety of survey methods will probably be suggested, including the library, telephone calls to theDepartment of Motor Vehicles, etc.

What we are looking for is to have each group go out to some street location on campus and tally the cars going by, or to a parking lot to tally the vehicles parked. Get the class to agree on a sampling technique.

Some things to consider are: Where to sample?, How long to sample?, Cars in both directions or a single direction, How to tally results, eg. which makes?, To sample every car, every other, every fifth, etc.? Once the class agrees on the data collection procedure, send them off, by groups, to collect their data. Set a time limit for them to return to class with their results. Twenty minutes is reasonable to get to a location, collect data for about 5 minutes, and return.

While the students are out collecting data, prepare a table on the board, or an overhead, to summarize class data. An example follows.

Make

Group 1

Group 2

Group 3

Group 4

Chevy

Ford

Honda

Hundai

Nissan

Toyota

others

As student groups return, they should place their data into the appropriate column of the summary table.

Each student should make a complete copy of the summary data to work on before lab.

The student assignment, in preparation for lab, is to calculate two basic summary statistics, mean and standard deviation, using the summary class data table.

Laboratory Activity 1.

During the recitation, students were presented with a problem and asked to find a solution. In actuality they were asked to put the scientific method into practice, without being told that this is what they were doing. During the first part of the laboratory we want them to "discover" the scientific method. Of course, they have been taught the "steps" in their previous schooling and could probably list and describe the steps for you if you asked them to. But teaching it in this way tends to reinforce the misconception that the scientific method is some rigid system that only scientists use. In actuality, it is simply a formalization of every day, common sense problem solving, and it is precisely this point that we want to make.

We want to introduce the concept of the scientific method by reviewing the activities of the recitation session. This will also provide an opportunity to formally introduce concept mapping as a technique. Begin by recalling for your students the problem you asked them to help you solve, eg. which kind of car dealership to open. Place this in a box, high on the board (it will become the principle concept in the map the class develops). Now recall that your preliminary class survey suggested that a particular make would probably be best. This is the hypothesis which can be placed in a second box, under the first, as shown below.

Best Dealership is probably Toyota

Next you decided on how the class could test if indeed these preliminary observations were valid for the broader university community. First, you had to decide on what criteria would be the best to check, given the constraints you set for the problem. Some of the possibilities may have been: most licenses sold, profits of existing dealerships, most vehicles sold, highest profit/car, etc. Next, you had to decide on a sampling method, which itself had several parameters. Once data was collected it had to be summarized and analyzed, and finally a conclusion had to be reached concerning whether or not the data supported the original hypothesis. All of this can be summarized in a concept map such as the one presented below. Work this through on the board or an overhead with your class. Once a consensus map has been agreed upon, have your students copy it in the space provided on their laboratory handout. This will serve as an example to them as they construct their own maps during future activities.

You are now ready to make a very important point to your students. The problem solving approach they used to determine the best business for you to invest in was really application of the scientific method. In other words, the scientific method is not some mystical technique employed only by scientists locked away in a laboratory somewhere (and out of touch with the real world). Instead, it is simply a formalization of common sense problem solving that we all do every day! To illustrate this, first make a concept map of the scientific method (some of your students will surely have memorized the steps in some previous course so they will be able to list them, in order, for you.

Now have your students go back to their concept map of how the problem was solved and draw an enclosing circle around every box that is part of observation. Then do the same for every hypotheses. Repeat for all the boxes that are experimental, and finally those that are conclusions.

Activity 2.

Students' homework assignment between recitation and laboratory was to calculate the summary statistics on their class summary data sheets. Now we want to introduce them to the computer program for statistical analysis, and simultaneously give them an opportunity to check their own work.

There should be one computer workstation for each group of six students per table. Have the students group around their workstation. Specific instructions at this point depends on the hardware and software available. Enter the statistics package and choose "Mean and Standard Deviation". The computer will prompt for data entries. For instance, the first student should enter the data collected for "Chevy". The printout will give mean and standard deviation which will allow students to check their calculated values. They should indicate with a check on their table if their calculated value was correct. If incorrect, they should strike out the incorrect answer with a single line, then write in the correct value.

Have a second student do the same for the second model sampled. A third student does the same for the third model, etc., until each student has entered data and calculated values for one data set.

This sheet will be turned in by students for credit.

HINTS AND SUGGESTIONS

The following ideas frequently come up during the class discussion of how to gather data

Passive sampling - check library for information (eg. sales and/or tax records)

- phone state Department of Motor Vehicles

- phone Auto dealers in area

Active sampling - passing cars on a street corner

- which one?

- one direction or both?

- for how long?

- parking lots

- which one?

- every car or a sub sampling?

MINIMUM MATERIALS NEEDED

Computer with statistical package is handy, but not absolutely necessary.

EXPECTED STUDENT OUTCOMES

Some student groups will probably have data significantly different from that of other groups. Further discussion at this point usually reveals differences in sampling techniques that were not standardized previously. This also allows for discussion of the legitimacy of pooling data in this experiment. Students now will be aware of these kinds of problems and should consider them in designing future experiments.

An example of actual class data:

Make

Group

1

Group

2

Group

3

Group

4

Sum

Mean

St.Dev.

S.E.

TIME FRAME

The activity was originally designed for a one hour recitation followed by the first part of a three hour laboratory period. The actual data collection could be designed as a homework activity for the group so that the problem could be set up during the last 20-30 minutes of a lecture and then the data collated and analyzed during the subsequent laboratory period.

STUDENT PROTOCOL

INTRODUCTION

Investigative laboratories will probably be very different for you from those in any previous science course you have taken. It may be very frustrating, at first, because you will not have specific directions to follow or lists of terms and concepts to memorize. Instead, you and your research team will have to become self-reliant in making observations, forming questions, proposing hypotheses, in particular null hypotheses, designing tests of you hypotheses and analyzing and interpreting the results of your tests.

YOU ARE EXPECTED TO DO THE WORK.

OBJECTIVES

1. You should be able to construct a concept map to show relationships between ideas.

(note: concept mapping is THE BEST way to study and learn new information, eg., after each lecture, map your notes and then go back and connect the day's map to what you learned previously in the course.)

2. You should be able to solve problems in a rational way (i.e., using the scientific

method).

3. You should understand the importance of repeating observations and be able to calculate

and interpret summary statistics.

SPECIFIC INSTRUCTIONS

During this recitation you are asked to collect some data, as a group, then combine the results from your group with that of other groups. Use the table outline below as a guide - add additional columns and/or rows as needed.

Make

Group

1

Group

2

Group

3

Group

4

Sum

Mean

St.Dev.

S.E.

During class, you will decide on how best to collect data to test the hypothesis the class decides upon. Each group, including yours, will have to collect data, then bring it back to class to share with other members of your section. You will need a personal copy of all the group data from your lab section. Therefore, as groups return from collecting data, they will fill in a class summary form, similar to the one above. You should copy this information onto your form.

Your assignment, to be completed BEFORE lab, is to calculate the summary statistics mean and standard error for the four most common car makes. The procedures for doing this are listed under "Statistics" in your resource manual. The first thing you will have to do is add up the total numbers for each row. This is filled in under the column "Sum". The sum divided by the number of samples taken (4, i.e., one for each group) is the mean, or average. The procedures for calculating standard deviation (St. Dev.) and standard error (S.E.) are described in the resource manual.

Your instructor will introduce you to the technique of concept mapping in the laboratory.

QUIZ

NAME: _________________________________________________________

Section #: ___

1. In 1903 a federal commission measured the flow of the Colorado River to determine the

volume of water flow that could be divided by treaty among seven western states and Mexico. Based on that survey, 15 million acre feet of water was partitioned. Currently there are water rights disputes in the West because the average annual water flow of the Colorado River is actually about 14 million acre feet. What basic experimental principle should the government have used, at the turn of this century, to obtain a more accurate measure of volume flow? Describe in one or two paragraphs how you would have determined the amount of water that should have been allocated.