Science as a Process

What is Science?

Science Asks Three Basic Questions

  1. What's there?
    The geologist examines rocks in the Grand Canyon, the nuclear physicist bombarding atoms and sees what hapens, the tropical biologist describes a previously undiscovered spider in Costa Rica, the paleontologist digging in promising strata looking for plant fossils, are all seeking to find out, 'What's there?'
  2. How does it work?
    A geologist comparing the effects of time on moon rocks to the effects of time on earth rocks, the nuclear physicist observing the behavior of particles, the tropical biologist studies the mating activites of the spider, and the paleontologist studying the reproductive structures of the plant fossils are asking, 'How does it work?'
  3. How did it come to be this way?
    Each of these scientists tries to reconstruct the histories of their objects of study. Whether these objects are rocks, elementary particles, tropical organisms, or fossils, scientists are asking, 'How did it come to be this way?'

Science is frequently only presented as the 'Scientific Method.'  While there is nothing wrong with the scientific method, it is not the only means through which science is done.

Introduction to the Scientific Method

Step 1: Observation and/or asking a question

  • Science occurs when someone is curious about a natual phenomenum and asks questions about it
  • The first scientists only sought to explain the natural world around them (why does it rain, why are days longer in summer than in winter, etc...

Step 2: The Hypothesis

  • A hypothesis is a tentative answer or solution to the question or problem at hand using natual means to explain the question
  • A good hypothesis is structured as an 'if-then' statement (i.e. if 'A' happens then 'B' will occur)
  • Hypotheses are general causes or explanations of phenomena
  • Hypotheses reflect past experience with similar questions or situations
  • Multiple hypotheses should be proposed whenever possible
  • Hypotheses must be testable
    • If you cannot disprove it (or at least come up with a way to invalidate the hypothesis), it is not a valid hypothesis
  • Hypotheses can be eliminated but never confirmed with absolute certainty - they can never be proved, only disproved
Step 3: Experimental Design
  • Once predictions are made, they can be tested by experiments.
  • If test results contradict predictions, then the hypotheses are called into question and explanations may be sought.
  • Sometimes experiments are conducted incorrectly and are at fault.
  • If the results confirm the predictions, then the hypotheses are considered likely to be correct but might still be wrong and are subject to further testing.
  • The experimental control is a technique for dealing with observational error.
  • This technique uses the contrast between multiple samples (or observations) under differing conditions, to see what varies or what remains the same.

Step 4: Data Collection and Interpretation

  • Care must be taken with your experimental data, but after you have collected, you must perform an analysis to see if your data support or refute your hypothesis
  • If the data support your hypothesis, this is helpful, but it does not mean your hypothesis is valid - further experimentation and posible modification of your initial experiments must be done to further support your hypothesis
  • If the data do not support your hypothesis, your hypothesis may be invalid. You should repeat the expriemnts for further varification, but if that is done and the data still refute your hypothesis, then you should revise your hypothesis and then test it

Example: Why do Giraffes have long necks?


Theories & Models


Inductive vs. Deductive Reasoning


Experimental Design

We are concerned with the analysis of data generated from an experiment. It is wise to take time and effort to organize the experiment properly to ensure that the right type of data, and enough of it, is available to answer the questions of interest as clearly and efficiently as possible. This process is called experimental design .

The specific questions that the experiment is intended to answer must be clearly identified before carrying out the experiment. We should also attempt to identify known or expected sources of variability in the experimental units since one of the main aims of a designed experiment is to reduce the effect of these sources of variability on the answers to questions of interest. That is, we design the experiment in order to improve the precision of our answers.
( Definition taken from Valerie J. Easton and John H. McColl's Statistics Glossary v1.1 )

Control

Suppose a farmer wishes to evaluate a new fertilizer. She uses the new fertilizer on one field of crops (A), while using her current fertilizer on another field of crops (B). The irrigation system on field A has recently been repaired and provides adequate water to all of the crops, while the system on field B will not be repaired until next season. She concludes that the new fertilizer is far superior.

The problem with this experiment is that the farmer has neglected to control for the effect of the differences in irrigation. This leads to experimental bias , the favoring of certain outcomes over others. To avoid this bias, the farmer should have tested the new fertilizer in identical conditions to the control group, which did not receive the treatment. Without controlling for outside variables, the farmer cannot conclude that it was the effect of the fertilizer, and not the irrigation system, that produced a better yield of crops.

Another type of bias that is most apparent in medical experiments is the placebo effect . Since many patients are confident that a treatment will positively affect them, they react to a control treatment which actually has no physical affect at all, such as a sugar pill. For this reason, it is important to include control, or placebo, groups in medical experiments to evaluate the difference between the placebo effect and the actual effect of the treatment.

The simple existence of placebo groups is sometimes not sufficient for avoiding bias in experiments. If members of the placebo group have any knowledge (or suspicion) that they are not being given an actual treatment, then the effect of the treatment cannot be accurately assessed. For this reason, double-blind experiments are generally preferable. In this case, neither the experimenters nor the subjects are aware of the subjects' group status. This eliminates the possibility that the experimenters will treat the placebo group differently from the treatment group, further reducing experimental bias.

Randomization

Because it is generally extremely difficult for experimenters to eliminate bias using only their expert judgment, the use of randomization in experiments is common practice. In a randomized experimental design, objects or individuals are randomly assigned (by chance) to an experimental group. Using randomization is the most reliable method of creating homogeneous treatment groups, without involving any potential biases or judgments. There are several variations of randomized experimental designs, two of which are briefly discussed below. In a completely randomized design , objects or subjects are assigned to groups completely at random. One standard method for assigning subjects to treatment groups is to label each subject, then use a table of random numbers to select from the labelled subjects. This may also be accomplished using a computer. In MINITAB, the 'SAMPLE' command will select a random sample of a specified size from a list of objects or numbers. If an experimenter is aware of specific differences among groups of subjects or objects within an experimental group, he or she may prefer a randomized block design to a completely randomized design. In a block design, experimental subjects are first divided into homogeneous blocks before they are randomly assigned to a treatment group. If, for instance, an experimenter had reason to believe that age might be a significant factor in the effect of a given medication, he might choose to first divide the experimental subjects into age groups, such as under 30 years old, 30-60 years old, and over 60 years old. Then, within each age level, individuals would be assigned to treatment groups using a completely randomized design. In a block design, both control and randomization are considered.

Example

A researcher is carrying out a study of the effectiveness of four different skin creams for the treatment of a certain skin disease. He has eighty subjects and plans to divide them into 4 treatment groups of twenty subjects each. Using a randomized block design, the subjects are assessed and put in blocks of four according to how severe their skin condition is; the four most severe cases are the first block, the next four most severe cases are the second block, and so on to the twentieth block. The four members of each block are then randomly assigned, one to each of the four treatment groups.
( Example taken from Valerie J. Easton and John H. McColl's Statistics Glossary v1.1 )

Replication

Although randomization helps to insure that treatment groups are as similar as possible, the results of a single experiment, applied to a small number of objects or subjects, should not be accepted without question. Randomly selecting two individuals from a group of four and applying a treatment with 'great success' generally will not impress the public or convince anyone of the effectiveness of the treatment. To improve the significance of an experimental result, replication , the repetition of an experiment on a large group of subjects, is required. If a treatment is truly effeciive, the long-term averaging effect of replication will reflect its experimental worth. If it is not effective, then the few members of the experimental population who may have reacted to the treatment will be negated by the large numbers of subjects who were unaffected by it. Replication reduces variability in experimental results, increasing their significance and the confidence level with which a researcher can draw conclusions about an experimental factor.

The above modifed from: http://www.stat.yale.edu/Courses/1997-98/101/expdes.htm


Other Things to Watch For