I’ve been thinking a lot lately about the possibility of running a multiple regression analysis in order to help stratify certain aspects of bigfoot behavior. This won’t prove or disprove anything but it could turn out to be a useful research tool.
First, here’s a little information concerning the definition of a multiple regression analysis;
regression analysis
n : the use of regression to make quantitative predictions of one variable from the values of another
also
A statistical technique used to find relationships between variables for the purpose of predicting future values.
The general purpose of multiple regression is to learn more about the relationship between several independent or predictor variables and a dependent or criterion variable. For example, a bigfoot researcher might record for each sighting the size of the bigfoot if witnessed, the size of any footprint found, the time of day or night that the sighting occurred, any limb twists/limb formations found, the season, the weather conditions, the geographic location, and a subjective rating of observed behavior. Once this information has been compiled for various sightings it would be interesting to see whether and how these measures relate. For example, it has been theorized that the bigfoot in the southern portion of the U.S. seem to be more aggressive than those in the Pacific Northwest. A multiple regression analysis, while not being able to prove or disprove this theory, can help correlate a geographic layout that will either shore up, or dilute the theory.
This is actually a very simplified example in that certain factors would have to be applied (observed behavior) in order to increase the reliability of the analysis. Also, it will be important to attempt to guarantee a good representative sample. Otherwise, the degree of error can become so great as to render the analysis inconclusive.
This will prove to be a daunting task. The compilation of data alone will be the hardest part of this project. There are literally thousands of reports that will have to be gone over to obtain the necessary information. Also, a template will have to be developed in order to correctly enter the data. Once that is all completed however, the analysis itself is really no more than a few keystrokes away.
I’m still trying to sort out the best way to approach this. I’ve got a friend who’s a statistics professor here at the local community college. I’m going to run these thoughts by him and see if he can get me started in the right direction.
If anyone else out there has any statistical background and you’d like to add to this (or better yet, point out any obvious errors) please fire it on in! As I said, I’m not an expert.
AND…if anyone is aware of a multiple regression analysis that’s already been done, please post a link here. This is one wheel I’d hate to reinvent.