Ok, maybe it’s time for a Genetics 101 lesson:
DNA is made up of 4 different kinds of nucleotides (Adenine, cytosine, guanine and thymine or A,C,G and T). DNA is nothing but a very long string of these nucleotides. Any one gene will have specific nucleotides….like a written word has specific letters in a specific order.
Very basically, there are two kinds of genes: coding genes and non-coding genes. Coding genes specify a cell to make a specific protein. These proteins are the building blocks of all cellular functions. Without them, life does not exist. Non-coding genes don’t code for anything. They are also referred to as “junk” DNA. These genes have no known purpose and just "go along for the ride".
From time to time DNA mutates. An A might turn into a G, or a C might turn into a T. If this happens in coding DNA it’s usually bad news. The resulting protein will be messed up and the cell can’t do its job or will do it the wrong way. This can be seen in some cancers, birth defects etc. Many times the organism will die. Now, if the mutation happens in non-coding DNA it’s no big deal. Since it doesn’t affect any subsequent protein, the organism still lives and reproduces, passing along that mutated non-coding gene to it's offspring.
When we compare the DNA of different species, we generally compare the non-coding DNA. You could use coding DNA, but because this DNA can’t and doesn’t change much it looks much the same in very different organisms. You and I actually have many coding genes identical to those in bacteria. But the non-coding DNA in me looks very different from a bacteria’s non-coding DNA. Because I am more closely related to a dog than bacteria (we are closer in the evolutionary tree) my non-coding DNA will be more similar to a dog than a bacterium. Likewise, my non-coding DNA will be a closer match to a chimps DNA than a dog.
If I have an unknown organism, I can compare its DNA to the DNA of most other known species. There’s been a lot of work, sequencing non-coding genes in many many species the past few decades. And most of those sequences are stored in an electronic database known as GenBank. So, if I have a sequence of a BF gene, I can easily compare it to the thousands of species that have had a gene sequenced already. TADA…….I then know if my BF sequence is genuine (because it will be different from all other species). And I’ll know what species BF is most closely related to.
Unlike hair, or footprints, or blobby photos, or a film, DNA is unambiguous. I can’t change the DNA in the organism I’m sequencing. It can’t be faked or hoaxed. Nor is it subject to interpretation or opinion. The nucleotides in a sequence are in a specific pattern that anyone can read clearly. And if I sequence the same gene from the same organism again, it will result in the exact same sequence.
I’ll save this all as a word document so I can post it again in the future. I have taken a few minor liberties to keep this explanation simple. So, if anyone has corrections or clarifications, I’ll add them.
I suppose I could explain the sequencing process, but I’m getting tired of typing. And anyway, college students pay good money to learn this stuff from PhD’s, and I'm not getting squat for this!
