Cladogram Builder
Choose species and a gene region
To build a cladogram we need the same gene from each species so we can compare differences. The default example uses five Eucalyptus species and the internal transcribed spacer (ITS) region — a non-coding stretch on the genes that make ribosomes (similar to introns). It evolves quickly, making it useful for distinguishing closely related plant species.
Other options to try: rbcL (plants), matK (plants), cytochrome b (animals), 16S ribosomal RNA (bacteria), COI (animals).
Species to compare
The scientific name is used to search NCBI. The display name is what appears on your tree.
"Species name"[Title] AND your gene term. If multiple matching records exist, the first one is used.
Fetched DNA sequences (FASTA)
These are the actual gene sequences pulled live from the NCBI GenBank database. Each entry has an accession number (the unique ID in the database), then a long string of A, T, G, and C — the four DNA bases. Notice how similar but not identical they look.
Multiple sequence alignment
The sequences are now lined up base-by-base. Gaps (-) are inserted where a base appears in some species but not others — this happens because of insertions and deletions (indels) during evolution. An * below a column means every species has the same base there (a conserved position). Columns without an asterisk show variation between species — that's the raw data we'll use to build the tree.
Genetic difference matrix
Now we count the differences between every pair of aligned sequences. This is the heart of the molecular clock idea — more differences means more time since the species shared a common ancestor (assuming mutations accumulate at a roughly steady rate). The smallest numbers indicate the most closely related species.
Note: only positions where both sequences have an actual base (not a gap) are compared. This way, a short or partial sequence doesn't get unfairly penalised for missing data — only real base differences count.
Your turn: Looking at the matrix above, which two species do you think are the most closely related? (They should appear as sister taxa on the cladogram.)
Evolutionary tree
The tree is built by Clustal Omega using its guide-tree-based neighbour-joining approach. Toggle between cladogram view (branches all the same length — shows only the order of divergence) and phylogram view (branch lengths proportional to genetic distance — shows how much evolution has happened).