First page Back Continue Last page Overview Graphics
Representing Biological Data
Basic sequence data string of letters
- A, C, G, T (DNA)
- A,C,D,E, etc for Amino Acids
Can be represented in several ways
Substitute arbitrary numbers for letters
- e.g. A=1, C=2, G=3, T=4
- doesn’t reflect some properties of the bases
- problems dealing with uncertainty
- Theoretical problems (measurement theory)