Sent to CCL by: "Dan T Major" [majort^mail.biu.ac.il]
Hi,
We have generated tabulated data on the damage caused to nucleic acid (NA)
segments of known sequence by exposure to various chemicals.
We would like to extract NA damage patterns from these data.
In particular, we would like to identify short consensus NA sequences that
interact specifically with certain chemicals (i.e., which consensus sequence is
sensitive to which chemical, and how that consensus sequence is typically
damaged), based on the experimental data.
Can you suggest suitable software or a methodology for
data mining/clustering?
Many thanks.