User Manual for KFSP v1.0

 

------------------------------------------------------------------------

 

    How to use KFSP v1.0?

 

------------------------------------------------------------------------

 

The program takes two arguments: the first is the filename of the model file, and the second is the filename of the protein sequence file. The sequence file stores each sequence on two lines: one line with the sequence name, followed by one line with the sequence itself.
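
To illustrate the sequence file layout described above, here is a minimal Python sketch that writes and reads such a file. The function names write_sequence_file and read_sequence_file are hypothetical helpers, not part of KFSP. The sketch assumes the name line is plain text with no special prefix, which this manual does not specify.

```python
from pathlib import Path


def write_sequence_file(path, records):
    """Write (name, sequence) pairs as alternating name and sequence lines."""
    lines = []
    for name, seq in records:
        lines.append(name.strip())
        # Each sequence must sit on a single line, so drop any internal whitespace.
        lines.append("".join(seq.split()))
    Path(path).write_text("\n".join(lines) + "\n")


def read_sequence_file(path):
    """Read the file back into a list of (name, sequence) pairs."""
    text = Path(path).read_text()
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if len(lines) % 2 != 0:
        raise ValueError("expected an even number of non-empty lines")
    # Even-indexed lines are names, odd-indexed lines are sequences.
    return list(zip(lines[0::2], lines[1::2]))
```

Reading the file back before running the program is a cheap way to catch a sequence that was accidentally wrapped over several lines.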

 

You may download pre-computed model files for kinase families at:
http://bioinfo.math.pku.edu.cn/~yuhuan/HSL/model.tgz

    After uncompressing the downloaded file, you will find 81 model files, each named after the E.C. classification number of its kinase family.
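
If you prefer to unpack the archive from a script, the following Python sketch shows one way to do it with the standard tarfile module. The function name extract_models and the directory name in the usage line are hypothetical. It assumes model.tgz has already been downloaded and is a gzip-compressed tar archive.

```python
import tarfile
from pathlib import Path


def extract_models(archive_path, dest_dir):
    """Unpack a .tgz archive and return the extracted files, sorted by path."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    # Only unpack archives obtained from a source you trust.
    with tarfile.open(archive_path, "r:gz") as tar:
        tar.extractall(dest)
    return sorted(p for p in dest.rglob("*") if p.is_file())
```

For example, extract_models("model.tgz", "models") returns the list of unpacked files, so its length can be compared with the expected 81 model files.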

 

    The program uses the given model to predict functional sites in the given sequences. Sample results look like the following:

Testing Sequence: MGKVAVGATVVCTAAVCAVAVLVVRRRMQSSGKWGRVLAILKAFEEDCATPISKLRQVADAMTVEMHAGLASDGGSKLKMLISYVDNLPSGDEKGLFYALDLGGTNFRVMRVLLGGKQERVVKQEFEEVSIPPHLMTGGSDELFNFIAEALAKFVATECEDFHLPEGRQRELGFTFSFPVKQTSLSSGSLIKWTKGFSIEEAVGQDVVGALNKALERVGLDMRIAALVNDTVGTLAGGRYYNPDVVAAVILGTGTNAAYVERATAIPKWHGLLPKSGEMVINMEWGNFRSSHLPLTEFDHTLDFESLNPGEQILEKIISGMYLGEILRRVLLKMAEDAAFFGDTVPSKLRIPFIIRTPHMSAMHNDTSPDLKIVGSKIKDILEVPTTSLKMRKVVISLCNIIATRGARLSAAGIYGILKKLGRDTTKDEEVQKSVIAMDGGLFEHYTQFSECMESSLKELLGDEASGSVEVTHSNDGSGIGAALLAASHSLYLEDS

Model 1 matches with your sequence

Match: ALDLGGTNFRV

99      109

====

Match: LGFTFSFP

172     179

====

Match: WTKGF

193     197

====

Match: VNDTVGT

228     234

====

Match: INMEWG

281     286

====

Match: SGMY

319     322

====

Match: MYLGEI

321     326

====

Match: DGSG

476     479

====

Match: GAAL

481     484

====

Testing Sequence: MNTINIAKNDFSDIELAAIPFNTLADHYGERLAREQLALEHESYEMGEARFRKMFERQLKAGEVADNAAAKPLITTLLPKMIARINDWFEEVKAKRGKRPTAFQFLQEIKPEAVAYITIKTTLACLTSADNTTVQAVASAIGRAIEDEARFGRIRDLEAKHFKKNVEEQLNKRVGHVYKKAFMQVVEADMLSKGLLGGEAWSSWHKEDSIHVGVRCIEMLIESTGMVSLHRQNAGVVGQDSETIELAPEYAEAIATRAGALAGISPMFQPCVVPPKPWTGITGGGYWANGRRPLALVRTHSKKALMRYEDVYMPEVYKAINIAQNTAWKINKKVLAVANVITKWKHCPVEDIPAIEREELPMKPEDIDMNPEALTAWKRAAAAVYRKDKARKSRRISLEFMLEQANKFANHKAIWFPYNMDWRGRVYAVSMFNPQGNDMTKGLLTLAKGKPIGKEGYYWLKIHGANCAGVDKVPFPERIKFIEENHENIMACAKSPLENTWWAEQDSPFCFLAFCFEYAGVQHHGLSYNCSLPLAFDGSCSGIQHFSAMLRDEVGGRAVNLLPSETVQDIYGIVAKKVNEILQADAINGTDNEVVTVTDENTGEISEKVKLGTKALAGQWLAYGVTRSVTKRSVMTLAYGSKEFGFRQQVLEDTIQPAIDSGKGLMFTQPNQAAGYMAKLIWESVSVTVVAAVEAMNWLKSAAKLLAAEVKDKKTGEILRKRCAVHWVTPDGFPVWQEYKKPIQTRLNLMFLGQFRLQPTINTNKDSEIDAHKQESGIAPNFVHSQDGSHLRKTVVWAHEKYGIESFALIHDSFGTIPADAANLFKAVRETMVDTYESCDVLADFYDQFADQLHESQLDKMPALPAKGNLNLRDILESDFAFA

Doesn't match!

 

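For each sequence, the program predicts functional sites using the model file. The results include both the sequence of each site and its start and end positions, which are 1-based and inclusive (in the sample above, ALDLGGTNFRV spans residues 99 to 109 of the first sequence). If the sequence doesn’t belong to the family, the program simply outputs “Doesn’t match!”.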
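
For downstream processing, the output can be turned into structured records. The Python sketch below is not part of KFSP, and the function name parse_kfsp_output is hypothetical. It assumes the output follows exactly the layout of the sample above: a "Testing Sequence:" line, then for each site a "Match:" line followed by a line with the start and end positions, with "====" as separator.

```python
def parse_kfsp_output(text):
    """Parse KFSP-style output into one dict per tested sequence."""
    results = []
    current = None        # record for the sequence being read
    pending_match = None  # site sequence waiting for its position line
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line == "====":
            continue
        if line.startswith("Testing Sequence:"):
            current = {
                "sequence": line.split(":", 1)[1].strip(),
                "matched": False,
                "sites": [],
            }
            results.append(current)
            pending_match = None
        elif current is None:
            continue  # ignore anything before the first sequence
        elif line.startswith("Match:"):
            pending_match = line.split(":", 1)[1].strip()
            current["matched"] = True
        elif pending_match is not None:
            # The line after "Match:" holds the start and end positions.
            start, end = (int(x) for x in line.split())
            current["sites"].append((pending_match, start, end))
            pending_match = None
        # Other lines (model banner, "Doesn't match!") carry no site data.
    return results
```

Each entry in "sites" is a (site, start, end) tuple. Because positions are 1-based and inclusive, sequence[start - 1:end] should equal the site string, which gives a simple consistency check on parsed results.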