Bioinformatics

Molecular biology in the internet

Main page

Appointments

Bioinformatics

Literature

Exercises

Tasks

Databases

Software

Sequence comparisons

Homology searches

Motif searches

Hidden Markov models

Hydrophobicity analyses

Topology and helix packing

Protein localization

Secondary structure

Super-secondary structure

3D structure

 

    Exercise #6:

    Localization of bacterial proteins

    Given are eight amino acid sequences. Try to predict the localization of the proteins. Compare several prediction algorithms.

    Protein 1:
            1 mkiktgaril alsalttmmf sasalakiee gklviwingd kgynglaevg kkfekdtgik
           61 vtvehpdkle ekfpqvaatg dgpdiifwah drfggyaqsg llaeitpdka fqdklypftw
          121 davryngkli aypiaveals liynkdllpn ppktweeipa ldkelkakgk salmfnlqep
          181 yftwpliaad ggyafkyeng kydikdvgvd nagakagltf lvdliknkhm nadtdysiae
          241 aafnkgetam tingpwawsn idtskvnygv tvlptfkgqp skpfvgvlsa ginaaspnke
          301 lakeflenyl ltdegleavn kdkplgaval ksyeeelakd priaatmena qkgeimpnip
          361 qmsafwyavr tavinaasgr qtvdealkda qtritk
    
    Protein 2:
            1 mdvikkkhww qsdalkwsvl gllgllvgyl vvlmyaqgey lfaittlils saglyifanr
           61 kayawryvyp gmagmglfvl fplvctiaia ftnysstnql tferaqevll drswqagkty
          121 nfglypagde wqlalsdget gknylsdafk fggeqklqlk ettaqpeger anlrvitqnr
          181 qalsditail pdgnkvmmss lrqfsgtqpl ytldgdgtlt nnqsgvkyrp nnqigfyqsi
          241 tadgnwgdek lspgytvttg wknftrvftd egiqkpflai fvwtvvfsli tvfltvavgm
          301 vlaclvqwea lrgkavyrvl lilpyavpsf isilifkglf nqsfgeinmm lsalfgvkpa
          361 wfsdpttart mliivntwlg ypymmilcmg llkaipddly easamdgagp fqnffkitlp
          421 llikpltplm iasfafnfnn fvliqlltng gpdrlgtttp agytdllvny tyriafeggg
          481 gqdfglaaai atlifllvga laivnlkatr mkfd
    
    Protein 3:
            1 mamvqpksqk arlfithlll llfiaaimfp llmvvaislr qgnfatgsli peqiswdhwk
           61 lalgfsveqa dgritpppfp vllwlwnsvk vagisaigiv alsttcayaf armrfpgkat
          121 llkgmlifqm fpavlslval yalfdrlgey ipfiglnthg gvifaylggi alhvwtikgy
          181 fetidsslee aaaldgatpw qafrlvllpl svpilavvfi lsfiaaitev pvaslllrdv
          241 nsytlavgmq qylnpqnylw gdfaaaavms alpitivfll aqrwlvnglt aggvkg
    
    Protein 4:
            1 masvqlqnvt kawgevvvsk dinldihege fvvfvgpsgc gkstllrmia gletitsgdl
           61 figekrmndt ppaergvgmv fqsyalyphl svaenmsfgl klagakkevi nqrvnqvaev
          121 lqlahlldrk pkalsggqrq rvaigrtlva epsvflldep lsnldaalrv qmrieisrlh
          181 krlgrtmiyv thdqveamtl adkivvldag rvaqvgkple lyhypadrfv agfigspkmn
          241 flpvkvtata idqvqvelpm pnrqqvwlpv esrdvqvgan mslgirpehl lpsdiadvil
          301 egevqvveql gnetqihiqi psirqnlvyr qndvvlveeg atfaiglppe rchlfredgt
          361 acrrlhkepg v
    
    Protein 5:
            1 mkktaiaiav alagfatvaq aapkdntwyt gaklgwsqyh dtgfinnngp thenqlgaga
           61 fggyqvnpyv gfemgydwlg rmpykgsven gaykaqgvql taklgypitd dldiytrlgg
          121 mvwradtksn vygknhdtgv spvfaggvey aitpeiatrl eyqwtnnigd ahtigtrpdn
          181 gmlslgvsyr fgqgeaapvv apapapapev qtkhftlksd vlfnfnkatl kpegqaaldq
          241 lysqlsnldp kdgsvvvlgy tdrigsdayn qglserraqs vvdyliskgi padkisargm
          301 gesnpvtgnt cdnvkqraal idclapdrrv eievkgikdv vtqpqa
    
    Protein 6:
            1 mskateqndk lkraiiisav lhvilfaali wssfdeniea sagggggssi davmvdsgav
           61 veqykrmqsq essakrsdeq rkmkeqqaae elrekqaaeq erlkqleker laaqeqkkqa
          121 eeaakqaelk qkqaeeaaak aaadakakae adakaaeeaa kkaaadakkk aeaeaakaaa
          181 eaqkkaeaaa aalkkkaeaa eaaaaearkk aateaaekak aeaekkaaae kaaadkkaaa
          241 ekaaadkkaa ekaaaekaaa dkkaaaekaa adkkaaaaka aaekaaaaka aaeaddifge
          301 lssgknapkt gggakgnnas pagsgntknn gasgadinny agqiksaies kfydassyag
          361 ktctlrikla pdgmlldikp eggdpalcqa alaaaklaki pkppsqavye vfknapldfk
          421 p
    
    Protein 7:
            1 mtldlprrfp wptllsvcih gavvagllyt svhqvielpa paqpisvtmv tpadleppqa
           61 vqpppepvve pepepepipe ppkeapvvie kpkpkpkpkp kpvkkvqeqp krdvkpvesr
          121 paspfentap arltsstata atskpvtsva sgpralsrnq pqyparaqal riegqvkvkf
          181 dvtpdgrvdn vqilsakpan mferevknam rrwryepgkp gsgivvnilf kingtteiq
    
    Protein 8:
            1 marsktaqpk hslrkiavvv atavsgmsvy aqaavepked titvtaapap qesawgpaat
           61 iaarqsatgt ktdtpiqkvp qsisvvtaee malhqpksvk ealsytpgvs vgtrgasnty
          121 dhliirgfaa egqsqnnyln glklqgnfyn davidpymle raeimrgpvs vlygksspgg
          181 llnmvskrpt teplkevqfk agtdslfqtg fdfsdslddd gvysyrltgl arsanaqqkg
          241 seeqryaiap aftwrpddkt nftflsyfqn epetgyygwl pkegtveplp ngkrlptdfn
          301 egaknntysr nekmvgysfd hefndtftvr qnlrfaenkt sqnsvygygv csdpanaysk
          361 qcaalapadk ghylarkyvv ddeklqnfsv dtqlqskfat gdidhtlltg vdfmrmrndi
          421 nawfgyddsv pllnlynpvn tdfdfnakdp ansgpyriln kqkqtgvyvq dqaqwdkvlv
          481 tlggrydwad qeslnrvagt tdkrddkqft wrggvnylfd ngvtpyfsys esfepssqvg
          541 kdgnifapsk gkqyevgvky vpedrpivvt gavynltktn nlmadpegsf fsveggeira
          601 rgveieakaa lsasvnvvgs ytytdaeytt dttykgntpa qvpkhmaslw adytffdgpl
          661 sgltlgtggr ytgssygdpa nsfkvgsytv vdalvrydla rvgmagsnva lhvnnlfdre
          721 yvascfntyg cfwgaerqvv atatfrf
    

     

    Latest update: October 15, 2009


    Ralf Koebnik
    Institut de recherche pour le dèveloppement
    UMR 5096, CNRS-UP-IRD
    911, Avenue Agropolis, BP 64501
    34394 Montpellier, Cedex 5
    FRANCE
    Phone: +33 (0)4 67 41 62 28
    Fax: +33 (0)4 67 41 61 81
    Email: koebnik(at)gmx.de
    Please replace (at) by @.


    Home Back to previous page