Monarch geneset OGS2.0

DPOGS211677
Transcript        DPOGS211677-TA   3048 bp
Protein           DPOGS211677-PA   1015 aa
Genomic position  DPSCF300151 + 372996-379699
RNAseq coverage   1969x (Rank: top 6%)
Annotation
Heliconius      HMEL015881              0.0  51.35%
Bombyx          BGIBMGA001370-TA        0.0  64.42%
Drosophila      rhea-PB                 0.0  63.73%
EBI UniRef50    UniRef50_UPI00021A7D5D  0.0  73.99%  UPI00021A7D5D related cluster n=1 Tax=unknown RepID=UPI00021A7D5D
NCBI RefSeq     XP_972434.2             0.0  71.14%  PREDICTED: similar to AGAP007474-PA [Tribolium castaneum]
NCBI nr blastp  gi|332026911            0.0  74.60%  Talin-1 [Acromyrmex echinatior]
NCBI nr blastx  gi|332026911            0.0  74.28%  Talin-1 [Acromyrmex echinatior]
Group
Gene Ontology  GO:0001726  6.1e-68  ruffle
               GO:0005925  6.1e-68  focal adhesion
               GO:0007016  6.1e-68  cytoskeletal anchoring at plasma membrane
               GO:0005200  6.1e-68  structural constituent of cytoskeleton
               GO:0005515  1.6e-36  protein binding
               GO:0005488  5.1e-29  binding
               GO:0005158  2.1e-05  insulin receptor binding
KEGG pathway   tca:661162  0.0
               K06271 (TLN)  maps-> Focal adhesion
InterPro domain  [2-687]    IPR015710  0        Talin-related
                 [506-676]  IPR015224  6.1e-68  Talin, central
                 [85-323]   IPR019749  1.1e-59  Band 4.1 domain
                 [318-409]  IPR011993  1.6e-36  Pleckstrin homology-type
                 [204-318]  IPR019748  2.4e-32  FERM central domain
                 [207-316]  IPR014352  5.1e-29  FERM/acyl-CoA-binding protein, 3-helical bundle
                 [93-207]   IPR018979  3.4e-18  FERM, N-terminal
Orthology group  MCL10293 Multiple-copy universal gene
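
The genomic position listed for this gene (DPSCF300151 + 372996-379699) can be split into scaffold, strand, and coordinates. The sketch below is illustrative only: `GenomicPosition` and `parse_position` are hypothetical names, and the span calculation assumes 1-based inclusive coordinates, which the record does not state. Under that assumption the locus spans 6704 bp, more than twice the 3048 bp transcript, which would be consistent with introns in the gene model.

```python
import re
from typing import NamedTuple


class GenomicPosition(NamedTuple):
    """Hypothetical container for a 'scaffold strand start-end' string."""
    scaffold: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_position(text: str) -> GenomicPosition:
    """Parse a position string such as 'DPSCF300151 + 372996-379699'."""
    match = re.fullmatch(r"\s*(\S+)\s+([+-])\s+(\d+)-(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return GenomicPosition(scaffold, strand, int(start), int(end))


pos = parse_position("DPSCF300151 + 372996-379699")
print(pos.scaffold, pos.strand, pos.span)
```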
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211677-TA
ATGGCGACGCTGTCCCTGAAGATATCCCTGGAGGGGGGCAAGGTGGTGAAGACGATACAGTTCGAGCCGTCCACCAGTGTGTACGACGCCTGCAGGATCATCAGAGAGAAGATACTGGAAGCCAATGACAATGACCCTAAAGAGTACGGCCTGTTCCTGGCCTCCGAGGAGGATAACAAGAAAGGCATCTGGCTGGAGGCGACTCGCAGTCTGGACTATTACATGCTGAGGAACGGGGACCTGCTGGAGTACAACAAGAAGACCAGGAACCTCAGGGTCCGGATGTTGGACGGCACGGTCAAGACTCTGCTGGTGGACGACAGTCAGATAGTGGCCAACCTGATGGTGGTGATCTGCACCAAGATAGGCATCACCAACTACGACGAGTACGGCCTGGTCAGGGAGGAGGTGAAGGAGGACGCCGACCCTTGCGAAAGACCCAACTACGGCACCCTCACGCTGAAGAGGAGACATCATGAGAAGGAGAGGGACGCCAAGATGGAACAGCTCAGGAAGAAACTCAGGACTGACGATGAAGTAAACTGGGTGGAGCCGTCCAAGACTCTCCGCGAGCAAGGCATCGAGACCAGCGAGACGCTGCTGCTTCGTCGGAGACTGTTCTTCTCCGACCGGAACGTGGACTCTCGCGACCCCGTGCAGCTCACTCTGCTGTACGTGCAGGCCCGGGACGCCATCCTGGCCGGCACACACCCCATCACACAGGACAAAGCGTGCGAGTTCGCGGGTATTCAGTGCCAGATACAGTTCGGAGATCACAAGGAAGACAAACATACGCCCGGATTTTTAGACCTGAAGGAGTTCCTGCCAGCGTCTTACGTGAAAGTGAAGGGCATCGAGAAGAAGGTGTTCCGCGAACATCGCAAGCACGCCGGCCTCAGCGAGCTGGACGCCAAGGTGCTCTACACCAAGAGTGCCAGGGACCTCAAGACTTATGGAGTCGCATTCTTTTTGGTCAAGGAAAAGATGAAAGGCAAGAATAAACTGGTTCCTCGTCTCCTGGGCGTCACCAAGGACTCCGTGCTCCGCCTGGACGAGAAGACCAAGGAGATCCTGCAGACCTGGCCGCTCACCACCGTCAGGAGGTGGTGCGCCAGTCCCAACACCTTCACGCTGGACTTCGGAGACTACAGCGACCAGTACTACTCGGTGCAGACGACGGAGGCCGAACAGATCCTGCAGGTGATCGCGGGCTACATCGACATCATCGTGAGGAGGAGGCGGGCCAGGGACCATCTCGGCCTGGAGGGAGACGAGGGATCCGCCATGTTGGAAGACAGCGTCGCGCCGTCCAAGGCTAACATCATCCAACACGACACCTTCAAGTCGGCCAAGCCCAATCAAGAATCAGTGGCCAAGCCGGCCGTCCTCAGACCTGGAGCGGAAGGGGCCAAGCCGTTCTCGGTGGCGCACGTGACGGGGGCCCAGCAGACCACGCTCTCGGGGAGGGTCGTCACCGGACACGCGCCGCCAGCTGCGACGCAGGTTCAGCAAACAAAAATCACTTCTATACTGACGGAGCCGCAGAGGGCGTTGTTGTCTACCATCACGAGCGGCAAGGAGATCATCAAGCAGACGGAGGCCGGGCTCACCAGGACGTCCTTGGAGCCCCCGAGCGGCGACGCGGGCTCGATACGCTGGCGGGAGGCCAGGCTCGGCAGCAGCAAGCAGCTGGTGACGTCACACATCGCAGCAATGAACGCAGCGACAGCACAGGTCGTCACCCTCACCGCCGGGCCGGCGGAGGAGGTGGACCACACGGCGGTCGGCGCCGCCATCACCACCATCACCACCAACCTGCCCGAGATGACGAAGGGAGTCCAGATGATCGCTGCCCTCATGGAGGACGAGCACTCCGGGGACAACCTCCTGGACGCCACCAGGAAGCTCTGCACCGCTTTTTCCGACCTGCTGAAGGCGGCTGAACCTGACACTAAGGAGCCTCGCCAGAACCTGCTGAACGCAGCGTCTCGTGTGGGCGA
GGCGTCCACCGGCGTCCTGCACACCATCGGCGAGGAGACCGACGACGACAAGGAGACTCAGGTAACCTCCGGGGACCTGTACACCCCCTCCATATACACTGATAGCGGTAAGCTAGAGGACGACGGCATATACGAGGACGTCGACACCACCGACTGGAAACTGCTGGGGGTTATACAGGAGGAGTCCGACAGCGACTACTACAGGAGTGTCGACTATTGTAACACCTACTACAGGAACAAAAGGCAGCGGGAACACTCTGTATACGACACGCTCGACAAAACCTCGCTGAGACCTGACACGCCGCCCAAACTACAACCGAAGAGGGTCGCCGACAAGATATACGACTTTAAAAAAGACTTCTCCAACCATCACTACGCCAATATAGACCACGAGAAAAACTCGGCGCTCGACAGACGGACCAAAAACGAAATTCTGAGACAGAAGTTCTTCGCCGAATACAACAACGTCAACAAATGCAACAAAAATGAGTTCAATGACAAAATAAATAGACTGAACGAGAAGAAAGCAGGGAGTCAGAGCTGGAGAGAAGGCGCGAAGACGGGGGACAGGGACAAGGCCGCGCTCAACCTGAAGCTGTTTGACCTGGAGAAATATAACAGAGTGTTGACGACCACTGAAACATACAGGGAGACCAGCGAGGCCAGCTTCACCAGCTTCACCAGCAAGACGCTGAACACTAAGACGGACCACTCACACTCAGACACGCTCACCAAGAGGAGCGAGGCCTGCACCAGCACGCGGCGCACGGACATGGACGCCGGCCCCCTGAACAACTCCGCGCCGACTAAAGGACAGACGGAGATCAGAATCACGTACAACCCCATACTGATGACGAGTCAGAAACACGGGGACGAGAGATGTCTCCCGGGAGACGAGGCCCGGCCCAAACTACAGCTGACAGATGTTTGGAGAGAAGATTGGTTGAACATACTGCTGTTACTGGCGTTCATACTCATAGCTGCAGCCTGGTTCATAGACTGGCTCCTGAAGCGTGTCTGCAGCGAGGACTGCCATCAGATGACGTGA

Protein sequence:

>DPOGS211677-PA
MATLSLKISLEGGKVVKTIQFEPSTSVYDACRIIREKILEANDNDPKEYGLFLASEEDNKKGIWLEATRSLDYYMLRNGDLLEYNKKTRNLRVRMLDGTVKTLLVDDSQIVANLMVVICTKIGITNYDEYGLVREEVKEDADPCERPNYGTLTLKRRHHEKERDAKMEQLRKKLRTDDEVNWVEPSKTLREQGIETSETLLLRRRLFFSDRNVDSRDPVQLTLLYVQARDAILAGTHPITQDKACEFAGIQCQIQFGDHKEDKHTPGFLDLKEFLPASYVKVKGIEKKVFREHRKHAGLSELDAKVLYTKSARDLKTYGVAFFLVKEKMKGKNKLVPRLLGVTKDSVLRLDEKTKEILQTWPLTTVRRWCASPNTFTLDFGDYSDQYYSVQTTEAEQILQVIAGYIDIIVRRRRARDHLGLEGDEGSAMLEDSVAPSKANIIQHDTFKSAKPNQESVAKPAVLRPGAEGAKPFSVAHVTGAQQTTLSGRVVTGHAPPAATQVQQTKITSILTEPQRALLSTITSGKEIIKQTEAGLTRTSLEPPSGDAGSIRWREARLGSSKQLVTSHIAAMNAATAQVVTLTAGPAEEVDHTAVGAAITTITTNLPEMTKGVQMIAALMEDEHSGDNLLDATRKLCTAFSDLLKAAEPDTKEPRQNLLNAASRVGEASTGVLHTIGEETDDDKETQVTSGDLYTPSIYTDSGKLEDDGIYEDVDTTDWKLLGVIQEESDSDYYRSVDYCNTYYRNKRQREHSVYDTLDKTSLRPDTPPKLQPKRVADKIYDFKKDFSNHHYANIDHEKNSALDRRTKNEILRQKFFAEYNNVNKCNKNEFNDKINRLNEKKAGSQSWREGAKTGDRDKAALNLKLFDLEKYNRVLTTTETYRETSEASFTSFTSKTLNTKTDHSHSDTLTKRSEACTSTRRTDMDAGPLNNSAPTKGQTEIRITYNPILMTSQKHGDERCLPGDEARPKLQLTDVWREDWLNILLLLAFILIAAAWFIDWLLKRVCSEDCHQMT-
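
The transcript and protein lengths listed above agree with each other: 3048 bp is 1016 codons, that is 1015 residues plus one stop codon. The sketch below shows how that relationship could be checked by translating the coding sequence. It assumes the standard nuclear genetic code and assumes that the trailing "-" in the protein record marks the stop codon; `translate` and `CODON_TABLE` are illustrative names, not part of the database. Only the first 15 and last 12 nucleotides of DPOGS211677-TA are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence; stops are written as stop_symbol."""
    cds = "".join(cds.split()).upper()  # drop line breaks inside the FASTA body
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


head = translate("ATGGCGACGCTGTCC")  # first five codons of the transcript
tail = translate("CAGATGACGTGA")     # last four codons, including the stop
print(head, tail)                    # MATLS QMT-
print(3048 // 3 - 1)                 # 1015 residues expected
```

Passing the full nucleotide sequence to `translate` should reproduce the DPOGS211677-PA record if both assumptions hold.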