Monarch geneset OGS2.0

DPOGS201950
TranscriptDPOGS201950-TA4326 bp
ProteinDPOGS201950-PA1441 aa
Genomic positionDPSCF300244 - 42974-138803
RNAseq coverage305x (Rank: top 37%)
Annotation
HeliconiusHMEL0037840.049.07% 
BombyxBGIBMGA000164-TA0.058.82% 
Drosophilasns-PB0.052.17% 
EBI UniRef50UniRef50_Q0E9F20.052.17%Sticks and stones, isoform B n=20 Tax=Diptera RepID=Q0E9F2_DROME
NCBI RefSeqXP_972469.20.059.80%PREDICTED: similar to sticks and stones CG33141-PA [Tribolium castaneum]
NCBI nr blastpgi|1892351280.059.80%PREDICTED: similar to sticks and stones CG33141-PA [Tribolium castaneum]
NCBI nr blastxgi|1892351280.053.34%PREDICTED: similar to sticks and stones CG33141-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055152.2e-13protein binding
KEGG pathwaymmu:221381e-32 
 K12567 (TTN)maps-> Dilated cardiomyopathy
    Hypertrophic cardiomyopathy (HCM)
InterPro domain[935-1067] IPR0089577.8e-22Fibronectin type III domain
[955-1043] IPR0137832e-17Immunoglobulin-like fold
[952-1032] IPR0039612.2e-13Fibronectin, type III
[33-126] IPR0035996.3e-11Immunoglobulin subtype
[860-948] IPR0130981.9e-10Immunoglobulin I-set
[864-938] IPR0035986.7e-08Immunoglobulin subtype 2
[571-646] IPR0131621.3e-07CD80-like, immunoglobulin C2-set
Orthology groupMCL10875 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201950-TA
ATGAAAATCGTTTTAGCATCAGCATTCACACGCAAGCTCATAGCAAGGTCACTTCCCATAATCGCCTGTTTCCAACAGCAATACTTCAGGGTCCCACCAAAGAGTTTGAGGGTCCAGGAAGGATCAGAAGCCGTTCTAGAGTGCGCCGTTGCTAATCTGGCTGGCCAGGTTCAATGGGCGAAAGATGGATTTGCTCTTGGCTTCTCATCAGTTATACCGGGATATCCTCGCTACACGATGTTCGGGGATCGCCGTCACGGTATCTACAACCTTCGCATCATAAACACCACTTTGGAAGACGATGCTGAATATCAGTGTCAAGTTGGACCAGCCCAAATGCACAAAGTCATTAGAGCGAACGCCAGCCTTACAGTTATATCGCCACCCAATGCCGTGGAGATCGTGAATCATCCCCACAATTCCAAGCTGGAGGTTAAGGAAGGAGAGGATGTGGTGCTAGAATGTCAGGTGAAAAACGCCAAACCAGCTGCCAAGATAGTGTGGTACAGAGGAAACCAGGAGATGAAAGGGGATAAAGTTTCAAGCGAAGACATCAAAGAAGTCGAAGCCGCTAACGGAAATCCCAAATCAACGAGATACACAACAATATCCAGAATCAAAGTGATACTTCGCACGACACGTCACAAAGACGAGAGTCCAGACAAAAAAGTTCACTTCAAAGCTACTTCAGAGGACGATTACGCTGATTTTACTTGTGAAGCACGCCATGAAGCGCTTCAGAGGGACCTGCCGATGCGCAGCACGGTACAGCTCAGTGTACTATATCCACCTGGAGCACCATATATTGAAGGTTATGCAGAGGGCGAAACTGTTAGAAGAGGACAAAGTCTCGAGCTGGTTTGCCGGAGTAGGGGAGGGAATCCGCCAGCACAGTTGATATGGTACAAGAACGGCGAGCAAATACGAATGGCATACAGGACAAGCGGCAGGATGTCCGAAAACGTGTTGTCTTTCAAGGCGGATGCGTCTGATAACAAAGCGAGATACACTTGTGAGGCTAAAAACATCATGATAAGCAACACGCTGAAGGCCGAAATAGATCTCACTGTGCTATTTGCACCATCCCACGTGACGATATCTGGCCCTTCAGAGGCGAGGGTCGGTGATCCAGTACCCTTAAGTTGCAGCACGGCCCCTTCAAACCCAGCTGCTGATATCAAATGGTTGGTCTTAGGTAAACATCACAGGGAAGCAAGCAACAGAACCGTCATATCTCCCGATGGTGGTTGGATCACCACATCTAATATCACAGTGGTGGTGGAGCCACATCGGCGGTCGATCGTCGTGGTATGCCACGGCATTAACGGACAACTGACTGAGAACGTGGTCGCCACACACACCATCAATGTACTATATCCACCTTCAGCTCCAATGATAACTGGTTACATTCCCGGGACGACTCTCTCAGCTGGGACGGTTCAAAAGCTGTCCTGTATATCCACTGGTGGAAATCCGTTGGCTACCTTGACCTGGTTTAAGAATGACAAAAAGATACATTCAATAACTAAAACCACGGACAAGTCGGTGTCGGCTGAGATATCAATACTGACAAACGTGACTGACAACCAAGCGCAGTATAGATGCGAGGCGACAAACAGCGCCACAGAGATACCGCTCTTTGAAACTGTCACTCTGAATGTACATTTCGCACCCGAGACTGTAAAAGTTAGAGCATCACCCGCCGAGCTAACTCCTGGTATAGAAGGCACACTGTACTGCGACGCCGCTTCTAGCAATCCACCCGCGACGCTATCCTGGTGGAGGGATGGGATACCAGTTCAAGGCCTGCCGATGCAGTTAAAGAAGGGTCTCCACGGTGGTACCGTCTCTACCGTAGAGTTAAAGCTGAATATCACCAAGGAGCTAAATGGCGCCGTTTATACCTGCCAGGCTTCAAACGACGCTCTACAAAGGAGCGTCCACGACGCTTTGACTCTTAAAGTATTCTATCCGCCGATATTCGACGACACGCCCCTCTCGATTGTGGGGGTTGAAAACGACCCGTTGGTTGTGATGCTGCGAGCTGACGGGAACCCCTCCAGCATCACGTACACCTGGACCAAGGACGGTCTCCCGGTCACACAAGCTTCATACAGCAGTGCCAACGATCGTATCGTCTCGTCGGGCGGGACTTTGAATATGACTCGTGTGTCACGACACGATGCTGGGACTTACTCCTGTGAAGCTCTGAACGCTTATGGCAGCGCTCGGATTAACATAACAGTTAATGTGCACTATCCAGCCGATATTAAATCTGTCTGGCAGACAGGTATTGTGGATCCTAATGACAACGCTGTACTGGCGTGTACGGCCAGCGGGAACCCTTTAACATCGGATCACATAAAATGGGAGCGCAAAAACTATGACATGTCGACGAAATTAGTGACATTTGAATCTAAGAACCAAACAAGCTATCTAACGATAGAGAGAGCGGCGAGAGAGGACGTCGGCTCGTTTGAATGCGTGGTGAACAACGGCATCGGCGGCGAGACTCGCCACGAAGTGATGTTGGTTGTCAAATTCAAACCTGAAATGAACACCTCGCCGACACTCGCCAAATCCGCGTCCAATGTCGGCCAAGTCGGGCGGCTAACTTGCAAATGCAAGTCCGCCCCGGCACCTAACTTCACGTGGTCGAAGGGCGGCGTTAAGCTCCCCGTGAACACGTCTACGAAGTACTTCGCTGAGTATCACAGGAACGACCAGATCACATACACCTCTGTGTTATTAATAAACGACATAAGCACGTCGGATTACGGCGCATACGAGTGCGGGGCGAGGAACGACCTCGGCTTCGGGTCAGTCTCCGTAAAATTGGATGTCACCGGTCCGCCAGACCCGGTGTCGTCTATAGTTGTAACCAACGTCACCCACGACACCATAACCCTAGAGTGGGTGCCAGGCTTCGACGGCGGACTGACCTCCTGGTTCAGAGTTCGTTATCGCAAACCCCACGACTCTACATACACGTATCACGACGTAACCCCCAACACGACCCACTATACTGTGTCGGGGTTGGAGCGACACACGGACTACGTACTGTCCGTCATGGCGGTCAACGGAATGGGGGAGAGTCGCTACAGACCTGATGATACGAAGGCTACGACCCTCAGCATTAATATTGGTTGTTCTATTCCACGCTTCATAGACATCATCCCACCAGTACAGTCTAGTTCCGAGTACAACTCGTATAATGTGCCGGGCGTGTTCATAATAACCGGCACCTTGGTCGGTACAGCGCTCATATTGCTCAACGTGCTGCTGGTGGGATTTTGTCTGCACAGGAGGACGAACAAACGTATACGAGCTTCATCAGAAGTTGGTGAACTGAACGTAGTTTCTACGGAGCATGTAGAAACGGCCGACGTGTCTAAATCAGTAGTTTTATACGTGTGCGTGACAGTCGCTGTGTTAGTTATTATAAACGCTGTTTTAGTTGCTTGTTTTGTATTGAAACGACGCTCCAAGCGCTCCAAAGAGCAAGCTGGGCAGTCGTCAAAATCAACGCCCATAGAAATGTACGCTCCGTCATCTTACAACGACACTATGGGCGAAACACTAAGCTCAGTGTCAGAGAAATCTGAAACGTATTCACAAGATGAAGCGCCCCCCGTCCCTGACGTACCCAGCATGCCGAGACACATGATGAACCAGTCGGATTCTTATCTCCTGGATGAGAACCTGGTCCCTCCCCCCTTAGACTACCCTCCGCCGAACTACGTGTATGACGAACACGCGAGGACTCTACCACATCCACACAGACTACGAGAGGTCCGGGGACACAGCACCCTCGGGCGGACGGCCGGTAAACAAGCGTACGTACCGACGCCGAGTCCGATGCCACCATTAGACGGCTCCTACTACAATATGGCGTCCGATAGATACCTGTCTTACCCACCACTCATTGGAGAATATTTACAACAGCAAGCCGGTAGAACTCCGACTCCGCCACAGCAATACTCTAGAGATAATCATCTGAGTCCACCGAACTGTGGAATCGATGGTGAACGAGCTGTCCCCCCTGATGTGACTGTTCTTCACCCCCCAGTATGTACACAGCAATTTCCGTTAAACCCATCTCTATCTGTGAAGCAACCGCAGTCCATACTAAAAGATCCGTCGAGGCATAAATATAGTAACCAATACGGCAGTCCCATATCCTCTAGTTCGCCTCAGAACCAAAGTCAAATATTGACAGTTCAGAATTTAACGGATGTACCACAGTACGGTACCATAAAGAAAGACAAGAAACAAAACGTCACTATAGACGAATCATTCAACAAACAACAAACGCACGTAGTTTAA

Protein sequence:

>DPOGS201950-PA
MKIVLASAFTRKLIARSLPIIACFQQQYFRVPPKSLRVQEGSEAVLECAVANLAGQVQWAKDGFALGFSSVIPGYPRYTMFGDRRHGIYNLRIINTTLEDDAEYQCQVGPAQMHKVIRANASLTVISPPNAVEIVNHPHNSKLEVKEGEDVVLECQVKNAKPAAKIVWYRGNQEMKGDKVSSEDIKEVEAANGNPKSTRYTTISRIKVILRTTRHKDESPDKKVHFKATSEDDYADFTCEARHEALQRDLPMRSTVQLSVLYPPGAPYIEGYAEGETVRRGQSLELVCRSRGGNPPAQLIWYKNGEQIRMAYRTSGRMSENVLSFKADASDNKARYTCEAKNIMISNTLKAEIDLTVLFAPSHVTISGPSEARVGDPVPLSCSTAPSNPAADIKWLVLGKHHREASNRTVISPDGGWITTSNITVVVEPHRRSIVVVCHGINGQLTENVVATHTINVLYPPSAPMITGYIPGTTLSAGTVQKLSCISTGGNPLATLTWFKNDKKIHSITKTTDKSVSAEISILTNVTDNQAQYRCEATNSATEIPLFETVTLNVHFAPETVKVRASPAELTPGIEGTLYCDAASSNPPATLSWWRDGIPVQGLPMQLKKGLHGGTVSTVELKLNITKELNGAVYTCQASNDALQRSVHDALTLKVFYPPIFDDTPLSIVGVENDPLVVMLRADGNPSSITYTWTKDGLPVTQASYSSANDRIVSSGGTLNMTRVSRHDAGTYSCEALNAYGSARINITVNVHYPADIKSVWQTGIVDPNDNAVLACTASGNPLTSDHIKWERKNYDMSTKLVTFESKNQTSYLTIERAAREDVGSFECVVNNGIGGETRHEVMLVVKFKPEMNTSPTLAKSASNVGQVGRLTCKCKSAPAPNFTWSKGGVKLPVNTSTKYFAEYHRNDQITYTSVLLINDISTSDYGAYECGARNDLGFGSVSVKLDVTGPPDPVSSIVVTNVTHDTITLEWVPGFDGGLTSWFRVRYRKPHDSTYTYHDVTPNTTHYTVSGLERHTDYVLSVMAVNGMGESRYRPDDTKATTLSINIGCSIPRFIDIIPPVQSSSEYNSYNVPGVFIITGTLVGTALILLNVLLVGFCLHRRTNKRIRASSEVGELNVVSTEHVETADVSKSVVLYVCVTVAVLVIINAVLVACFVLKRRSKRSKEQAGQSSKSTPIEMYAPSSYNDTMGETLSSVSEKSETYSQDEAPPVPDVPSMPRHMMNQSDSYLLDENLVPPPLDYPPPNYVYDEHARTLPHPHRLREVRGHSTLGRTAGKQAYVPTPSPMPPLDGSYYNMASDRYLSYPPLIGEYLQQQAGRTPTPPQQYSRDNHLSPPNCGIDGERAVPPDVTVLHPPVCTQQFPLNPSLSVKQPQSILKDPSRHKYSNQYGSPISSSSPQNQSQILTVQNLTDVPQYGTIKKDKKQNVTIDESFNKQQTHVV-