Monarch geneset OGS2.0

DPOGS214122
TranscriptDPOGS214122-TA3369 bp
ProteinDPOGS214122-PA1122 aa
Genomic positionDPSCF300014 - 1604136-1609884
RNAseq coverage1680x (Rank: top 8%)
Annotation
HeliconiusHMEL0113860.055.62% 
BombyxBGIBMGA006171-TA0.069.83% 
DrosophilaCG42389-PE5e-13435.59% 
EBI UniRef50UniRef50_D6WKT90.041.27%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WKT9_TRICA
NCBI RefSeqXP_973615.10.042.14%PREDICTED: similar to AGAP009522-PA [Tribolium castaneum]
NCBI nr blastpgi|2700065950.041.27%hypothetical protein TcasGA2_TC010469 [Tribolium castaneum]
NCBI nr blastxgi|2700065950.040.96%hypothetical protein TcasGA2_TC010469 [Tribolium castaneum]
Group
Gene OntologyGO:00055153.8e-13protein binding
KEGG pathwayecb:1000538446e-31 
 K12567 (TTN)maps-> Dilated cardiomyopathy
    Hypertrophic cardiomyopathy (HCM)
InterPro domain[792-901] IPR0089571.4e-23Fibronectin type III domain
[711-804] IPR0137831.4e-21Immunoglobulin-like fold
[630-701] IPR0039613.8e-13Fibronectin, type III
Orthology groupMCL12756 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214122-TA
ATGGTAGGCGTGGGCGTAGCGGAGGGCGGCGGGGACGGCTACTACGGCGAGTACTACCCCCCTGAGCAGTACTACATGCCGGAGATGTGCCCTCACCCTCAGCACCCACAGCATGCGCATATGGCGTGCACTGTTCACGCTGAGTATGGTGGAATGCCGGTGGTAACGTCAGCAACAATGATGCCGCCTCTTTTACCACCGGTGATGGACGAGAATATGAGACATTACCTGGTGCCGCATCCTCATGCACAGCCGCACCCACACCACGCGGCGCACCACCAGCCGCCGCACCACCAACCACCCCATCACCAACCGCCCCCTCATCATCAACCACCGCATCATTTTGGACCAACAAATGGTGCTGCGGGTCCCCAACACTTTTATGGAGGTGGCTATCCGACTCACTTCCATCACGTTCCACCCCACCACATGCAGCACTCACCACCACCTCCGGTATACCACAAGGATGAACGAACTCAGCGGCAATACTCTAAACTCAAACAAAAGCTGGAACGCAAACACGTTAATAGGAATAATGGAATAGAAGTAAATTCTGGTGCGAGCACGCCGTCATTATCACCAAGGAAAGAGTCAAATGGTCGCGGTGGTAGTGGGAGTGGTGGAGCGTCGTCTGGCGCTTGGTCTGAGGGCGAGGGGTCATCAGCTGGCGCCTCAATCCAGGGTGATGATGAGAATGATACACAGGCACTGCTAGATCTTGTGTCTGCTACCCGAACACCGCAAGTTAGTGACGTGACTCCAACAAGTGCTCTCGTGCAATGGAATTCCCCTCTACCAGAAGGTGTCACTCTCCCAAATGTGGACCTCACTTACGACCTCCTGCTTGGAGACCGGGGACGGTATAAAGCTATATACAGTGGTTCATCGCTATCGTGTCGCGTAAGAGACTTGAGACCCGGATGCGAATACTCAGTGTGTCTGCAAATCCGTGCGGGTGAGTTGACGGGTGCGGCGAGTGAAGCGGCCACATTCCGCGCTCCACCGGCCCCGCCCGAACGACTGCCGGCGGCGCGCGTTACACAGAGAGCACGAACATCGCTGTTGTTACGCTGGCCCTCCGCCACCGACAACGGAGCGCGAGTCACACACTACCTGCTGGAGATGGACGCCGGGGAGGGCTTCGTGGAGCTCACCAGGCCCCGCACGAGACAACACACCGTCAATAATTTGCAACCTCAGACGCGTTACCGATTCCGGATCGCGGCCGTCAACGAGTGCGGCCGCGGGGAATGGAGTGAAGAGACCGTTGTTTGGACTACGGGGTCTCCGCCGCCTGCTCCCGGCCCACCGACGCTGGTTACCGCCTCGCCGACGTCATTAACCCTGACGTGGCAACGCCGGGCGGAGGAGGAGTTCGTCCTGCAGATGGACGATGTATCACGAGGACACGGATTCCTACCCGTTTACAGCGGGTCGGACTGCACTTACGTGTGTGACGGACTCAGGCGAGCGACCGATTATAGATTTCGTTTGAGGAGCGAAACCGTCGACGGTCAAGGACCGTGGTCTGTGGAGGTCACCTACACCACGCCGCCCGAGCGACCCTGTCCGCCGAGCAGACCCACACCCCGCGGGAAGATACACTCTCGCGCGATACGGTTGAGGTGGGATCCTCCCACCGATAACGGCGGAGCCGCCGTGGACACTTACACTTTAGAAATTGACGGTGGAGAGGGTTACAGCCTCGCTTATCAAGGACCCGAACGTGAGGCCCACTGTGATCGCCTCCTACCAGGAACACAGTACCACGCGCGAGTCAGATGCTCGAACGTGGCAGGCATGAGCGACTGGTCAGCTAGCGAGACAGTTACCACAGAGGCGACCTGTCCCAGCGCGTGTCCCGCGCCGGAGACCAGCGGAGCCAGCCGCGCCACGCAAGCCACCGTGCGTTGGAAGGCGCCCGAATGCACTGGAGGGTCTCCACTTACAGAATATCGCCTCGAACTGGCTGATACTGACGGTCTGGTACGTTTAGTACACGTGGGCCCTGAATCTGAATGTGTCGTTCGGGATTTACTTCCCGGCCGAGAATACCGAGCGTGGGTGACAGCATGCAATAGGGTCGGTGCCGGGCCTCCATCCCCAGCTTTGAGGTTCACCACACAGCCCGCACCTCCCGACGCGCCCGAACCTCCTGTCGTTCATATAGAGAGCCCCCGGACGGCTCTGGTCGAGTGGACCGCTCCCGCTAACAATGGCGCTCCTATTATCGATTTCCGTCTCGAAATGAGTGCGAACAACGTAGACTGCGCCTTCGCCGAGGTATATCGCGGACTGGACACCGTCTGTTCGATAGGGAAACTGACTCCTTTCACGCCCTACTTCTTTAGGGTGAGAGCTACGAATTCGGCCGGAAGAGGCCCGCGCTCGGCGGCCAGCACCGCTCTCACTCCTCGTGCTGTTCCCGCGGCGCCCACGGGGCTTCGACACGAAGCAACCTGTGATTCTCTGAAACTACACTGGCGAGTACCGGCAAATCACGGAGCGGACATTCTTAAATATCGCGTGGAAGTAGACGACACCGCCTTCGATACAGATGGACCCATTCCAGAGAGGCTCGTGGAGGGACTCGAGCCAGACACCGTGTATCGAGTGAGAGTGGCGGCGGTCAACGAACTTGGACCCGGAGATTGGTCGGAAGAGGCGCTCGCCTCTACCCGACCGCGACCACCAGCGCCGCCCGTAGTGAAGTTCGCTCAGGCCGCGCACAATCACCTCCGACTGGAGTGGGCCGGTCGGGAGGGGACACAGTACTGCGTGGAGATGCGCGCGCCTGACGCCCGGGAGTTCCGTCCGGTGTACCGCGGTTACGCACATTCCTGTAAGGTGAAGAAGTTGCGCGAAGCGACGACCTACACGTTCCGGATACGAGCCAGCGACGAGCGGGGCGGGCGCGGCGTGTGGTCGTCGCCGCTGACCGCTCGCACTGCGTCCGCGCCCCCCGCCGCGCCCTCCGCACCCACCGTCACGCTGGTGACACCGCGGGCCGCGCTCGTCGCTTGGGACCCGGTCGACGACGCCGACTACGTGCTGCAGAGCGCGCGCGGCAAGGACGCTGTCTTTAAAGAGGTTTACACAGGCGACGCGTCGCAGTTCCAAATGGAGGAGTTGGAGTACGGCGTGGAGTACCAGGTGCGGGTGTGCGCGACCCGCGGCGGGCTGTCCAGCTCGTGGTCGCCGTGCTCTAAGGTGGTGGTGCCACCGCCGGTGTCGGGTCGGCCGCGTCGTGTCCGTCCGTCCCGGCCGCTGTCCGCGAGTCACGCGGCGCTGATGATGGCGGCCGGCTTCTTGCTGGTGGCGGTCTTGGTGGCTGTCTTCCTTCAGAGCCTGGTGGAGCCTCGCCCGTGA

Protein sequence:

>DPOGS214122-PA
MVGVGVAEGGGDGYYGEYYPPEQYYMPEMCPHPQHPQHAHMACTVHAEYGGMPVVTSATMMPPLLPPVMDENMRHYLVPHPHAQPHPHHAAHHQPPHHQPPHHQPPPHHQPPHHFGPTNGAAGPQHFYGGGYPTHFHHVPPHHMQHSPPPPVYHKDERTQRQYSKLKQKLERKHVNRNNGIEVNSGASTPSLSPRKESNGRGGSGSGGASSGAWSEGEGSSAGASIQGDDENDTQALLDLVSATRTPQVSDVTPTSALVQWNSPLPEGVTLPNVDLTYDLLLGDRGRYKAIYSGSSLSCRVRDLRPGCEYSVCLQIRAGELTGAASEAATFRAPPAPPERLPAARVTQRARTSLLLRWPSATDNGARVTHYLLEMDAGEGFVELTRPRTRQHTVNNLQPQTRYRFRIAAVNECGRGEWSEETVVWTTGSPPPAPGPPTLVTASPTSLTLTWQRRAEEEFVLQMDDVSRGHGFLPVYSGSDCTYVCDGLRRATDYRFRLRSETVDGQGPWSVEVTYTTPPERPCPPSRPTPRGKIHSRAIRLRWDPPTDNGGAAVDTYTLEIDGGEGYSLAYQGPEREAHCDRLLPGTQYHARVRCSNVAGMSDWSASETVTTEATCPSACPAPETSGASRATQATVRWKAPECTGGSPLTEYRLELADTDGLVRLVHVGPESECVVRDLLPGREYRAWVTACNRVGAGPPSPALRFTTQPAPPDAPEPPVVHIESPRTALVEWTAPANNGAPIIDFRLEMSANNVDCAFAEVYRGLDTVCSIGKLTPFTPYFFRVRATNSAGRGPRSAASTALTPRAVPAAPTGLRHEATCDSLKLHWRVPANHGADILKYRVEVDDTAFDTDGPIPERLVEGLEPDTVYRVRVAAVNELGPGDWSEEALASTRPRPPAPPVVKFAQAAHNHLRLEWAGREGTQYCVEMRAPDAREFRPVYRGYAHSCKVKKLREATTYTFRIRASDERGGRGVWSSPLTARTASAPPAAPSAPTVTLVTPRAALVAWDPVDDADYVLQSARGKDAVFKEVYTGDASQFQMEELEYGVEYQVRVCATRGGLSSSWSPCSKVVVPPPVSGRPRRVRPSRPLSASHAALMMAAGFLLVAVLVAVFLQSLVEPRP-