Monarch geneset OGS2.0

DPOGS208019
Transcript: DPOGS208019-TA (3648 bp)
Protein: DPOGS208019-PA (1215 aa)
Genomic position: DPSCF300203, 315558-339302
RNAseq coverage: 12x (Rank: top 83%)
Annotation
Heliconius: HMEL012141, E-value 0.0, 68.12%
Bombyx: BGIBMGA001493-TA, E-value 0.0, 52.02%
Drosophila: stj-PC, E-value 1e-93, 25.19%
EBI UniRef50: UniRef50_Q7Q758, E-value 4e-114, 28.98%, AGAP005490-PA n=9 Tax=Diptera RepID=Q7Q758_ANOGA
NCBI RefSeq: XP_315491.4, E-value 8e-115, 28.98%, AGAP005490-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|158294250, E-value 2e-113, 28.98%, AGAP005490-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|158294250, E-value 7e-116, 29.12%, AGAP005490-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology: GO:0005515, E-value 8.5e-07, protein binding
KEGG pathway: aga:AgaP_AGAP005490, E-value 2e-114
    K05316 (CACNA2DN) maps to MAPK signaling pathway
InterPro domain: [114-224] IPR013608, E-value 4.8e-20, VWA N-terminal
    [256-373] IPR002035, E-value 8.5e-07, von Willebrand factor, type A
Orthology group: MCL26672, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species
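
The transcript and protein lengths in the record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database. It assumes that the transcript is a pure coding sequence (no UTRs) ending in a single stop codon, and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
# Values copied from the record above.
transcript_bp = 3648
protein_aa = 1215
start, end = 315558, 339302


def expected_cds_length(n_residues: int) -> int:
    """Coding length in bp for n_residues codons plus one stop codon (assumed)."""
    return 3 * (n_residues + 1)


def span_length(first: int, last: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return last - first + 1


cds_bp = expected_cds_length(protein_aa)
locus_bp = span_length(start, end)

print(cds_bp == transcript_bp)  # True: 3 * (1215 + 1) = 3648
print(locus_bp)                 # 23745
```

Under these assumptions the 3648 bp transcript accounts exactly for 1215 codons plus a stop codon, while the locus spans 23745 bp on the scaffold.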

Nucleotide sequence:

>DPOGS208019-TA
ATGACGCAACAACTTCTCGATAGACCAGTCCTGTTCTTGTTCCGATGGGCACTCAGTGTCGAATATTTTATCCGCTGTTGCCGCAGTCAGCAACAGAAATTACAAGAATGGGCTAAAGAAATATCTGATGCCTTATATATAAACGAAGAAGAAGTTGTGCACAGAGATATTTTGTTAGAAGGATTCTCAGACATCCAAATTAAAACAAGAAACGGCACGGCCATTGCCGAACAGGCAGCTAAAGCTCTAGAAGAGCTTCTAGATCGAAGAGGCAAAGCCGCAGAAGCTATCATGAGGAAGGCGGAGCAGTTAGCGACCGACAGATCTGACCCACCCAACGACTATTATTATGATAACAGTGTAGACATAAACGTCCTTAAGAAAGTACAGAAACCTGAAAACGAATGGGAACTAGTTTTAAACTGCAGCAGTTTGAATAAAGTAGAGGTTTACAATAGCGCACACTTCGACGCTCAAGTTTCTTTAGAACATACGAGCGTACATGTTGCTGTCGAGGTTTTTGAATGTGATCCCAGAGTACTCCCTGATATATATTGGTCGGAAGGGCTGTTTGAAGCATTTCGAGAAAATTACGCTCAAGACGCCACACTCGACATGCAGTATATGTGTAGTGCTAAAGGATTTTTGAGACACTATCCTGCTGCACTGTGGGATAGTATGTACAAATTGAAGATTGAAGATGGAGAAGCGTTGTACGACTGTAGATTGCGACCGTGGTACGTCAGTGCAAGCGGAGCTCCAAGAGATATTCTTATTTTACTCGATTCCTCGGGGTCTATGAGTAACTCGTCTAACCTTCTCATAGCGGAACAGTTGACTCTGGCCCTCCTCAGCGCCCTCACAGACGACGATCAAGTCAACGTCCTCCGATTCAACGAAATTGTAGAATCTCCCATTCCATGCTTCAATGGAAAACTTGTGCCTGCCAACCATGTGAACTCAGCGGCCATGATGGATGCCCTCCAATATCAAAACACCTCCTGCGAGACCTGGATGGATCATGTGCTGGTGTACTCCGTTAATTTGCTGAAAGAAAGAAAAAAAGCTACAGACAGACCGCCTTCCTGTCAACAGGCCATTGTCTTGATAACAGACAGTCTCTACGAAAATTACACTGACTTAATGAATGTATTAGACCCTGATGGAAGTATCAGGGTGTTTGTCCTATGGCTACATGACCCCAATGGGGTGAGAGATAGTACGCATTTCTATGGAGAATCGGTAAGCTGTTCTCGTGATGGATTCTTTGCGGAGTTGATCACCCATGCTGACGTCACGGAGAGAGTCATGAATATATTAAGAGTACTAGAACGCCCTCTGGTATCTCAGAGGAAGCAGCGTCTGCGAGTCTACAGTGATGTATACGCCAATGTAGAGGATCCAAGACGGGGCGAATATTACTGGCAGCAGAAAGAAAACACCGAACAAATGTATCGTTACACGCAACTCCGTCGCAACAAAGACAAGTTTCTCAATAGCGATCGTCTTTACAGCGACTACCTGCATATGCATAAATTGGAAAAGTTTGGCCAATACTACGAAGGTCAGGATATTAACTACCGACTTCAGGTGACAGTCTCTGTTCCCGTGTTTGACAGCACCACATCTGAAATAGGCGCAGGCGGATCGCTCTTTCTTATTGACCATCGGGGAAACATTGTATTACATGAAAATGCAAAGCCAGTCTTTGACGGTGACATTCTAAAGCCGGGGTATCGAACTGTGGACCTTCTTGATGTTGAGCAACCAGCTGTTGAACACTGGCCCCGGCACTACCCTCAGGAATGGCTGGAGTTTAGAAACACCTTAGTAGTCGAACAGCCTAGTGGAACTAAGACAATGTATGCCAAGAGTATTTTCGATGAAGGTATGAGAGCTTTTTTGGAAATGAAGGAGTACCATTGGAAGAGGGTAAAGAATTACTACACGATCGTGGTGACTCTAACCAAATACAACAATAAGCATGCGGTCCCCGAAGC
AAAGTTTACACAGGCTCTAGCTAATGCCGCGATGAATGCATTGCATGGCACAGATTTTTCCGTGCACCCAGACTGGTTGTATTGCCAGCACGTAGACCCCCACTTTGATACACGGGAAGCGGAAGTTCTTCATTTCCTGAGGCGTCGTCGAGATGAACCCAATTTTTCGATGCGAAAGATCAATCACGTGTTCTCACCCATAAAACCGACGTTACTAGAGAAAACTTACCAATGTAACGAGGAACTAATGGCGAGATTCAGTAGAGAAGCAATCGCCACAAACTCGTGGACTAAGATCGCTGAGAGCACAGAACATGCTTGCACAACCTGTCGACTCGGCTCGACGACCGCCTTCTTTGCCAGCGAAAGCGGTCTAACCCGCTGGCAACAGTACCACGCGACAAGTCCCCACGCAGAGCCGCCATCTGGTGGTGAGTGGCCTCGGGGGCCGGGAGAGACTTGGTACCGGCGAGCAAGTGCGACCCCAGACCTCATTGTACATGCGCCCGTACCCCCCATCAGACTGCTAAGGAACAGCTTCATTGAACCACCTGAGTTGGGCGAGCGATACAAGTGGTTGACTGCTGCGCGTATGATCGCGCACGCCAATAAAGGAACCATCGGTGTCGCCGGCTACCACTTCCACCGGCAACACTTGAAGGATGTGCTGAAATCAATAACTGACTTTCCTTGCGATGAAGAAGAAGAATGTGAACCACGCTGCGACGGCGAGGAGTGGAGCTGTGTGCTCGTTGATGAAGGTGGTTGGATAGTATCAGACACTGAGGAAGAAGACGAGGAGCAAGCAAAGGAACCTGTTACCCAACACCTGGCTAATGTCTACCCAACAGCGATGTCAGCGCTGCTAAACGCTAGCATCTTCAAGCTGCATTGGATTCATGACTACCAAGCTGTTTGTTTCCCGTCAACTAAGGAGATAATACGGCCGAAACATAAAAAAGTGAAGTCCAGCGCACCGAATCTACCGTCACTCATACGAAGTTTATGGACTTCGTTAAGTGAAATACTACTTATAAGTCAGGAAATGTTTACTTTTCTCACCCTTCTAACTACAATACCAGACGGTGTAAATGCAGACACGGAAGCAGAAAAAGAGAAACGTCGTAAAAAAATACGTCGGGATTTCGAACGAGAGAAATATGAAAGACTGTTTGATCCTCGGGTACTTGTTAATAGAACACACTTCGCTGCTTGCGACAGATCCAGGGCGCTGTACGTGCTCCAGCGAAACCAAAGAGCTATGGAAGCTCTGAGAAGGAAGCCCCATCTTTGCAAGTGGCCCCTAGTGGGCACTGAAGTACCAAAGACCAACTTGTTGCTACTCGCAATCTACAAAGGGTGCCCTTTCACTGGCAAACCGCTCAACGATCCCTTCATCAACGAACTGGTGTCGCTGGCTGATGATGAAGAAGGTCGCAGTATGGCGGCGCGACTCGCATGTTGGAGAACCCGCGTCCCTCTACCAGCTCGAGCACCCAGCACGCAATGTTTCCCACACCACTATGGAGATGAAGAGGGTTACCGTCAATGCGGTCCGTGGCTGCCGGATCCACCAAAGAAATCAGCCGACGTTAAACACGTTACATTATTAATTAACTTACCAGTGATATTGATGTTATTAGCTTAA

Protein sequence:

>DPOGS208019-PA
MTQQLLDRPVLFLFRWALSVEYFIRCCRSQQQKLQEWAKEISDALYINEEEVVHRDILLEGFSDIQIKTRNGTAIAEQAAKALEELLDRRGKAAEAIMRKAEQLATDRSDPPNDYYYDNSVDINVLKKVQKPENEWELVLNCSSLNKVEVYNSAHFDAQVSLEHTSVHVAVEVFECDPRVLPDIYWSEGLFEAFRENYAQDATLDMQYMCSAKGFLRHYPAALWDSMYKLKIEDGEALYDCRLRPWYVSASGAPRDILILLDSSGSMSNSSNLLIAEQLTLALLSALTDDDQVNVLRFNEIVESPIPCFNGKLVPANHVNSAAMMDALQYQNTSCETWMDHVLVYSVNLLKERKKATDRPPSCQQAIVLITDSLYENYTDLMNVLDPDGSIRVFVLWLHDPNGVRDSTHFYGESVSCSRDGFFAELITHADVTERVMNILRVLERPLVSQRKQRLRVYSDVYANVEDPRRGEYYWQQKENTEQMYRYTQLRRNKDKFLNSDRLYSDYLHMHKLEKFGQYYEGQDINYRLQVTVSVPVFDSTTSEIGAGGSLFLIDHRGNIVLHENAKPVFDGDILKPGYRTVDLLDVEQPAVEHWPRHYPQEWLEFRNTLVVEQPSGTKTMYAKSIFDEGMRAFLEMKEYHWKRVKNYYTIVVTLTKYNNKHAVPEAKFTQALANAAMNALHGTDFSVHPDWLYCQHVDPHFDTREAEVLHFLRRRRDEPNFSMRKINHVFSPIKPTLLEKTYQCNEELMARFSREAIATNSWTKIAESTEHACTTCRLGSTTAFFASESGLTRWQQYHATSPHAEPPSGGEWPRGPGETWYRRASATPDLIVHAPVPPIRLLRNSFIEPPELGERYKWLTAARMIAHANKGTIGVAGYHFHRQHLKDVLKSITDFPCDEEEECEPRCDGEEWSCVLVDEGGWIVSDTEEEDEEQAKEPVTQHLANVYPTAMSALLNASIFKLHWIHDYQAVCFPSTKEIIRPKHKKVKSSAPNLPSLIRSLWTSLSEILLISQEMFTFLTLLTTIPDGVNADTEAEKEKRRKKIRRDFEREKYERLFDPRVLVNRTHFAACDRSRALYVLQRNQRAMEALRRKPHLCKWPLVGTEVPKTNLLLLAIYKGCPFTGKPLNDPFINELVSLADDEEGRSMAARLACWRTRVPLPARAPSTQCFPHHYGDEEGYRQCGPWLPDPPKKSADVKHVTLLINLPVILMLLA-
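
The protein record should be the translation of the transcript record. The following minimal sketch shows one way to check this, assuming the standard genetic code (NCBI translation table 1) and assuming that the trailing "-" in the protein record marks the stop codon. The helper `translate` is hypothetical, and only the first 24 and last 9 nucleotides of DPOGS208019-TA are used here for brevity.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt: str, stop_symbol: str = "-") -> str:
    """Translate a nucleotide string in frame 1; unknown codons become 'X'."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3  # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(nt[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 24 nt and last 9 nt of the DPOGS208019-TA record.
print(translate("ATGACGCAACAACTTCTCGATAGA"))  # MTQQLLDR
print(translate("TTAGCTTAA"))                 # LA-
```

Both fragments agree with the start (MTQQLLDR) and end (LA-) of the DPOGS208019-PA record. To compare the full sequences, join the nucleotide lines of the FASTA record into one string before calling `translate`.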