Monarch geneset OGS2.0

DPOGS215386
TranscriptDPOGS215386-TA1914 bp
ProteinDPOGS215386-PA637 aa
Genomic positionDPSCF300088 - 437011-439289
RNAseq coverage97x (Rank: top 62%)
Annotation
HeliconiusHMEL0097200.059.24% 
BombyxBGIBMGA012403-TA6e-15244.53% 
Drosophila% 
EBI UniRef50UniRef50_F4X6292e-1624.55%Putative uncharacterized protein n=4 Tax=Acromyrmex echinatior RepID=F4X629_ACREC
NCBI RefSeqXP_001851405.19e-1422.63%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|3800237085e-2123.59%PREDICTED: uncharacterized protein LOC100868926 [Apis florea]
NCBI nr blastxgi|3800237085e-2423.59%PREDICTED: uncharacterized protein LOC100868926 [Apis florea]
Group
KEGG pathway 
Orthology groupMCL25965 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215386-TA
ATGACTTCAGTAACAATATTCAAGCGAAAAAAAAATAAAGCAGAGTTTACTGCTGAAAATATAAACAATACCGCTCCTGTGAAAAATATTTTAGAAGAATTTCAAGATAGGAATGCCCAAGCTTACATGTATTACTGTCATAATGTGAGACAACGATTAACAACGCCGAAGACTCTACAAAACTACCACTTCGTAGTACCTGGCTATCCATTAAATTGTTTTGACGATGATTGGGATGGAGCCTTGAACTTTGACTACTTTGAACATTTTTCGTACAAAAGCTTATATCCAAAAATGATCACAGAATTAAAAAATACAATGAAAGCAGACCCATTGACAAGTTTTATGAAAACTAAAATGTCTTGGCAAAGAGATTGTCGCAATTTAAAATTAACTGCTGCTTTCACTAACATGAATCAAAAGTATGAAGTTTTAAATGATGAGGTGGTACAGTGTCCCAAGCATTTAGTAGACACAGCTGCCCATATAGATAGTGATCCCGACCCAAATTTTGACAGCTCATATAACTGGTACTATGCAGGCAACGGTAACCTACAGCTTGTGTGCATTGATTGGATTGATTACTTACTACATAGTGAATTTTCCAGTGTCTATTTAACACAGTTGAATAGAAATGATTTAAATATTAAGCCCAACATAGAAGCATCTTTTGACTGTGGTGAAGGGAAGAACATTTTGGAAACCATATTTTCATCACAAAATATTACAGTTTTAAGGACAAAATACAAGATATTTATATTAAGTTTGACAACTGGTGATGAAATAAAATTTGAGAAAATCAAAGGAATCGATTCTGAGGTCCCTTTCACAGGGATTTCATTTGATGCTTTCCATAAAAACATATTATATATAACTACTTTAGATAGTAAATTATTTATAGTAAATCTAGATAGATTGAAAGCAAAGATTATAAATCTGGTAGACAGGCCGACTTTAATAGATAACTGGAACACTGTGATCAGTTCAGAGAGGGGATTCTACACACATGTGGGAAGGCAGAGCGTAACGCTTTATGACAAAAGGAGTCATGACACCATCCATATATGGAAAAATGTAAGAAATATTACTGACGAAATGGCCTGCAATGACATAAGTGTGGCCAAGCACTTAGAAGGCACCTCGTCGCTGTACTTTGGAACGGATCATCATCTCTTTTTAATGGACTTGAGATTCCATCAAAAAAACGCCAAAGTAGTTCAGAGGTGGACACACGGCATGGAGTGTGTACCAACATATTTAGCTAATTGCATCTTTGAATCCAATAAAGATCTGATATGCTTAAGTAGTCAATGGTGTGAGGATATGTGTGTGGTGCCTAATTATAGTAACCGAAATTCCAAAGACACCGTAAATGGTGGGGTCTTTATACCTTACCGGCCGCCTAACATATTGAACACACTCAATGAAGCCAGGCAGCGGCGGCTATGTTACGATCTCTACAATCCGATAGACGGTAGACTGAGCAGCTCCATCACCGGGCTGGTGGTGATGGAACAGGATGACAGATACAACATTCTTATGCAGAACTCACTTGGGGACATTTCTTGTCACTCATTATTTCAAGAACATATGGAAACGTTCATCGAAGACGATAGCACGCAGTGTTTGCACGACTGGGCCGCGAAATATAAAGTAGGGGGTAAAGATTTCGAAGTCTCGTCCGTTCCGAATGTAGCTAATATTTGGAATAAATTAAAAAGAGTGCCGAGTGACTATGAGATCTGTGAAAATGTGGCTGTGAGTGAATTTGATGAAAAGGAGATTGCAAAGGCTTTCGATAATGAGGAAATTGACAGCGGTCTGCGAGAAGCCTGGCTGAAGACGGACGAGGAGCTTGGAGAACAATCATCACTGATGCTGAACTTACACTTCTCTGATGACGACGAGGAATAA

Protein sequence:

>DPOGS215386-PA
MTSVTIFKRKKNKAEFTAENINNTAPVKNILEEFQDRNAQAYMYYCHNVRQRLTTPKTLQNYHFVVPGYPLNCFDDDWDGALNFDYFEHFSYKSLYPKMITELKNTMKADPLTSFMKTKMSWQRDCRNLKLTAAFTNMNQKYEVLNDEVVQCPKHLVDTAAHIDSDPDPNFDSSYNWYYAGNGNLQLVCIDWIDYLLHSEFSSVYLTQLNRNDLNIKPNIEASFDCGEGKNILETIFSSQNITVLRTKYKIFILSLTTGDEIKFEKIKGIDSEVPFTGISFDAFHKNILYITTLDSKLFIVNLDRLKAKIINLVDRPTLIDNWNTVISSERGFYTHVGRQSVTLYDKRSHDTIHIWKNVRNITDEMACNDISVAKHLEGTSSLYFGTDHHLFLMDLRFHQKNAKVVQRWTHGMECVPTYLANCIFESNKDLICLSSQWCEDMCVVPNYSNRNSKDTVNGGVFIPYRPPNILNTLNEARQRRLCYDLYNPIDGRLSSSITGLVVMEQDDRYNILMQNSLGDISCHSLFQEHMETFIEDDSTQCLHDWAAKYKVGGKDFEVSSVPNVANIWNKLKRVPSDYEICENVAVSEFDEKEIAKAFDNEEIDSGLREAWLKTDEELGEQSSLMLNLHFSDDDEE-