Monarch geneset OGS2.0

DPOGS210474
Transcript: DPOGS210474-TA, 1677 bp
Protein: DPOGS210474-PA, 558 aa
Genomic position: DPSCF300062 + 430630-432782
RNAseq coverage: 225x (Rank: top 44%)
Annotation
Heliconius: HMEL016097; 0.0; 68.11%
Bombyx: BGIBMGA002766-TA; 0.0; 61.25%
Drosophila: CG11188-PA; 4e-41; 31.82%
EBI UniRef50: UniRef50_D6WPL3; 2e-89; 42.80%; Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WPL3_TRICA
NCBI RefSeq: XP_966656.1; 3e-92; 43.26%; PREDICTED: similar to apoptosis antagonizing transcription factor [Tribolium castaneum]
NCBI nr blastp: gi|340717965; 7e-94; 40.66%; PREDICTED: protein AATF-like [Bombus terrestris]
NCBI nr blastx: gi|270010381; 1e-109; 44.01%; hypothetical protein TcasGA2_TC009771 [Tribolium castaneum]
Group
Gene Ontology: GO:0005634; 3.4e-27; nucleus
KEGG pathway:
InterPro domain: [454-536] IPR012617; 3.4e-27; TRAUB
Orthology group: MCL12603; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210474-TA
ATGAAATTTAAACAAAAGAAATCTTCTAAACTGACTCTGTCAGATAAAATAGCAGATGCGTTAACTGTTAAACCTAGAGCGGATATAGAAGATGATGTTATATTTGGCACGAAACCAAATACAGTTTCTCGTGCGGATTATTCCTCTGACAGCGAGGATGAGGACGCAATTAGCGACTTTAGGAAGCGTAATGTTAATTTATTAAGTGAGATAAGTAAGAAGTATGAAGGTCAAGTAGTATCTCGGAAAGAACTAGACCGAAATAGCGCAGATGAAAGTGTAGACGATGAGAGCGATGGAAAAAAGTCCGAGCAAGAAATTTCTAAGAATCTTAAGGGTCTGATGTCGGATGACGAGGAAAGCCAAGGAAGCGATGACAGTATCATAAAAAATGTCAAATCCAGAGTTAAATCTGATGATAGTTCATCAGAAGATGAAGAGAGTGATGACTATGATATAGTTAAACATAGAAACGAAGATGAAAGCGAGGAAGATGGCTCAGAGGAAGAAGGAGGATTTGACATCAGTCAAATGGAAGAGCCTGTTAAAGAAGAATTTGAGCATGTCAAAAAGCAAAATGTTTCAGAAGAGGCTAAAAAGGGTATGGCCGTCAGAAACCAGTTGCTATTATGGGAAGGTCTTTTAGAAATGAGGATACATTTACAGAGATGTATGAACTCGGCAAACAAAATGCCTATGTCTGATACGTATGAAACTCTTAAGAACCATTCTGACTTTGTTGAGGAATCTGGGACGGTGATCAACAATGTCGCAAATGTTTTAGACAAATTTTTAAATCTGCAGAGTCTATTATTAAAGCAATATCCTGAAACTAAGACCATATCAAATAAGAAAATTACATCCGAGGCGCAGCAAAAGCAAGGAGAAGGGAGTGACGAAGAAATCCCTAGTGACACAGACAATGAAGAAATTCCTTCCGATACCGAAAGTGAAAATGATCAACCGCAGACGAAAGCAGATAATAAGAAAACCAATGAAAAGAAACGAAAACTAGAAGATTATGAAAGTGATATAGCAACGACCCACAAGGCTTTTAAATCATATAGAGATGCGACAGTCAAAAAATGGAATGAAAAGACACGTCTAGCGACCGCAGCTAATATTAAAAGTTCACCCACAAATACTATCCTACAACAGATATCATATATATTGTCGGACAGGGATAAGCTTATACGTCGGACACAATTAAAGAGATCCGAGTACGATATTATTGGATACAAAAAAGATCCAACTCCCACTGAAAATAGAGATCAAAACGGAATGGGAATAAATCCAATAACAAGAGACAGGAAAGACAATGATGAATACATTCCAGAAATCTTCGATGATAGTGATTTTTATCATCAATTACTGAGAGAATTAATAGAGTGTAAATCAGCTGATATATCTGATCCAGTCCAACTCAGTCGCCAATGGATCGCTCTGCAGCAGATGAGGAGCAAGATGAAGAGGAAAGTTGATACGAGGGCAACAAAGGGTAGGAAAATTAAGTATGTTGTACATAACAAGCTAGTCAGTTATATGGCACCTGAAAAGTCTATTACATGGACCGATGAGAGCACTAATGAGCTATACAATTCGCTGTTCGGCAAAATGTTTGAGAGCAATAATGTTGGCACGAATATAAATTTGGATAATGTAAAACTTTTAAATTAA

Protein sequence:

>DPOGS210474-PA
MKFKQKKSSKLTLSDKIADALTVKPRADIEDDVIFGTKPNTVSRADYSSDSEDEDAISDFRKRNVNLLSEISKKYEGQVVSRKELDRNSADESVDDESDGKKSEQEISKNLKGLMSDDEESQGSDDSIIKNVKSRVKSDDSSSEDEESDDYDIVKHRNEDESEEDGSEEEGGFDISQMEEPVKEEFEHVKKQNVSEEAKKGMAVRNQLLLWEGLLEMRIHLQRCMNSANKMPMSDTYETLKNHSDFVEESGTVINNVANVLDKFLNLQSLLLKQYPETKTISNKKITSEAQQKQGEGSDEEIPSDTDNEEIPSDTESENDQPQTKADNKKTNEKKRKLEDYESDIATTHKAFKSYRDATVKKWNEKTRLATAANIKSSPTNTILQQISYILSDRDKLIRRTQLKRSEYDIIGYKKDPTPTENRDQNGMGINPITRDRKDNDEYIPEIFDDSDFYHQLLRELIECKSADISDPVQLSRQWIALQQMRSKMKRKVDTRATKGRKIKYVVHNKLVSYMAPEKSITWTDESTNELYNSLFGKMFESNNVGTNINLDNVKLLN-
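
Consistency check:

The lengths reported in this record agree with each other: 558 codons plus one stop codon give 559 x 3 = 1677 bp. The sketch below is one way to verify that and to spot-check the translation. It assumes the standard genetic code (NCBI translation table 1) and that the trailing "-" in the protein sequence marks the stop codon; `translate` and `CODON_TABLE` are illustrative names, not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Length check using the figures reported in this record.
transcript_bp = 1677
protein_aa = 558
assert transcript_bp == (protein_aa + 1) * 3  # +1 accounts for the stop codon

# First ten and last five codons of DPOGS210474-TA, copied from the
# nucleotide sequence above.
first_codons = "ATGAAATTTAAACAAAAGAAATCTTCTAAA"
last_codons = "AAACTTTTAAATTAA"

print(translate(first_codons))  # compare with the start of DPOGS210474-PA
print(translate(last_codons))   # compare with the end of DPOGS210474-PA
```

Pasting the full nucleotide sequence into `translate` extends the same check to the whole protein.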