Monarch geneset OGS2.0

DPOGS212524
TranscriptDPOGS212524-TA2364 bp
ProteinDPOGS212524-PA787 aa
Genomic positionDPSCF300222 + 517884-524371
RNAseq coverage783x (Rank: top 16%)
Annotation
HeliconiusHMEL0093375e-12241.84% 
BombyxBGIBMGA009793-TA3e-3540.53% 
Drosophila% 
EBI UniRef50%
NCBI RefSeq%
NCBI nr blastp%
NCBI nr blastxgi|502860916e-1121.97%hypothetical protein [Candida glabrata CBS 138]
Group
Gene OntologyGO:00055151.4e-06protein binding
KEGG pathway 
InterPro domain[729-787] IPR0109931.4e-06Sterile alpha motif homology
Orthology groupMCL25653 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212524-TA
ATGATGCAACAACAGGTCTACCAGCAGGATGCTACCATGCAGGAGCAAATCGGCCAGCAGATGGTGCTGTGTCCTGTACGTTTCGTGTATGAAACCCAGATGTTGATGCAGCCAGGTGAACAGATCCAACAGCCGCAAACTATCCTCATCAATCCACAGAACCCACCGCCATGGATACAGAACAGACCTGTCCAGAACCGGGTGATATACGTCCAGCAGGTGCCAGGCGGTTACATACCGTCACAACAACAGCATATAGACCAAAATCAATTGTACATACAGAACTACGGCTACCAGAACGTACCGCAGGTGCTGGTCCAGAACCAGGAGCAGAGGATGCAGATGATAGCACCGAACATCATGACGAACGTCCAACCTAATATCCAAACCAACATTGGACCTGTCCAAAATCCTAACGTCCAAAGAATTATATCCAACGTACCCATGCAGAACCAGATGAACATGAACCAACCGGTCAACCAGCCCATGCCCATCAACAGACAAATAATATCCAGCCAGATGATAAGCATGAACGAGGTCCCCAATAACGTTGACACGCAGGAAATGCCCGCCTCCAATATCTACAGACAGACGCCACAGATAATACAGGATCCTAGATTTTCCTCGCAGACAATAACAATGACGCCAAATAACATGCAAAATGGCCCGAGGTACGAGACCAATACAAATGTCTGCCAAATAAGACCAACACAGCCGATGACACAGCCGATGCCTCCCTACCCGCGACCGAGCAGCGTCGCTGAACCAAGGAAATTAATAAACAAAGCAACCAACTTTAACACGATGACCACCAATGTTAGGACTCCGACACAAATGCTGAGACCCATACAACCCAGACCCAACCAAATCAGACCCAACACGAATTTCCTGCCAATACAGCCAGCAAACTCTCAAAGCACAGCCAACAGCAACAGAAAGATAATGCCAAATCTAATACCAGCAACGAACATGAACGACACAACGAGCAGGAGCGTCGCGTACAACAGGAAACGGAAAAGCGAATCACCCGACGAGATACACAAAAAAATACCCATAAGCAACACGAGCGACGGCGTCGTGCTCATAAAACAAGTGGAAAGTGCGGCTAAGGTGTCCGACGTGGGCGTCAACACCAGCCCCATACACAAAGCGGACGGCAGGATCATCATACACAACATGCAGATAACCCCGATCACCGCGAGCAACCAGGGCAACAAAATGAAGGAGGCAGCCGCGAAACACGTACGGAACACGACGACCCTGGAGCCGCAGGACGTCGTGAGAAACGCGCCAGCACCACCCGCACCGTCTAAGTCCTTCTCTAGTCTCGTTGAAAAAGAAAAATTGATCAGGAACACCGTTTACACGCAAGCTAGAGGCAGGGTGCTGAGCGACAAAGCCGTCAGTGACGTCACCAAGCAGGAGAACAGCGTGGACACGAGTAGAATGGAAATCACGTGCAGCAGCAACGAAAACAGAACGGCTGAAGCGAAGCCAGCACCGACGCTGAACCACGACGCCAGACATTGTGATATAAGTGAAAATAAAACGGCTGAACCCAAGCAATCCCAACCGGAAGTCGTCAAGGAGGCTAAGACAGTGAAAGTAGAAAACAAATGTGCTGTCACCGAAAATATAACGCCCGAACAGACACCACAAGCAGACCAGGCGAGAGACACGAACGGCCACGCGGAAAACGAGACGGCTGAAAGCAAAACACAACCGAAGATAATGAAAGAGATCAAGAAAGAAAAAAACGAGGACGCCATGGACGTGTCGCCAGTCAAGAAAGAAGATGGCAAACCGAAATATAGTAAAATTAAACAGGAGAAAATACCGAAAGAGGAGAGAATAGAAAAGTGTGACGTCACAACGAAACCGGACGTCAAGGTCAAAGAGGACAGGGATTTTATCCTGACGCACGTCCTCGGCGGCTTCGTGATACAGGAGTCGAATGTAGCGTTTCCGATAAGGAAACCGTTGAAGGAAAAGACGTTGGAAATAAACGAAGTGAAGCAACAAGATAAAGATAAAGATACGAAGGAGATCAAAAACACTAGCAAAATATTAGACATATCCAACCTGAACATAGAGGAGTGCGAGAATATGAGAGGCAGCAGCGACGGAGATGGCAGGCAGAAAGAAAATCCGTTCTCCACATTAAAATTGGGCACCGTCAAGAAATGGACGGCCGAGCAGCTGTCAGCGCACCTCAGCAAGTACTCCTGGGCCGAGACCGTGTCGGTTTTACAGGAACACGAGATCGACGGCGAGTCACTGTCGCTAGTGTCCAAGTTGCAGCTCATAAGCATAGGGGTCAGCGAGAACCACGCTGACATCATAGCCGACTTCATCAAGAAATAA

Protein sequence:

>DPOGS212524-PA
MMQQQVYQQDATMQEQIGQQMVLCPVRFVYETQMLMQPGEQIQQPQTILINPQNPPPWIQNRPVQNRVIYVQQVPGGYIPSQQQHIDQNQLYIQNYGYQNVPQVLVQNQEQRMQMIAPNIMTNVQPNIQTNIGPVQNPNVQRIISNVPMQNQMNMNQPVNQPMPINRQIISSQMISMNEVPNNVDTQEMPASNIYRQTPQIIQDPRFSSQTITMTPNNMQNGPRYETNTNVCQIRPTQPMTQPMPPYPRPSSVAEPRKLINKATNFNTMTTNVRTPTQMLRPIQPRPNQIRPNTNFLPIQPANSQSTANSNRKIMPNLIPATNMNDTTSRSVAYNRKRKSESPDEIHKKIPISNTSDGVVLIKQVESAAKVSDVGVNTSPIHKADGRIIIHNMQITPITASNQGNKMKEAAAKHVRNTTTLEPQDVVRNAPAPPAPSKSFSSLVEKEKLIRNTVYTQARGRVLSDKAVSDVTKQENSVDTSRMEITCSSNENRTAEAKPAPTLNHDARHCDISENKTAEPKQSQPEVVKEAKTVKVENKCAVTENITPEQTPQADQARDTNGHAENETAESKTQPKIMKEIKKEKNEDAMDVSPVKKEDGKPKYSKIKQEKIPKEERIEKCDVTTKPDVKVKEDRDFILTHVLGGFVIQESNVAFPIRKPLKEKTLEINEVKQQDKDKDTKEIKNTSKILDISNLNIEECENMRGSSDGDGRQKENPFSTLKLGTVKKWTAEQLSAHLSKYSWAETVSVLQEHEIDGESLSLVSKLQLISIGVSENHADIIADFIKK-