Monarch geneset OGS2.0

DPOGS206816
TranscriptDPOGS206816-TA1524 bp
ProteinDPOGS206816-PA507 aa
Genomic positionDPSCF300001 - 3753660-3772619
RNAseq coverage751x (Rank: top 17%)
Annotation
HeliconiusHMEL0100089e-15470.13% 
BombyxBGIBMGA012759-TA3e-7574.40% 
DrosophilaCG4658-PC2e-12140.19% 
EBI UniRef50UniRef50_UPI00022474921e-12445.09%UPI0002247492 related cluster n=1 Tax=unknown RepID=UPI0002247492
NCBI RefSeqXP_974177.28e-13345.99%PREDICTED: similar to Ankyrin repeat domain-containing protein 13C [Tribolium castaneum]
NCBI nr blastpgi|2700088991e-13146.98%hypothetical protein TcasGA2_TC015511 [Tribolium castaneum]
NCBI nr blastxgi|2700088997e-13046.98%hypothetical protein TcasGA2_TC015511 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[296-453] IPR0065717.7e-12TLDc
Orthology groupMCL15999 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206816-TA
ATGGGAAACAATAATTCACATGGAAAGAGTGATATAAAAGACGGCAGTGGCATGAGCACTCCTAGAGGCTCAACCCGCAGCATGAGATCGGGGTCCAACGGAGAAGTGCAAGATCAGCCAGAAAAGTCTGAGCAACAGACAACGTCTTTAGTGCCCGTTGAAAAACTTGGCAAGCTCCTTGCGCACCGATCACAGAAGGAAGATGGAGTCAACGGGGTTACGGAAAAATCTTTTACCAAATATCTATTTCCGATGTATCCAGAACTGGCGCGTCAGTTTTACAACTATGTTCACAGGACAGCTAAGTGTAAGAACAATCACATTCCCCTGTCTGCGTTCAGACAACAGTGCGAAAAGATACTGGCCATGCTAGATGACAACGCCATCATTGAAACGTATATAAGAATGTTCAGCTCCGAACCCGAAGAAGGCAATGTAACTCCTAACGATCTTCGCGCTCTTCTCTACATCAGCTACAAATTATCGATGGACCACTATCCCGAAGGTCCACAAGCATGTCCAATGATAAATAAGACGCTTTCAGCAGTCATCAGGGGGTGTTTCAACAATAAAGATTCCCATAGCGTCGGATTCGTGGTACGCTGGCTAGAGGAGCACTGTCATCGACTTATATTCCCCATACACAGGTACTGTGTGCACATCCTAGCGACTCGTCACCGCGACATTGAGTCCCTCAGCAGCAGCAGTAGCAGCAGCGGCTCGCTGGAGCTCGGCACTCCGGTTCTGGAACGCGGTTCCTGGGCGGGCGGCTCGGGTGCTGGTTCCGGTGGCAGTGGACTCCTGCCGGTTTCAGCCGCTTGGCTGCTGGCAGCAACGGCACCCCCACTCTACTCCAGACCAGCCAAACCCCCGCCTCATCCAGGTGGTACAACTGCTTCGACTGCGTGGATGGCTCGTCTGGTGTGCGCGCTGCCCTCACACTGGGTGCCGTTGTACGCCTCCAACGAGCACGGACTGGGAGCCAACAGGTTCCTTCACCACACGCTAGCATACCGGGGTCCAACACTAGTAGTTATAGAGGGCAAGTGTGCAGGTGAGGAATCTTCGGAATCGGAGAGGGTCACTGTAGTGGTGTGCGCTCCTCAGGAATGGCGTGACACACACCAGTACTGGGGTGGACCACAGGGAGCCCTTGTGCAACTATATCCCACGTTGTCTTTGGTGGAGTGCGGTCCGAAGCTGTTGTACCTTAATTCTCATATTCGTGGGTACCCACGTGGTCTCCGCTCGGGCTCGGACCCACGTCGCCCGCTGCTGTCTGTAAACGAAGGTTTCGATTCATTCACGTTCAAAGGAGTGCCGTACACTCTCTGCGCTGTTGAGGTGTGGGGTTGTGGTGAGCAGGCCCTGAGGGAAGCACAGCTTGAGGTCAAAAAGTGGCAGGTCCGGGAGGCTGAGAGACAACGACAGGTCAAGATATCAGCTGCGGACTGGATAGAACATCCCGATCGCTACCTACTAGAGCTGGCAGGCCGGCCACAATACAACAATGCTGCCACATAG

Protein sequence:

>DPOGS206816-PA
MGNNNSHGKSDIKDGSGMSTPRGSTRSMRSGSNGEVQDQPEKSEQQTTSLVPVEKLGKLLAHRSQKEDGVNGVTEKSFTKYLFPMYPELARQFYNYVHRTAKCKNNHIPLSAFRQQCEKILAMLDDNAIIETYIRMFSSEPEEGNVTPNDLRALLYISYKLSMDHYPEGPQACPMINKTLSAVIRGCFNNKDSHSVGFVVRWLEEHCHRLIFPIHRYCVHILATRHRDIESLSSSSSSSGSLELGTPVLERGSWAGGSGAGSGGSGLLPVSAAWLLAATAPPLYSRPAKPPPHPGGTTASTAWMARLVCALPSHWVPLYASNEHGLGANRFLHHTLAYRGPTLVVIEGKCAGEESSESERVTVVVCAPQEWRDTHQYWGGPQGALVQLYPTLSLVECGPKLLYLNSHIRGYPRGLRSGSDPRRPLLSVNEGFDSFTFKGVPYTLCAVEVWGCGEQALREAQLEVKKWQVREAERQRQVKISAADWIEHPDRYLLELAGRPQYNNAAT-