Monarch geneset OGS2.0

DPOGS205143
Transcript: DPOGS205143-TA (1941 bp)
Protein: DPOGS205143-PA (646 aa)
Genomic position: DPSCF300246 + 23605-25545
RNAseq coverage: 16x (Rank: top 81%)
Annotation
Heliconius: HMEL004746, E-value 1e-147, identity 73.03%
Bombyx: BGIBMGA006842-TA, E-value 1e-11, identity 28.24%
Drosophila: no hit listed
EBI UniRef50: UniRef50_D2A5M1, E-value 2e-64, identity 29.06%, Putative uncharacterized protein GLEAN_15149 n=2 Tax=Endopterygota RepID=D2A5M1_TRICA
NCBI RefSeq: XP_001815322.1, E-value 3e-66, identity 27.13%, PREDICTED: similar to orf [Tribolium castaneum]
NCBI nr blastp: gi|189236296, E-value 6e-65, identity 27.13%, PREDICTED: similar to orf [Tribolium castaneum]
NCBI nr blastx: gi|190702380, E-value 9e-66, identity 26.52%, retroelement polyprotein [Glyptapanteles flavicoxis]
Group
Gene Ontology:
  GO:0003676, E-value 4.9e-25, nucleic acid binding
  GO:0015074, E-value 1.6e-14, DNA integration
  GO:0003677, E-value 1.6e-14, DNA binding
KEGG pathway: none listed
InterPro domain:
  [377-543] IPR012337, E-value 4.9e-25, Ribonuclease H-like
  [382-494] IPR001584, E-value 1.6e-14, Integrase, catalytic core
Orthology group: MCL10083 (Insect specific)
Genotypes for resequenced monarchs and outgroup Danaus species
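
The lengths reported in this record can be cross-checked with a little arithmetic. The sketch below assumes the genomic coordinates are 1-based and inclusive, which the record does not state. The helper `span_length` is a hypothetical name introduced only for this illustration. Under that assumption the genomic span equals the transcript length, which would be consistent with a gene model without introns, and the transcript length equals the protein length plus one stop codon.

```python
# Values copied from the record above.
START, END = 23605, 25545   # genomic position on DPSCF300246, + strand
TRANSCRIPT_BP = 1941        # length of DPOGS205143-TA
PROTEIN_AA = 646            # length of DPOGS205143-PA


def span_length(start: int, end: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


genomic_bp = span_length(START, END)
codons, remainder = divmod(TRANSCRIPT_BP, 3)

print("genomic span (bp):", genomic_bp)
print("span equals transcript length:", genomic_bp == TRANSCRIPT_BP)
print("codons:", codons, "leftover bases:", remainder)
# One codon is the stop codon, so the protein is one residue shorter.
print("codons minus stop equals protein length:", codons - 1 == PROTEIN_AA)
```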

Nucleotide sequence:

>DPOGS205143-TA
ATGAAACATTTATCTCTGTTACTGGAAGCAATCTCAAAAGAGGGCTTTAGATTAAAATTCTCTAAATGCAATTTTGCACAGGATTCTGTTAAATATTTAGGCCACATAATAAAAAATAATACAATTACGCCGCTTAAAGATAATTTAATAGCAATAAAAGATTTTCCAACTCCAAAAAATAAAAAAAATATTCGTCAGTTTCTAGGTAAAATTAACTTTTATGGAAAATACATACCAAATATATCAATCATATTAGAACCTTTACACAATTTGTTAAGAAAAGACCAAAATTTTACTTGGACGAAAATTTGTCAAGAATCTTTTGATAAAATAAAGAATATGCTTTGTTTAAGACCTATATTAGAGATCTTCGATCCAGATTTACCTATACATATTTATACAGATGCCAGTATTCAAGGTATTGGAGCTATATTAAAACAACCACAAAAAAATGGAGAAGAGAAGCCATGTGCATATTTTTCTAGAAAATTAAATGATAGTCAAAAACGGAAAAAAGCTATCTACTTAGAAGGTTTAGCAATAAAAGAATCAGTCAAATATTGGCAACATTGGCTCATAGGAAAAAAATTTAAAGTGTTTTCTGATCATAAACCTCTAGAAAAACTAAATATTAAAGCAAGAACCGACGAAGAGTTAGGAGATCTCGCATATTACTTATCACAATTTAATTTTGAAGTTATATATTCACCAGGAAGGGACAATGTAGAAGCAGATAGTTTAAGTAGAAACCCTGTGTTAGAACCACATGTGAACCAAGATGAAGTCTTAAAAATAACAAATTTTCTTAAACTAGAAGAGATTCAAAAAGATCAAGAGGAAAATGAAAACATAAAACTTAATAAAACAAAACTATTATTAAAAAATAAAGTATACTTTAAAAGAATAGGAAAAAAAGAAAAAATTATACTCACTGAAGAATTTAGTAAAAAATTAATTAAAAATATACATTATGAATACTGTCATATAGGGGTGAAACAAATAGAAAATAAAATCAAACCATTTTATACAGCAAAAAATTTAATAAACAATATAAAAAATATTTGTGATAATTGTGAAATATGTTTAAAAAATAAAACTAGAACTAAATTTAAATTTGGATTAATGTCTCATCTAGGCCCTGCCACCTACCCTTTCGAAATAGTCTCTATAGATACTATTGGTGGATTTGGTGGATCTCGCTCTACAAAAACATATTTACATATTTTAGTGGACCATTTAACCAGATATGCTTACATTTTAACGTCTAAAACACAAAATGCAAATGACTTTATAAAGCTTGTAAAAAAAGTTATCCCTGAAAATAAAATTGGAATGATTCTGTCTGATCAATACCCAGGCATAAACTCAAAAGAATTTAAAGGTTTTTTAATAAAAGAAAATATACCAATTATCTTTACCGCAGTAAATGCACCTTTCTCCAACGGTTTAAATGAACGTTTAAATCAAACAATAGTTAACAAAATTAGATGTAAAATTAATGAAAAGAAAGATAAAATGGCTTGGACGACTATAGCTCATGAATGTATCCAGAAATATAATGAGACTGAACACACAGTTACAGGATTCTCTCCAGTATATTTACTAGAAGGAAAAGCTATTAATATCATACCAGATGAATTAACAGAAAAAAAGTTACAAAGTAATTTATTACAAGACAGAAAAATAGCTTTAGAGAGAACTATTAAATCTCACAACTACAATAAACAAAAATTTGATAAAAACAGGAAAAGTTGTCAGTTAAAAGTAGGTGATTTAGTATATGTAGAAAATGGAAACCGGTTAAATAGAAAAAAACTAGATGAAATAAAAATTGGACCATATAAAATATTAGAAAAATTATCAAACTCAATTTATAAGATAGATACAGGTTACAAAAAATTTAATCAAACTATTTCCACATTACCAAAATTATACCAATAA

Protein sequence:

>DPOGS205143-PA
MKHLSLLLEAISKEGFRLKFSKCNFAQDSVKYLGHIIKNNTITPLKDNLIAIKDFPTPKNKKNIRQFLGKINFYGKYIPNISIILEPLHNLLRKDQNFTWTKICQESFDKIKNMLCLRPILEIFDPDLPIHIYTDASIQGIGAILKQPQKNGEEKPCAYFSRKLNDSQKRKKAIYLEGLAIKESVKYWQHWLIGKKFKVFSDHKPLEKLNIKARTDEELGDLAYYLSQFNFEVIYSPGRDNVEADSLSRNPVLEPHVNQDEVLKITNFLKLEEIQKDQEENENIKLNKTKLLLKNKVYFKRIGKKEKIILTEEFSKKLIKNIHYEYCHIGVKQIENKIKPFYTAKNLINNIKNICDNCEICLKNKTRTKFKFGLMSHLGPATYPFEIVSIDTIGGFGGSRSTKTYLHILVDHLTRYAYILTSKTQNANDFIKLVKKVIPENKIGMILSDQYPGINSKEFKGFLIKENIPIIFTAVNAPFSNGLNERLNQTIVNKIRCKINEKKDKMAWTTIAHECIQKYNETEHTVTGFSPVYLLEGKAINIIPDELTEKKLQSNLLQDRKIALERTIKSHNYNKQKFDKNRKSCQLKVGDLVYVENGNRLNRKKLDEIKIGPYKILEKLSNSIYKIDTGYKKFNQTISTLPKLYQ-
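
The protein record above appears to be the translation of the transcript record, with the trailing `-` standing for the stop codon. A minimal sketch of that translation follows, assuming the standard genetic code and reading frame 1. The function `translate` and its `stop_symbol` parameter are hypothetical names for this illustration, not part of any MonarchBase tooling. The two short inputs are the first four and last six codons of DPOGS205143-TA.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1.

    Stop codons are written as `stop_symbol`, matching the '-' that ends
    the protein record above. Ambiguous bases such as N are not handled
    and raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


print(translate("ATGAAACATTTA"))        # first four codons of the transcript
print(translate("CCAAAATTATACCAATAA"))  # last six codons, including the stop
```

Passing the full nucleotide sequence from the record to `translate` and comparing the result with the protein record is a quick way to confirm that the two FASTA entries agree.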