Monarch geneset OGS2.0

DPOGS206787
TranscriptDPOGS206787-TA2112 bp
ProteinDPOGS206787-PA665 aa
Genomic positionDPSCF300001 - 5203084-5205253
RNAseq coverage1x (Rank: top 94%)
Annotation
HeliconiusHMEL0072533e-8434.02% 
BombyxBGIBMGA010315-TA2e-0737.88% 
Drosophila% 
EBI UniRef50UniRef50_Q8MY382e-9335.40%Gag-like protein n=7 Tax=Papilio xuthus RepID=Q8MY38_9NEOP
NCBI RefSeqXP_002028747.12e-1931.05%GL15642 [Drosophila persimilis]
NCBI nr blastpgi|220040006e-9335.40%gag-like protein [Papilio xuthus]
NCBI nr blastxgi|220040004e-10635.43%gag-like protein [Papilio xuthus]
Group
Gene OntologyGO:00082703.3e-06zinc ion binding
GO:00036763.3e-06nucleic acid binding
KEGG pathway 
InterPro domain[563-606] IPR0130843.3e-06Zinc finger, CCHC retroviral-type
Orthology groupMCL18549 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206787-TA
ATGGGCGTCACCCGGGGGGCGACCCTAAGTGAGAGTGAAGAAGTAGAGAAGGAGATTGAAAAGAGGACAGGCGGGGTAACTGCTTTGGAGTTTGGCTCTAATCAAAGCCTTAACATGAGCATTGCCAGCTCTGACGGCGAGATCAATAATGGAATACGTAAGAAACGCACCCGTGACGCGGAAGCCGATAGTGAATCGGAAGAAATTCCTGGTCTTTCCAAGTCTCGGATAAAGCGCAGTATACAAGTGCCAAAACGAGGGAGAGGTCGCCCACCCACCACTGGAGTGTATGTGGGAAATGCAAGGGTGAGGAAAGAGCTCGCCGAAGCTAAACGGGCGTCAATGGAGCTTCTTACACAGACAGAGGTGTCTAATTTCACCCAAAAGCGTCGGCTCGAAAGAACGTCTCGATCTCTTGAATCGTTGGTCACCCTGGATTCTCGTCCTTCTAATAGTATGTTTGACAAGGTCGAGAAAAGCATCAGTGTCATCACCGAAGTGGCAATAAAATCGAAGAACTTGAAGGGTGCCTTTGTCAAAGCTCTTAATGAAGCGGCATCCAATATTCGAGAGGCTATGGAAGCCCTCCAATGTCAATCTTCCACGGAGGAAACTCGACGTCTTCAAGCGGACAATAAGAGTCTTCGAGATGAATTGGCAGCACTTCGCAAAGAAGTACATAATATGCGTGCAGCCATGGGAAAGTCGGCAAGTCCATCCCAGGAACCTATACTGCCAGTGCAGAGGATGGATGACGTCGTGCGCGAAGTTTTGTCCCAGGTAGGGACAATGATTAACGCACGATTTGAAGCAATGGAAGAGCGATTGCTTCCTGAGAAGCGTCTTAGACCACCTCTGGCCGCGGATAAACGGGCTGCAGAACGAGTTGGGATATCCTCAGCGACGAACTTTGCCCTCTCACCTCTGGAGAGGCCATGGAATGCTGTAGTCCAAAGTAAGGGAAAAAAGTCAACACCTGCAGCAACCACTTCTGGGGCTAACTCTATCGAGACAAGTGCTTCCATATCAAAACGAATGACCCAGACGACAAAACCGTTTCACCGAAAAGCACCAGACTCACAGTCTACGGCGGCATCTAATCTTGCAGAGGCGTCTGTGTCTACTGCAGAGTCTTCGAGGAAAAACTTTAAAAACAAGAAAAGACAAGCAAAGCAAAGGTTAAGTGCTCCACGAACGGCAGCCGTAGTCCTTACATTGCAGCAAGATGCTGTTGAAGAAGGAATCACGTACAGAGATATCCTTGCCAAGGCACGCCAAAGGTTGGATCTGAGAGCCTTAGACATCCCTGCAGGCCTTACAATTAGGCAGGCAGTCACTGGTGCACGAGTACTGGAGTTGCCTTCAGGAGTCTCGAGCGAAACAGCAGATCGTTTTGCTGTTAAGCTTCGGGAAGTTCTCGCCGGTGAAGCACGAGTCACTAGGCCGATAAAGTGCGCGGAGTTACGTCTTACTGGTCTGGATGACTCAATAAGCAAAAACGAGGTACTAGCTGCGGTTGCATCAACGTCAGGCTGCCCCCCAGAACATATCAAAGTAGGTGAGATTCGTTTTGGTGCTCGAGGAACTGGATCTTTATGGGTACAGTGCCCAATTACAGCTGCAAAAACACTTGCGGCGACTGGGCACCTACAAGTGGGATGGAGCAAAGTTCGCATTGTAACGTTAGAGCAGCGCCCGATGAAATGTTTTCGGTGCATGGAAATCGGGCATACACGGCTCCAATGTAGCTCACAAGTAGATCGCACTAATCTCTGTTACCGCTGCAGTGAACCTGGCCATAAGGCGGCAACATGTGCCGCTAAACCGCATTGTGCAGTCTGTGCCCATGTGGGCAAGCCTGCGGAACATACGATGGGGGGCAGAGGATGCTCCCCTACAAAAAATAGGAATAAGGTTGGCCGTGTCCCAGCCTCCCAACAGACGGATCAGCAGACATTGGAATCGGAGGAAACTAATATAAACCCTACTAATGGATAATGTTCGAAAGCAATTCCTCCAGGTGAATTTGAACCACTGCGCAAGTGCCCAAATTTCGTAGTTTTCTTGGAACCTCCGGTTGCCACCTTATATAATTAACTCGCATTTTGTTAA

Protein sequence:

>DPOGS206787-PA
MGVTRGATLSESEEVEKEIEKRTGGVTALEFGSNQSLNMSIASSDGEINNGIRKKRTRDAEADSESEEIPGLSKSRIKRSIQVPKRGRGRPPTTGVYVGNARVRKELAEAKRASMELLTQTEVSNFTQKRRLERTSRSLESLVTLDSRPSNSMFDKVEKSISVITEVAIKSKNLKGAFVKALNEAASNIREAMEALQCQSSTEETRRLQADNKSLRDELAALRKEVHNMRAAMGKSASPSQEPILPVQRMDDVVREVLSQVGTMINARFEAMEERLLPEKRLRPPLAADKRAAERVGISSATNFALSPLERPWNAVVQSKGKKSTPAATTSGANSIETSASISKRMTQTTKPFHRKAPDSQSTAASNLAEASVSTAESSRKNFKNKKRQAKQRLSAPRTAAVVLTLQQDAVEEGITYRDILAKARQRLDLRALDIPAGLTIRQAVTGARVLELPSGVSSETADRFAVKLREVLAGEARVTRPIKCAELRLTGLDDSISKNEVLAAVASTSGCPPEHIKVGEIRFGARGTGSLWVQCPITAAKTLAATGHLQVGWSKVRIVTLEQRPMKCFRCMEIGHTRLQCSSQVDRTNLCYRCSEPGHKAATCAAKPHCAVCAHVGKPAEHTMGGRGCSPTKNRNKVGRVPASQQTDQQTLESEETNINPTNG-