Monarch geneset OGS2.0

DPOGS205863
TranscriptDPOGS205863-TA2250 bp
ProteinDPOGS205863-PA749 aa
Genomic positionDPSCF300339 - 151358-161435
RNAseq coverage60x (Rank: top 68%)
Annotation
HeliconiusHMEL0072533e-6734.64% 
BombyxBGIBMGA010315-TA6e-0733.70% 
Drosophila% 
EBI UniRef50UniRef50_O014182e-8436.81%Gag protein n=7 Tax=Endopterygota RepID=O01418_BOMMO
NCBI RefSeqXP_002028747.14e-2029.79%GL15642 [Drosophila persimilis]
NCBI nr blastpgi|20552756e-8436.81%Gag protein [Bombyx mori]
NCBI nr blastxgi|20552756e-9935.91%Gag protein [Bombyx mori]
Group
Gene OntologyGO:00082708.3e-07zinc ion binding
GO:00036768.3e-07nucleic acid binding
KEGG pathway 
InterPro domain[645-685] IPR0130848.3e-07Zinc finger, CCHC retroviral-type
Orthology groupMCL18549 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205863-TA
ATGTCTGAGTTTGCAAACTTGTGTGTGCGCGGGTTCTTAGTCGGGACCGAAGTCCATCCAAACCTCTTGCTGCATAGCCGTGATTGTAAACGTTTAGGAGATGTTCGCACAGAGGGCGGCAACACTCTCCCCGAAGCCTTGCGTCGGATGAGAGATCTGGCCAAGGATTGGACGATCGACTCGTGGGCTGACCAACTCAGACAGCCCCGAGTCAGTGTCGCATTAAACAAAAATGTATATAATGTAAACAAAAAAAAAAGAGGGAAATTCTTCCGCGCAAGGGGGGGTACGTCTGACTCGGACGAGAGTGGGAGTGAGACCGTCACCTCCCTGAGATCGGAAGAAGACGACGGAATAGAAAGACGACGCTTCCGGAAGAGGAGGGGGAGCGAAGAGATTGGAACAGGCTCTGACAAGGGCGCGGCGAAGCCCACCAAGCGGGGCAGGGGAAGGCCGCCTACGACAGGGCAATACGTCGGTCTGCACCAGGCGCAGAAGGAGAAGCTCGCCGAGGAGCGCAGGGTTCAGCGCCTGGCGGAGCAAGCCGAAGCAGACCGGCAAATCGCCGAAATGACGAGGGAGTTACCGGAGCTACGGTCCCACCGGCTCTCGGTAACCCCACAACCGGATATGACAATGGAGACGGAGGAGGATGAGGTCTCAGCAGCCGCGATCGGAAAGTCCATCGCGAAGAGTTTGGAGGTCATCACTAACGTGGCCTCCAAATCTAAAAATCTAAGTGGACCCTTCATAAAGGCTCTTAAGTTAGCCACGAAGGAAATTCAAGAGTCGACTGCGGCTCTACTCAGCCGCACGAAGTCGACGGAGACAAGGGCTCTGGAGGCGGCCAACGCCCGGCTCTCGAAAGAGCTTGCCGAGCTCAGAGCGGAACTCGCTGCGGTAAGGCGCGAGGTTAGAAGGACGAGTCCGGAGGTTGCCCGTGTGGCTCCTCCGATAAACATAGAAGAGGTGATGGAGCGAGCGGTGCGCGAGGCGGTCACCTTATCGAGTGCGAGGTTGAACGCGCGATTGGAGTCGTTGGAGTCCAGGCTCCTGCCCGCGCCACGTCTCCGCCCTCCCCTTGCCTCAGACGGGAAAGAGACCGCTGCCCAAGGCACGCAGCCAGCTGTGCCAAGGCCTCGGCAAAAGACTCCAGCCCCGGAAGCCGCGGTGGCGACTCTGTCGCCTCCAACGAACCCTGGTCCGGAGGAGAGGAAAAGGAGGAGAATACGGGCAACAGCTGCAGCGAAAGAAGCGACAGCCGCCAGGAAGGACGGCGGCCACAAAGAACCGACTTCGGAAAAAGCCCCCGCTGCTAGTAAATGGACCAAAGTTCCAAGTAAGAAGGAAGCTAAGGGGAAAAATAAACAGCAGAGGGCAGCGCCGGCGAAGAAGAAAGAGGAGAAGCGGAGAAATCTGCGAGCTCCAAAGTCTGCGGCGGTGGTGATCACCCTGCAGCCGGGTGCCACAGACCGGGGCGTCTCATATAAGGACGTCCTGGAAAGGGCAAAGAAAAGCGTGGATCTGGCGGCATTCGAAATACCGGCCGTCAGATTCAGAGTGGCTGCGACCGGGGCTCGGATGCTGGAGGTCTCCGGGTCGGCCTGCAAGGAAAGGGCCAACGCACTCGCAGGCAAACTCGTAGAGGTCTTGGGGGAAGACGTTCGCATCTCCAGACCTCAAAAGTGCGCCGAACTGCGAGTGTCAGGACTCGACGACTCGGCCTCCGCTTCCGAAATCGCGGAAGCCATCGCCAGGTCGAGCAACTGCGCGCCGGAAGAGGTGAAGATCGGCGAGATCCGGCGCAACCGAGGGCGCGGCACGGCCTGGGTCAAGTGCCCGGTGGAGGCCGCAAAAAGGGCGACCACCAAAGGCGCCAGGACGTACGTCGGGTGGACTGTGGTCCGGGTGGCACTCCTTGAAGCACGGCAAATGCGGTGCTTCAAGTGCCAAGAAGTGGGCCACACGCGGGCGACCTGCTCCAGCGAAGTTGACCGCAGCGAATCTTGTTTCCGCTGTGGTCAACCGGGGCACACCTCTGCGCAGTGTGAAAACGCACCACACTGCGTGCTCTGCGCGGAGAAGAACAAGCCGGCGAACCACAGCGTGGGCAGCAAGGCCTGCGGGAGACCGACGCCCAGGCAGAAGACCCCGAAGGCCCCGAAGAAGAAGGCCCCACAGAAGCCCGCCGCAATACCACCCACGGCGGCTTCAACGGCAGGAGAGGTCCCTATGGAAGCGTCACAATAG

Protein sequence:

>DPOGS205863-PA
MSEFANLCVRGFLVGTEVHPNLLLHSRDCKRLGDVRTEGGNTLPEALRRMRDLAKDWTIDSWADQLRQPRVSVALNKNVYNVNKKKRGKFFRARGGTSDSDESGSETVTSLRSEEDDGIERRRFRKRRGSEEIGTGSDKGAAKPTKRGRGRPPTTGQYVGLHQAQKEKLAEERRVQRLAEQAEADRQIAEMTRELPELRSHRLSVTPQPDMTMETEEDEVSAAAIGKSIAKSLEVITNVASKSKNLSGPFIKALKLATKEIQESTAALLSRTKSTETRALEAANARLSKELAELRAELAAVRREVRRTSPEVARVAPPINIEEVMERAVREAVTLSSARLNARLESLESRLLPAPRLRPPLASDGKETAAQGTQPAVPRPRQKTPAPEAAVATLSPPTNPGPEERKRRRIRATAAAKEATAARKDGGHKEPTSEKAPAASKWTKVPSKKEAKGKNKQQRAAPAKKKEEKRRNLRAPKSAAVVITLQPGATDRGVSYKDVLERAKKSVDLAAFEIPAVRFRVAATGARMLEVSGSACKERANALAGKLVEVLGEDVRISRPQKCAELRVSGLDDSASASEIAEAIARSSNCAPEEVKIGEIRRNRGRGTAWVKCPVEAAKRATTKGARTYVGWTVVRVALLEARQMRCFKCQEVGHTRATCSSEVDRSESCFRCGQPGHTSAQCENAPHCVLCAEKNKPANHSVGSKACGRPTPRQKTPKAPKKKAPQKPAAIPPTAASTAGEVPMEASQ-