Monarch geneset OGS2.0

DPOGS205825
Transcript: DPOGS205825-TA (1422 bp)
Protein: DPOGS205825-PA (473 aa)
Genomic position: DPSCF300081 - 450135-469465
RNAseq coverage: 5x (Rank: top 88%)

Annotation (hit, E-value, % identity, description)
Heliconius: no entry
Bombyx: BGIBMGA014462-TA, 3e-13, 32.04%
Drosophila: no entry
EBI UniRef50: UniRef50_B7S8P8, 2e-06, 31.25%, Retroelement polyprotein n=15 Tax=Endopterygota RepID=B7S8P8_9HYME
NCBI RefSeq: no entry
NCBI nr blastp: gi|190702380, 6e-06, 31.25%, retroelement polyprotein [Glyptapanteles flavicoxis]
NCBI nr blastx: gi|190702380, 3e-07, 30.08%, retroelement polyprotein [Glyptapanteles flavicoxis]

Group
Gene Ontology: GO:0008270, 8.1e-11, zinc ion binding
               GO:0003676, 8.1e-11, nucleic acid binding
KEGG pathway: no entry
InterPro domain: [279-351] IPR013084, 8.1e-11, Zinc finger, CCHC retroviral-type
                 [286-301] IPR001878, 6.1e-07, Zinc finger, CCHC-type
Orthology group: MCL27814, Specific divergent

Nucleotide sequence:

>DPOGS205825-TA
ATGAGTTCACAATCACTAAGCGTTAGCGGAAAACTCATAAGCGATAGCGGAAGTGGGTCATCACCGGATTTACGTTGTAATCTACTGATGTTACAGTGCGAATCTTTTTCCTTTAGCGAATGTGATTGTGCCTCTTTGATTAACGAATTTCGGAAGGAGATGATGTCTTTTTTCCAAGATTTTACAAAAACGCACCAGGAATGTTCCTTATATTTACATAATGAAATAAAAGAAATAAAAGATGAATTAAAGTCTAATAATGCTACTTTAGAATCTTTAATATTGGAGCAAAACCAAACGAAAGTCGACTTAAAAAAAATGAAGTTACTAACTGATACTATAGAACAAAAAATACACGTTCTTGAAAATGATATTTGTTATTTAAAAAATAACTCTAAAACTGATTATCCTAAAGAATATCCCCTGTCCTGCGAGGAAACCATTTCAGAGCTACAAGAACGCTCTAAGCGAGAAAAAAATGTCATACTAGTAGGGATTCCCGAGACAAAAAGTCCTAATAGCACAATCAGACGTGATAATGACAAGGATGAAGCTATGAAATTAATTAAAGCTATTTATGAGGACTGCCCAAACCCTACTACAGTCTTTAGGCTAGGAAAATACAACCCTAATAAAGATAGATCTCTAAAAGTGTGTTTTGAGAAAAAGGACACTGCAAAATATATACTTAAAAATAAAAGCAAACATAAGTTAGACAAAATCCGAATTTACGCCGATCAAACTCAGTGTCAACAACAACACATGAAAAAACTACAAGATGAACTGAAAAGACCCTTGCCAGTGTCTGAAAGAAATTATTCATCGACTGAAGCCAAAAAGCCAAAATTTTACCGTTCTTGCTACAACTGTGGGAAAACAGGACATCTAGCGAATGAATGTCGACGCGTTGTTTTAAAAGAAAATTCTTCTGTAACAAATCAGGATCCTAATCAAGCAAATGCCTCGATGCAACGTAGAGAACCATCACAGAGGACTATCATATGTTACCGATGTGGACAACCAGGACATATAGCACCCTCGTGTCCTACTAAATCTGGGAAAACTTTTGAACACCCAGATCCGACCGAACGACGGGTAGATATCTGCACCGTCAGTCCTGCTTCAGGAGACCTTTATAAGCTGGATCCTAATCTTATTGGATGCGCCGCTGACGGCACAAAGACATATCAGCTACTATACGCCAATATGGAGCTGAATGTTAGGCTACCAAGAAAAAGTACGGATGCAAAAGAAATGAGAATGGCAAAAGACGAGAATGAACAGGACAAGGAATCAGTATGTCTGGGGCAGTCTGAAAGAGGCAAGCTGATGAAAGCTACGATTATGGATGTTGGAAGAAGAAAAGGTAGAGAAAGGCCAAAGAAAATATATTTGCAGAAACAGTTGTGTGTGATAAGGTGA

Protein sequence:

>DPOGS205825-PA
MSSQSLSVSGKLISDSGSGSSPDLRCNLLMLQCESFSFSECDCASLINEFRKEMMSFFQDFTKTHQECSLYLHNEIKEIKDELKSNNATLESLILEQNQTKVDLKKMKLLTDTIEQKIHVLENDICYLKNNSKTDYPKEYPLSCEETISELQERSKREKNVILVGIPETKSPNSTIRRDNDKDEAMKLIKAIYEDCPNPTTVFRLGKYNPNKDRSLKVCFEKKDTAKYILKNKSKHKLDKIRIYADQTQCQQQHMKKLQDELKRPLPVSERNYSSTEAKKPKFYRSCYNCGKTGHLANECRRVVLKENSSVTNQDPNQANASMQRREPSQRTIICYRCGQPGHIAPSCPTKSGKTFEHPDPTERRVDICTVSPASGDLYKLDPNLIGCAADGTKTYQLLYANMELNVRLPRKSTDAKEMRMAKDENEQDKESVCLGQSERGKLMKATIMDVGRRKGRERPKKIYLQKQLCVIR-
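
Consistency check (illustrative):

The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and assuming that the trailing "-" in the protein record marks the stop codon. The helper names `translate` and `expected_protein_length` are hypothetical and not part of any database tooling. Only the first and last codons of DPOGS205825-TA are used here to keep the example short.

```python
# Build the standard genetic code (NCBI table 1) from the conventional
# compact encoding, with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(nucleotides: str) -> str:
    """Translate a coding sequence in frame 0; stop codons become '*'."""
    seq = nucleotides.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def expected_protein_length(transcript_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon."""
    return transcript_bp // 3 - 1


# First 30 nt and last 21 nt of DPOGS205825-TA, copied from the record above.
first_codons = "ATGAGTTCACAATCACTAAGCGTTAGCGGA"
last_codons = "CAGTTGTGTGTGATAAGGTGA"

print(translate(first_codons))        # MSSQSLSVSG
print(translate(last_codons))         # QLCVIR*
print(expected_protein_length(1422))  # 473
```

Under these assumptions, the 1422 bp transcript corresponds to 473 residues plus one stop codon, and the translated ends agree with the start (MSSQSLSVSG) and end (QLCVIR) of DPOGS205825-PA.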