Monarch geneset OGS2.0

DPOGS211271
Transcript: DPOGS211271-TA; 1596 bp
Protein: DPOGS211271-PA; 531 aa
Genomic position: DPSCF300411 - 10195-12161
RNAseq coverage: 5x (Rank: top 88%)
Annotation
Heliconius: HMEL016616; 4e-15; 22.71%
Bombyx: BGIBMGA013162-TA; 2e-14; 26.20%
Drosophila: (none listed)
EBI UniRef50: UniRef50_UPI00021A83B8; 1e-66; 37.47%; UPI00021A83B8 related cluster n=3 Tax=unknown RepID=UPI00021A83B8
NCBI RefSeq: XP_001948286.1; 3e-49; 29.22%; PREDICTED: similar to blastopia polyprotein [Acyrthosiphon pisum]
NCBI nr blastp: gi|340727453; 4e-66; 37.47%; PREDICTED: hypothetical protein LOC100645895 [Bombus terrestris]
NCBI nr blastx: gi|340727453; 7e-71; 35.69%; PREDICTED: hypothetical protein LOC100645895 [Bombus terrestris]
Group
Gene Ontology: GO:0008270; 2.5e-10; zinc ion binding
               GO:0003676; 2.5e-10; nucleic acid binding
KEGG pathway: (none listed)
InterPro domain: [285-327] IPR013084; 2.5e-10; Zinc finger, CCHC retroviral-type
                 [335-444] IPR021109; 2.7e-10; Peptidase aspartic
                 [346-441] IPR018061; 8.8e-06; Peptidase A2A, retrovirus RVP subgroup
Orthology group: MCL18189; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211271-TA
ATGGCTGACGATGAAGAAGAGGAAGATGAAGCAGATGAGGAAGACGAGGCCGACGCCGAAGCCGACGGAGGTCATGAAGGGAACGACGCTGAAGAGGCCAACGCTATTCGACGTAGAGATGAGCGTGCAGGAGGCAGACACAACGATGAACGTCACCGAAATTCTGCCGCTGGCCGCCGAAGAATTAATTACGACGTTAACGACGACGATGCTGATGATGAGAGCATTCGTGACGCGACACCCGTGATTCGTAGACGAGAAAACGTATCACGTGTGCTTTTAACTTTTAGAGACGTGGGGGGTTCGTTGAGAAAATTCAACGGAGACAAACAGGAAAACGTTCGACGATGGCTGGAAGACTTTGAGGAGATGGCCTCATTGTGCCAGTGGAACGAGATCCAAAAAATTGCATATGCCAAACGACTGTTGGATGGCTCAGCTAGGTTGTTCGTAGAATATGAGCGTTGTGCGAAGACCTGGACTAAGCTAAAGGAGGCATTGACAGACGAGTTCGAGAAAACGGTGGATACTTTAAAAGTCCATCGAGAACTAACTAGAAGAAAGAAAAAGCCCGACGAAACATATCAGGAATACGCCTATAAAATGATGAAGATAGCCGATCAGGCAAGTTTCGATGCGCGAACCACCATTCGGTATATTATAGAGTGTATTCCTGATGACGCATCGAACAAACAGATGTTGTACGGTGCGAGAAGTATGCGGCAACTAAAAGAAAAGCTGGAGGAGTACGAGGAGATGAAGGACGCGATGCCGAAGTCGAAGATAAGTTCAGCAGACAAGAGAGATGACAAAGCAGTGCACGACAAGAAGAAGTTCGGAGGACCAGCGGCGAAAACTACGAAGCGTTGCTATAATTGCGGCGCTGAAGATCATATGAGTGCGGCGTGTCCTGCTAAGGAAAAGGGCAAAAAGTGCTTTAAGTGTGGTGAGTTCGGACACATTTCATCTGAATGCCAGGCTAAACCTAAAGATGTGTATACAGTGTCACGGCCGAAAAAGGAAAAATATCGGAAACTGGTGGAGATTAAGAATCGTAAGATATTAGTTGTAGTTGACACTGGCAGTGATTTAACCCTGATGCGTGAGAGTGTGTATGAGAGTTTGGGATCGCCTCGGTTGCAGAAGAAGCAGGTATGTTTTGAGGGTCTTGGGTCTGAAATGAATGAAACAATAGGCATGTTTACCGCTAACATGCGGGTAGAGAATGATGAGTTCACTGCCGATTTTCATGTCGTACCTGACACAATACTTAAATACTCTGTCCTGCTCGGTACAGACTTTCTGGACGAATTTGAATTGCGCGTAAGACGAGGAGAGGTAACATTTGTTAAGTTAGATGACCACGATGACACTAACGCTGTCGGATCAAAGCAACCCTATATTTTTAGTATTAATGCAGTTAATGAGGGACCTGAAGTAGACTTATCTCACATTGAAAACGAAGAATGTCGAATAGAAATTAATAAAATGGTCGAGACTTATAAACCGGTAGCAACTCGCGATATTGGTATACGAGCAAAAATTGTTTTAAAATCAGACCAAGTGTTGGCTCGTCGCCAGGATCTATTCCCTTGA

Protein sequence:

>DPOGS211271-PA
MADDEEEEDEADEEDEADAEADGGHEGNDAEEANAIRRRDERAGGRHNDERHRNSAAGRRRINYDVNDDDADDESIRDATPVIRRRENVSRVLLTFRDVGGSLRKFNGDKQENVRRWLEDFEEMASLCQWNEIQKIAYAKRLLDGSARLFVEYERCAKTWTKLKEALTDEFEKTVDTLKVHRELTRRKKKPDETYQEYAYKMMKIADQASFDARTTIRYIIECIPDDASNKQMLYGARSMRQLKEKLEEYEEMKDAMPKSKISSADKRDDKAVHDKKKFGGPAAKTTKRCYNCGAEDHMSAACPAKEKGKKCFKCGEFGHISSECQAKPKDVYTVSRPKKEKYRKLVEIKNRKILVVVDTGSDLTLMRESVYESLGSPRLQKKQVCFEGLGSEMNETIGMFTANMRVENDEFTADFHVVPDTILKYSVLLGTDFLDEFELRVRRGEVTFVKLDDHDDTNAVGSKQPYIFSINAVNEGPEVDLSHIENEECRIEINKMVETYKPVATRDIGIRAKIVLKSDQVLARRQDLFP-
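
Consistency check (illustrative, not part of the original record): the transcript and protein lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the transcript is coding sequence only (it starts with ATG and ends with the stop codon TGA, shown as "-" in the protein sequence), so the protein length should be the codon count minus one. The function name expected_protein_length is hypothetical.

```python
def expected_protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of the given length.

    Assumes the sequence is coding from its first base; a terminal stop
    codon, if present, does not contribute an amino acid.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: DPOGS211271-TA is 1596 bp, DPOGS211271-PA is 531 aa.
transcript_bp = 1596
protein_aa = 531

# 1596 / 3 = 532 codons; minus the stop codon gives 531 residues.
assert expected_protein_length(transcript_bp) == protein_aa
```

If the lengths did not agree, that would suggest untranslated regions in the transcript or a partial gene model, neither of which this record indicates.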