Monarch geneset OGS2.0

DPOGS212171
Transcript        DPOGS212171-TA    2118 bp
Protein           DPOGS212171-PA    705 aa
Genomic position  DPSCF300038 + 1016756-1020631
RNAseq coverage   209x (Rank: top 46%)
Annotation
Heliconius      HMEL012559        8e-142  55.58%
Bombyx          BGIBMGA006617-TA  6e-119  49.79%
Drosophila      CG4622-PA         3e-52   36.47%
EBI UniRef50    UniRef50_E1ZXA2   4e-75   35.26%  Zinc finger CCHC domain-containing protein 8 n=5 Tax=cellular organisms RepID=E1ZXA2_CAMFO
NCBI RefSeq     XP_001122586.1    1e-56   36.67%  PREDICTED: similar to CG4622-PA, isoform A [Apis mellifera]
NCBI nr blastp  gi|307189956      1e-74   35.26%  Zinc finger CCHC domain-containing protein 8 [Camponotus floridanus]
NCBI nr blastx  gi|322799013      1e-83   34.43%  hypothetical protein SINV_12227 [Solenopsis invicta]
Group
KEGG pathway 
InterPro domain  [278-324]  IPR006568  1.2e-16  PSP, proline-rich
Orthology group  MCL15831  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212171-TA
ATGGCTAAAAGAAAAGCAGCCGTCAATGATATAATATTCGAATTAGACAACGATGACATAGTTATAAGCAGCGATGAAGAAAGTAAGGTATCTAAAATGGGACGTTTTGATGGAGAAAACAGTCAAAACATCCTAAAGAAGAAGCTGAATAAAGAAGAAATAGTTTCTGATGTTATTAATTTAGACAGTCCTAATGAACCTGAACGAAATATTCCATCAACAAAGAAAACTGAGAATGGAGGGAAAAACAAAACCGAGGTAACCTCCAAACAAAACAATTCAAAACAAAATGACATATCTCAAAATAATGAAGTGATTGATATTACCGAAGAAAACAAGAAAAGCTGTACTAGTACCATTATATTGAATGAAGACATAGTGATTGATTCTCCATCTTCAAATATAGACCTTGGTGTTGTAGGATGCGAGAACAAAGCTCCATTGGTATCAATACGATTTAATGACAGTCTTACGGCACAGACTGATAATGACCTTGAAATATGGCCGGAAGACATAGTTGACGAAGAATTTAATTCAAAAGAAAATCCTGTTGAGGATAATCTTTTCTTCGTTGATACCACGCCTTGTAATGAAGGAAACAAAGATATACCTTTATATAAAGCTACTAAAATAATTTCCAATGACACCGAAAAAGAAACTATGTCACAACCCATCAAACGAGCTTTCTCTTGTTTCAATTGTGGGGATTCCCATCTTCTAAGAGACTGTCCTTTGCCACGAAATAATTCTAAGATAAATGAAAAACGAAAAGCTTTTACTCCTAAAGGCCGTTATCATGTTGAAAATGAGCAAAAATATGGTCACTTGATACCAGGGCGTATATCTGCAGATTTGCGTCATGCTCTGGGCTTAAAGCGTTATGAGTTACCGCTGCACATATACCGTATGAGATTACTCGGTTATCCTCCGGGATGGTTAGAGGATGCTAGGATATCACATTCAGGGATTACTCTGTTTGATTCAACTGGCCGAGCCACACTAGGTCCTGACGATGAAGAGGGGGAATTGTATGAGCCAGGGTCAAAGGATAAATTTGATATTAAAAAGATATTGGACTTCCCGGGTTTTAATGTTGCTGCGAGCTCAAGATATATTGAGGAAGCACATTTGTTTGGACTCCCTCCTATGTCGGAGCAAGACAGCAAGATAGCTATGTTAAAAATACTGGCTCCAAACGCTATGAAGGCGTACAAACGGAAGAAGTTATCATTCTTCCCGTCAGCTCTGACAAGCTCTACCCAAGAAGAGCAGGTTGAAATGGAACTTGATAGTGGCGATGAAGTGGCAGACTTTCCTCTGATACCACCTCTGCCGGATGATGAACCTCCCGCACCACCACCACCACCACCACAACCACAAGATACCAGACAACAAGAAGACCAGTCCAAACAAAGCATAGAAACCACCAAACCGATTGAATCAAATGATAAAAACCAAATTAATGATGAGAAACTTTCGGAAACGGATGTAGACAAAACAGAGTCGGCTCCTGCAAACAATGACTGTGACAAGTCCAATGACCTTGAAGTGATTGAGGTAGTAAGAGTGAATGATATTCCAATACCAGAAGACGATCTGATTGTCATTGACGATGACGACAAATCATCACTCAGCAGTGACAGAAATAGCCCGAGTCTGGCTGACTTAGAAAAACGGAAGCAAAAGCTCTTGGATGCTCTCAAAGGAGATTCAATTTCTATGACAGAGGTCTCAGTTGAGAGTATCGATACGTATGACACGAGCGAGGAAAAAAACAGTGAAGTGAACAAGTGTGAAGTTGATAATATAACACTCTCCACAGAAGACAGTAGCTGTGACGGAGTGCAGAACAAAACGCTTAAGGAAGAAAGTGCAACAGACTCGGACCAAGTACCCTCGACCACTAAGACGGGTCAGATAAAAAATACCCATTATGGCACACCCGTGATAAACGTCGCGTCGCCGTATCACAAACTGCCCAGTGACGTTAATTTCGCTAA
GGACATATGTGACGTCATAAACTTTGAGAATCTACCCAATTCAGTTGGCAAATACAAAAAAATATGCACATTGCTCAAGAAGGTTAAATCGGAAGTGGATAGGATACAGGAATCGTGA

Protein sequence:

>DPOGS212171-PA
MAKRKAAVNDIIFELDNDDIVISSDEESKVSKMGRFDGENSQNILKKKLNKEEIVSDVINLDSPNEPERNIPSTKKTENGGKNKTEVTSKQNNSKQNDISQNNEVIDITEENKKSCTSTIILNEDIVIDSPSSNIDLGVVGCENKAPLVSIRFNDSLTAQTDNDLEIWPEDIVDEEFNSKENPVEDNLFFVDTTPCNEGNKDIPLYKATKIISNDTEKETMSQPIKRAFSCFNCGDSHLLRDCPLPRNNSKINEKRKAFTPKGRYHVENEQKYGHLIPGRISADLRHALGLKRYELPLHIYRMRLLGYPPGWLEDARISHSGITLFDSTGRATLGPDDEEGELYEPGSKDKFDIKKILDFPGFNVAASSRYIEEAHLFGLPPMSEQDSKIAMLKILAPNAMKAYKRKKLSFFPSALTSSTQEEQVEMELDSGDEVADFPLIPPLPDDEPPAPPPPPPQPQDTRQQEDQSKQSIETTKPIESNDKNQINDEKLSETDVDKTESAPANNDCDKSNDLEVIEVVRVNDIPIPEDDLIVIDDDDKSSLSSDRNSPSLADLEKRKQKLLDALKGDSISMTEVSVESIDTYDTSEEKNSEVNKCEVDNITLSTEDSSCDGVQNKTLKEESATDSDQVPSTTKTGQIKNTHYGTPVINVASPYHKLPSDVNFAKDICDVINFENLPNSVGKYKKICTLLKKVKSEVDRIQES-