Monarch geneset OGS2.0

DPOGS215417
TranscriptDPOGS215417-TA2997 bp
ProteinDPOGS215417-PA998 aa
Genomic positionDPSCF300088 + 730742-740524
RNAseq coverage1870x (Rank: top 7%)
Annotation
HeliconiusHMEL0174420.070.81% 
BombyxBGIBMGA012388-TA0.063.72% 
DrosophilaZn72D-PB5e-14642.86% 
EBI UniRef50UniRef50_D6X5414e-17248.74%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6X541_TRICA
NCBI RefSeqXP_971611.28e-17348.74%PREDICTED: similar to AGAP001905-PA [Tribolium castaneum]
NCBI nr blastpgi|2700007322e-17148.74%hypothetical protein TcasGA2_TC004366 [Tribolium castaneum]
NCBI nr blastxgi|2700007320.051.30%hypothetical protein TcasGA2_TC004366 [Tribolium castaneum]
Group
Gene OntologyGO:00082702.1e-07zinc ion binding
GO:00036762.1e-07nucleic acid binding
KEGG pathway 
InterPro domain[628-950] IPR0065613.1e-83DZF
[227-261] IPR0036042.1e-07Zinc finger, U1-type
[420-445] IPR0227551.5e-06Zinc finger, double-stranded RNA binding
Orthology groupMCL11296 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215417-TA
ATGATGGCTGCAAATAATTATTTTGGGTTTACCCATGGAGGGACCCAATATGGTGCGGCCACAGCTAGCGCAGCTTACGGCGGCCAAACGGGGTACGCCGTGGCGCCTGCTGCGACCGCCGCGACGTACGGAACACAACGTGCCGCAGCTACAGGATATGATACTGCGTACCAGGCAGCAGCCACGCAGGCGGCCCATGCCCACGCCCACGCTCACGCGGCTGCTGCGGCGGCGTCGGCGTATGACGCGAGCAAGTCTGCGTACTACCAGCAAGCAGCGGCCGCCTACCCCGCCGCTGCTCCCCCACAACCACAACCCACATATGATGCGGCGGCCAAGCCGGCTTACTCTACACCCGCAACATATGCGCAGCCGTTGGTGACTCAGTATGGCCAGGGCGGTGCTCGCGGGGGAGGCACCAAGGCGGCGTACGGCAGCGTGTACACCGCCACCACCGCCGCGTCCTACCCCAGCCAGCCCTACACCCAGCCCGCGGCACAACCCGCCAAGCAGACGGCTAACAGGGGCACCTCTGCTTATGATACAGCTCTCTACAACGCGGCGACCATGTATGTGGCCCAGCAGAACAAAACCGGAGGCGGAGGTGGCTGGAAGAACTATAAGGCCGGCGGAGCGGGTGCGGGCCAGGCGCGTCGCCCCAAGCCGCCTCCTAAGGCGCAACAGCTCCACTACTGCGACGTGTGCCGCATCTCGTGCGCCGGTCCTCAGACATACAAGGAACATCTCGAGGGACAGAAGCACAAGAAGAAGGAGGCGGCCGTTAAGCTGGCGGCGGCGGGTGGCGGAGGCGGCGCGCGGGCCTGCGGGGCCTCGGCGCTGCGCTGCGAGCTGTGCGACGTGACGTGCACCGGCGCGGACGCCTACGCCGCGCACGTGCGGGGCATCAAGCACCAGAAAGTGGTGAAGCTGCACACCATGCTGGGCAAGCCCATCCCCTCCACCGAGCCTACCAAGCTGCAGCCCGCTCAAAGTAGAAAAGGCCCATATACAAGTGACATCGACGATTTGAGAGAAAATTCGACTTTCCCACCGCTAGCCAAAAAGACGGTGGCCGGCGCGCCCAAGATCGCGTTCGTGGCGTCCGGGGGACTCAGCACCGTGGCCGCCGCTGACGCTCGCTCGCCCGCAGACAAGGACGACGACAAGGACGACGAGCCCGAGGCCGAACCCGAGGTGCAGCCCGTGGGGCAAGACTACATCGAGGAGATCCGGGGCGACGACGGCAAGGCTCTCAGCTTCAACTGCAAGCTGTGCGACTGCAAGTTCAACGACCCCAACGCCAAGGAGATGCACATGAAGGGGCGCCGGCACCGGCTGCAGTACAAGAAGAAGGTACAACCCGACCTGGAGGTGAAAGTGAAGCCGTCGATGCATCAGCGGAAGCTGGCCGAGGCGAAGGCGCAGCGCATGTTGGTGCGTGACGAGATATGGGCCAGGCGACGCATGCACGATGTACGGGACGAGGACGAGCGACTGTACTGGGAGGGAGCGGACTGGTGGAGGACGCCGCACCATCATCACCACCATGGCATGTCTCTGGGCTACGGGCCAGTGGGGCGGCGGCCGGAGACGTCGGACGACCGGCACGTTCTCGCCAAACACGCCGACATCTACCCCTCGGAGGCCCAGCTGCAGGACATACAGCGAGCCGTCTCTCACACCGAGAAGGCGCTCAAGTCGCTGTCGGACGCGCTCGCTGAACAGGCACGTCCTAAGGGTCAGCCATCAAAATCACAAATCAAGCAGGAAGATAAGAAGGAGGAGAAGGTGGAGGGGAAGGAAGACGGACGAGACAACCAACTGTTCTCGTTCGTGGCCGAGGGCGAGGCGGCTCCGTGCGCGACCTCTGGCGCCGCGCGCGCTTTGAAGGGAGTGATGCGAGTGGGTCTGCTGGCCAAGGGTCTGCTGCTGCGAGGAGACAAGGACGTGCGTCTCGTGGTGCTCTGCCACGACCGACCCACCGTCACGCTGCTCAAGAGGGTCGCCGCCGACCTGCCTGCGCATCTCAACAAGGTCAAGGGTCCGTCTGAGGAGCCTCGTTACAAGGTGGAGCTGCAGGCGGCCGAAGGAGCCGTGCAGGTGTCAGATGGGAACGTGGCCGTCAGGGTGTCGCTGACCTCCGCCGTCATGAGGGAACCGGCCGAGGGTGGCGACATAAAGCGAGACGACAAGGATGTGCTACCGCGGCAGAAGTGTTTGGACGCGTTGGCCGCCCTTCGGCACGCCAAGTGGTTCCAGATCGTCCGCCTACTACCGAAGAAACGAAACTTAAATAGAGGAAAGGAACGATCAGACAGACGTTTGAAGTCATTAGAGATAACTATAAGTAAGAGCGCAAGTTTCCTTTCTTCGTTAATACACGGACGGAAAGAGCAGAGCCAGCCGGCCCCCCGGCTACGGGGTGAGGGTGTAGGGTGTGGGCGGGTGAGGGTGGAGGGTGAGAGTGACGCGAGGGCTAGGGCGGCCAGTCTGCAGTCGTGCGTCATCATCATCCGCATCATGCGGGACCTCTGCCGCCGTGTACCCAACTGGACGCCGCTCAACCCATACGCGATGGAGTTGCTGGTGTCGGGTGTGATGCAGTCGGCAGGGGGCGCGTTGTCCCCGGGCGAGGCGCTCCGCCGGGTGTTGGAGGCGGTTGCGGGGGGCCTGCTGCTGGAACACGGCCCCGGCCTGAGGGATCCCTGCGAGAAGGATCTCGTTGACGCGCTGGGTAACTTGCCACCACAGAAGAGAGAAGACCTCACAGCATCAGCGCAACAGTTCCTGAGAATGATCGCCTTCAGACAGATACACAAAGTGTTGGAGATAGAACCTCTTCCCAAGTTGAAACACACCACGGGCTGGAAGTTCCCCCGCAAGCGACGCCGCTCCGCCGCCGACAACGAGTCAGACGCACCTAACGGTGAAGGCAAAGTAGTGAAGACGGAGGAAAAGGCGGACGCCACAGAGACGAGCGTCGCCGCCAAGAAATAA

Protein sequence:

>DPOGS215417-PA
MMAANNYFGFTHGGTQYGAATASAAYGGQTGYAVAPAATAATYGTQRAAATGYDTAYQAAATQAAHAHAHAHAAAAAASAYDASKSAYYQQAAAAYPAAAPPQPQPTYDAAAKPAYSTPATYAQPLVTQYGQGGARGGGTKAAYGSVYTATTAASYPSQPYTQPAAQPAKQTANRGTSAYDTALYNAATMYVAQQNKTGGGGGWKNYKAGGAGAGQARRPKPPPKAQQLHYCDVCRISCAGPQTYKEHLEGQKHKKKEAAVKLAAAGGGGGARACGASALRCELCDVTCTGADAYAAHVRGIKHQKVVKLHTMLGKPIPSTEPTKLQPAQSRKGPYTSDIDDLRENSTFPPLAKKTVAGAPKIAFVASGGLSTVAAADARSPADKDDDKDDEPEAEPEVQPVGQDYIEEIRGDDGKALSFNCKLCDCKFNDPNAKEMHMKGRRHRLQYKKKVQPDLEVKVKPSMHQRKLAEAKAQRMLVRDEIWARRRMHDVRDEDERLYWEGADWWRTPHHHHHHGMSLGYGPVGRRPETSDDRHVLAKHADIYPSEAQLQDIQRAVSHTEKALKSLSDALAEQARPKGQPSKSQIKQEDKKEEKVEGKEDGRDNQLFSFVAEGEAAPCATSGAARALKGVMRVGLLAKGLLLRGDKDVRLVVLCHDRPTVTLLKRVAADLPAHLNKVKGPSEEPRYKVELQAAEGAVQVSDGNVAVRVSLTSAVMREPAEGGDIKRDDKDVLPRQKCLDALAALRHAKWFQIVRLLPKKRNLNRGKERSDRRLKSLEITISKSASFLSSLIHGRKEQSQPAPRLRGEGVGCGRVRVEGESDARARAASLQSCVIIIRIMRDLCRRVPNWTPLNPYAMELLVSGVMQSAGGALSPGEALRRVLEAVAGGLLLEHGPGLRDPCEKDLVDALGNLPPQKREDLTASAQQFLRMIAFRQIHKVLEIEPLPKLKHTTGWKFPRKRRRSAADNESDAPNGEGKVVKTEEKADATETSVAAKK-