Monarch geneset OGS2.0

DPOGS202650
Transcript        DPOGS202650-TA  3603 bp
Protein           DPOGS202650-PA  1200 aa
Genomic position  DPSCF300039 - 287540-297632
RNAseq coverage   469x (Rank: top 26%)
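
The lengths listed above are mutually consistent if the 3603 bp transcript is read as a coding sequence that includes the stop codon: 3603 / 3 = 1201 codons, that is, 1200 residues plus one stop. A minimal Python sketch of that check follows. The function names are hypothetical, and the span calculation assumes the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
def expected_protein_length(cds_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by a coding sequence of cds_bp bases."""
    if cds_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bp // 3
    # Drop one codon when the sequence ends with a stop codon.
    return codons - 1 if includes_stop else codons


def genomic_span(start: int, end: int) -> int:
    """Length of a locus, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    print(expected_protein_length(3603))   # residues expected from the transcript
    print(genomic_span(287540, 297632))    # bases covered on scaffold DPSCF300039
```

Under the same coordinate assumption the locus covers more bases than the 3603 bp transcript, which would be consistent with a gene model that contains introns.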
Annotation
Heliconius      HMEL007954        0.0    67.91%
Bombyx          BGIBMGA000848-TA  0.0    63.03%
Drosophila      jing-PI           9e-59  42.91%
EBI UniRef50    UniRef50_F4WE75   3e-69  47.54%  Zinc finger protein jing-like protein n=6 Tax=Formicidae RepID=F4WE75_ACREC
NCBI RefSeq     XP_392199.3       3e-70  49.22%  PREDICTED: similar to jing CG9403-PA, isoform A [Apis mellifera]
NCBI nr blastp  gi|340728680      6e-72  48.70%  PREDICTED: hypothetical protein LOC100651060 [Bombus terrestris]
NCBI nr blastx  gi|340728680      2e-70  43.47%  PREDICTED: hypothetical protein LOC100651060 [Bombus terrestris]
Group
Gene Ontology    GO:0003676           8e-08  nucleic acid binding
KEGG pathway
InterPro domain  [930-970] IPR013087  8e-08  Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group  MCL26525  Lepidoptera specific
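
The InterPro entry above gives a residue range ([930-970]) on the protein. The sketch below shows one way to cut such a range out of a protein string and to scan for the commonly cited C2H2 consensus, C-x(2,4)-C-x(12)-H-x(3,5)-H. It is an illustration only: the helper names are hypothetical, the range is assumed to be 1-based and inclusive, and the regular expression is a simplification of the profile InterPro actually uses, so its hits need not coincide with the annotated range.

```python
import re

# Simplified C2H2 consensus: C-x(2,4)-C-x(12)-H-x(3,5)-H (an approximation).
C2H2 = re.compile(r"C.{2,4}C.{12}H.{3,5}H")


def domain_slice(protein: str, start: int, end: int) -> str:
    """Return residues start..end, ASSUMING 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError("range falls outside the protein")
    return protein[start - 1:end]


def find_c2h2(protein: str):
    """List (start, end, match) for non-overlapping hits, 1-based inclusive."""
    return [(m.start() + 1, m.end(), m.group()) for m in C2H2.finditer(protein)]


if __name__ == "__main__":
    # Short fragment copied from the DPOGS202650-PA sequence on this page.
    fragment = "FKCIVDSCDRRFSTQILLERHVNNHF"
    print(find_c2h2(fragment))
```

To apply it to the full record, pass the complete DPOGS202650-PA sequence (without the trailing "-") to `domain_slice(protein, 930, 970)` and `find_c2h2(protein)`.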
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202650-TA
ATGAATGACTCCGCTATGGTTCACGCATTTGATTGTACTACGGCGAGTGGCGTGGCGGTGCTGAGTCCTACGGAAAGCAGTGGGTGCAGCGGTAGTGACATCACTGAGCCTCGCTCGCCCCGGAGCCCCGAGTCGGGGTGCAGCACGAGTGAAGAAGGCGCTGGCGTGCGCATGCCGCCTTGGGCGAGCGATCCGGCGCCGCCGGGCCCGAGACGCCTCGCGGCTCAACCCGCGCCCCTCCGGCGACCCAATCCACCTATACATCGCCGCATAACAGAATACTTCAATCATAAAATGAAGCCACAAAACGGCGTCAAGCGTGACTCGAGTTTAATTAGTAACGATGACAGCAAGAAAAAGTGTCTCCCTAAGAATGGTGTCACGCATGATCTCGAAAAGTATATTTCGTATTTATCTCAGACGCTGAATCCAAAAGTTCTAATTGATGTCGAAAAACCAGCAGACTCTATGAAAAAGCCTGTCAGTCAAGCAACAAATTTCAATACAGATGTAACTAAAAATGGTCTACATAAAACAGCTTCATCTACTAATATCAATACTTTAAGAGATCGTATCAAACCTAAAGACAATAAAGAGACTCACACAATGAACGGTTTGGTAGATATCCGTCACGGCACTGAGTCGTCCAACAAAGAAAAAAGTTTTGAGGGCTTCTTTAATGGGGTAACTGTCAGCAGAATACCTAACCCTAAAGTAAATCAAGGACCATCCGCTATAACACCAGTAAACCATATTAACAAATGTAAAATTCCAATTCCATCAGTTAACCAATCACCCAAAAAACAAGTGAATCATAAACCTGATACAACACATCTAAATGGAGTCGTGAGTTCCACGACTAAGCTAGACAGTCGAATGAACGGGGTTACCTGTGGGTCTAACAACTTAAATCTTATGAACAAACCATCTACAGTAAAAAAGGAACCTGTTAAGAGAAATTTACAACCTAAGAGAACCAATCGCAAATCATCAGCTTGTATAGCTAATAGCAAGAACTTGACTTGGTCGAAGGCTAATAATCTTAAACAGGTACAGTTGCAGAAACAAAATTGTAGTAAAATTATTCCAAACACGCCTGTTAATAACAATGTACCAGGTGAAGCAAGCACTGCGCCAAAGATGCAAATCCCAAATGGAGTAAGACAAGTATCGATGGCAGTCAACGGAAATATAAGCAATATAAATGTGCCTGTGACTAATTTTCAAAGTTGTCATTCATTAAATATTCCACAACCCTGTAATGGGACGGTAACCTCCACCAATTTGGCTTCGTTCGTCGCTGTTCCTCATAACGTAATGCTTACAACACTTCGAATACCACAAAACGGTCTACCTCCGCAGACATTAAACCAACAAGGTAATATGGCATTTAATCGTCAAAACGTCAACGTGACAAGTCCGACTAAATTAAACGGTCAATTCGTTTTCCCGTTGGTTCAGAATATTAATGGAACTGTCGTTCAAATACCAAATTTAATGGCCAAGATGCCGAATTTCGTTCTTCCGCAAACTCCTCAAATAACAACGCAGAGACAAGATCATCTACAAAACCAAACTTTAAACCAATTTCTCATTAATGGTACTTTATTGAAATTGGCAAACACGATGCCACCTACCTATACAACGGCGAACAAATCTGGTCCAGTCACTAGTCAGCCACTACCAGTAATTAACAAAAATGTGCCTGGTCTTCAAGCTGTCGGAATGTCAGTTCAACCAAAACAGAACTTCACAGTGAACCTTAGCCATCCGATGCTAGTGCCTCAACCAGGATTTATTGTAACATCTGTGCCAAATATGAGTGTGAAAGAAACGTGGAGTTATACCGGTACCCATTCGGTTGCCTCGTCACAGATATCGTGTTCGAGTCCAATAGTTCCACAACAGCCCGCGATGGTTGCCAATTTTATGAATCAAAATCTCCACTCGAGTAGTATTCCTATAGCCCCCGTGAAGCCACCGGACAATCCGATTCAAGG
TACAGAGCCGCCCAGCAGTACTGTTAAAGAGACGGAAATTAATGAAATTGCCAATCAAAAGTGTGATATTCCATGGCTGTCAAAAACTGTGAACAATAATATTGTTCCTTTAGATACTAATAAATTAACGAGGCCTTTTGCACAAGTTAGTAGTTCAGGGAATCTCAAAGATTTCAGCATAGAGAAGGTTGGAAAGAAACGAAGAGCGAGTAAAGATTTACAGAAGGATTTTAAAGTTGGTAGTACGAGTGAAATAAGAAAAGTAGAAGAAGTCGCCAGAAGAGAAGAGACAATATTTACACAAATAACCGTATTGAAGGATGTTTCGTCTAATGTCCAAAGTTCTGTTATAGAAGCAAATTTCTGTAGTGCTACTACAGAGTCTTCAGAGTCCGGTATAGGAACCGATAAGTCTATTGACTCGCCCAGCGACTCCCAGGCTAGCAAAGAGTGTGACGATGACTCCTCCTTATCATTAAGTATCAGTGTGAGTTCCGTCGAAGCTCAGAGTAGTCAAAAAAGTCCTATACTAAAACAACCTAAAACATTACGGTTTCCACCAAGAAGTATTATCAAACCTCAAAGCGATAAGAGAAAATCTAGCACTGATACGACGTCGGTCACGATATGCTTATGGGAGAAGTGTAAGCAAGAGTTAGAGAGTGATAGCGATCTCCTGGAACACCTGCAGTCTGTCCACGTGGAGTCTCAAGCTGGTCAAGAGAACTACGTATGTCTGTGGGAGCGCTGCAAGGTCCGCGGCAAACCTTCCTGCTCTCGGCTATGGCTGGAACGACACGCTCTCTCCCACGGCGGAAACAAACCGTTCAAGTGCATCGTTGACAGCTGCGACAGGAGATTCTCTACTCAGATATTACTCGAACGTCACGTAAACAATCACTTCAACGAGGCGACGCCGAACACGGGGAACGGGAAGAAGACCGCAGACAGCTCGGCGAAGATGATACGACGGAATGGAAAGAAACTCAGATACAGAAGACAACCGTGGTCTGCCCGCATGTTCGACTTCTTCGACTCCGGCATGATGGAGGGTCTAGCGTGGCGGCTCGCCCGCAGTACTCGCTGGCGGTTGGGTGGGTTCTGCCCACTGAGGGACGGCGGCGGACACACGCTCACACTGCACGCCGCGCTCACAGCGACCAGATACAACCCGGCCACCGCCGCGCGCGAGGCGCTACTGACGTACTACCCGCCGCATGTTATCGAGGACGAGTGGGTGCCCGAGGAGGAAGTGAAACGGGTCAAAAGGGTGGAGATCTCCGAGCTGCCGGTCCACACCAAGGTGCTGCTGTACGAGGAGTTCTGCAACTCGTACAGGAAGACTCCGCCGCCGCCGCCCAAGAAGACGCAGGCTGCGGCGCAACCCGTCCGCCTGCTTCCGGCCAGGCAGTGCACATCGTCACGCCTGCTCAACAACCTGATAGCGAAGCAGCGCAAGGAGAGAAACGACTGCAGCTCCACCGGCCTGCCGCCGCTGTTCGACGACGACAGAGACAGAGAGAAGAGCGAGGTCAGCGGCGCCAAGAAGAGGAAGATGGGCAGGAGGTACGGCGGGAACGCGTTCTGGCCCTGCGGACGATAG

Protein sequence:

>DPOGS202650-PA
MNDSAMVHAFDCTTASGVAVLSPTESSGCSGSDITEPRSPRSPESGCSTSEEGAGVRMPPWASDPAPPGPRRLAAQPAPLRRPNPPIHRRITEYFNHKMKPQNGVKRDSSLISNDDSKKKCLPKNGVTHDLEKYISYLSQTLNPKVLIDVEKPADSMKKPVSQATNFNTDVTKNGLHKTASSTNINTLRDRIKPKDNKETHTMNGLVDIRHGTESSNKEKSFEGFFNGVTVSRIPNPKVNQGPSAITPVNHINKCKIPIPSVNQSPKKQVNHKPDTTHLNGVVSSTTKLDSRMNGVTCGSNNLNLMNKPSTVKKEPVKRNLQPKRTNRKSSACIANSKNLTWSKANNLKQVQLQKQNCSKIIPNTPVNNNVPGEASTAPKMQIPNGVRQVSMAVNGNISNINVPVTNFQSCHSLNIPQPCNGTVTSTNLASFVAVPHNVMLTTLRIPQNGLPPQTLNQQGNMAFNRQNVNVTSPTKLNGQFVFPLVQNINGTVVQIPNLMAKMPNFVLPQTPQITTQRQDHLQNQTLNQFLINGTLLKLANTMPPTYTTANKSGPVTSQPLPVINKNVPGLQAVGMSVQPKQNFTVNLSHPMLVPQPGFIVTSVPNMSVKETWSYTGTHSVASSQISCSSPIVPQQPAMVANFMNQNLHSSSIPIAPVKPPDNPIQGTEPPSSTVKETEINEIANQKCDIPWLSKTVNNNIVPLDTNKLTRPFAQVSSSGNLKDFSIEKVGKKRRASKDLQKDFKVGSTSEIRKVEEVARREETIFTQITVLKDVSSNVQSSVIEANFCSATTESSESGIGTDKSIDSPSDSQASKECDDDSSLSLSISVSSVEAQSSQKSPILKQPKTLRFPPRSIIKPQSDKRKSSTDTTSVTICLWEKCKQELESDSDLLEHLQSVHVESQAGQENYVCLWERCKVRGKPSCSRLWLERHALSHGGNKPFKCIVDSCDRRFSTQILLERHVNNHFNEATPNTGNGKKTADSSAKMIRRNGKKLRYRRQPWSARMFDFFDSGMMEGLAWRLARSTRWRLGGFCPLRDGGGHTLTLHAALTATRYNPATAAREALLTYYPPHVIEDEWVPEEEVKRVKRVEISELPVHTKVLLYEEFCNSYRKTPPPPPKKTQAAAQPVRLLPARQCTSSRLLNNLIAKQRKERNDCSSTGLPPLFDDDRDREKSEVSGAKKRKMGRRYGGNAFWPCGR-
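
The protein record above can be reproduced from the nucleotide record by codon-wise translation. The sketch below assumes the standard nuclear genetic code and writes the stop codon as "-", matching the trailing character of the protein sequence on this page. The `translate` helper is hypothetical and strips whitespace first, since the nucleotide sequence is wrapped over more than one line.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1; stops become stop_symbol."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    # First and last codons of DPOGS202650-TA as listed on this page.
    print(translate("ATGAATGACTCCGCTATGGTTCAC"))
    print(translate("TGCGGACGATAG"))
```

Translating the full DPOGS202650-TA sequence this way and comparing the result with DPOGS202650-PA is a quick integrity check after copying either record.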