Monarch geneset OGS2.0

DPOGS212848
TranscriptDPOGS212848-TA2964 bp
ProteinDPOGS212848-PA987 aa
Genomic positionDPSCF300086 + 223324-227166
RNAseq coverage68x (Rank: top 67%)
Annotation
HeliconiusHMEL0101243e-17739.22% 
BombyxBGIBMGA000792-TA0.044.96% 
Drosophilacrol-PE3e-5031.87% 
EBI UniRef50UniRef50_E7F8Z93e-9127.05%Uncharacterized protein n=61 Tax=Danio rerio RepID=E7F8Z9_DANRE
NCBI RefSeqXP_001950800.18e-8830.50%PREDICTED: similar to mCG142610 [Acyrthosiphon pisum]
NCBI nr blastpgi|3266672554e-9329.61%PREDICTED: zinc finger protein 91-like [Danio rerio]
NCBI nr blastxgi|3266672556e-10929.61%PREDICTED: zinc finger protein 91-like [Danio rerio]
Group
Gene OntologyGO:00036762.4e-15nucleic acid binding
GO:00056341.2e-10nucleus
GO:00082701.2e-10zinc ion binding
GO:00056226e-06intracellular
KEGG pathway 
InterPro domain[672-700] IPR0130872.4e-15Zinc finger, C2H2-type/integrase, DNA-binding
[31-104] IPR0129341.2e-10Zinc finger, AD-type
[653-675] IPR0070876e-06Zinc finger, C2H2
Orthology groupMCL19019 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212848-TA
ATGAACGGTGGAACCGCTTCAGACCAGCTAGACGAACTTGATATTAAATGCAGCGGCGAAACTCATGAGAACCTGCAGGATTCTCCAAATATATGTCGAATATGTGCCACCGTCACAGACCTGGTCATTCCTATATTTGAAGGAGAAGGACTACAAAACAACTTAGCAGAAAAAATACATAAACATTTACCTATAAAGGTGTCTGTGCAGGATGTGCTGCCCCAGGTGGTGTGCTACCAGTGCTCCAGTACTTTGCTCGCCTGGCATGAGCTGGTGCAATGCTGTCAACAAGCGGACCAAGCTCTCCGACAGCAGGCAGCCCATAGGGAGAGGAAAGCAAACGAAGTAACAGGAACAACATCAGATCCAAAGGCTTTGAGCTTGTTAACATCCTCAGTGCGAGGTGTGTTGAGTGATTACTGTCAAATGCTTAATATAAATCAGGAAACTTCTGACATCTGTTATGTCTGTCAAGAATGTGACGGACATCCCGTCTCAAGCTCTATAGAAAATTTATCAGAGCATCTGCAATTGGTACACAAGGAAGTCCTCATCTCTGATGAATTATTTGGTAGCGATGATCTCGGCGAGCCTGCTCCTCAGAAAGAGTTGCCAAACTATTCCTGTCCTTTCTGTGAGAGCATGTTTTCTTCACCCACTAGACTCATTTTCCACCTGAACTGTCATCTCGAGGTCTGTATCGACGACGGAGTGTACTGTTGTGACCAACTGTTCGATAATAAAACATCTTTCGTCAGTCACCTACAATCTCGACACGTTCGTAAAGTTATCGAATCGTCTTACGTGTGCAAGAGCTGTGGCCTCACAGCCGGCGACCTGGCCGAGCTCCAGAAACATATTAACGATAATCATCCCGAAGCGGAGGACAGATATGAGAAGGGCAAGACAGAAGGGAGTCCCAAATGTCAGAAATTTATTCCCGCCGTGTGTTCCGAGTGCAACAAAACTTTCTCTAACAAGTACAACATGCTGGTGCACATGAGGAACCATTTCGGACCAGCGAGTCGGTTCGCGTGCGGCAAGTGCAACAAGACTTACAAGAGCCAAGGCAGCCTCATATACCACCATAAGGTCGTCCACGAGGGACAGCTCAAGTTCGTGTGCTCGTCGTGCGGAGAGGCCTTCCCGTCACGAGCGGCGAGAGACGTACACGCGCGCCTCCACACTGGTCAGAGACCTTTTTCATGCCAATACTGTGGGAAGGCCTATCGAGCTAAGAACACTCTATACCGACATATAGACATGCATCTGAACATAAGAAAATATGCCTGCAACTTTTGTGATCGAAAGTTCCGAAAGAGTACACACCTTAAGTGTCACTTGCGAACGCACGAGAGGACGACATGGATCCCATCTCCCGACCCATACTTCTTGAAGAGGAGAGAATTTAAAATTGAACTCAATGAACCGACACACAGTAATGATACTGAGCCAGTCACAGCTCATCGCAGTCGCGGATCGGATTCCGACGAAGACGACAAACCTCTCGCTGAGTTCGCGGCCGGCAAGCCGTCAGATATATACAGAAACTTTTATAGAGCGTTAACAAAGTTTCGAGATCACTACGTTCAACATGAAATTAAGACAGATCGCTGCTCAGACTTAGCGAACTCGAGCGATTCGGAGGAGGAAAGAAACTGTGAAGACCTGGATCCGACTCAGTACGACGACCTCTCTAACAGCAACATGAGGAAAGACAAAATGAACGAGGAAACGCGACTCGAGCTCAGCCAGGTGCAGACGAAAATAAACGGGAAGACTTACTTCATCTGCAAAATCTGTGACAAGAAGCTGAGCTCGTCACACACTTATATTTTCCACAAGAGAATACACACGGGGGAGCGACCGTGCATCTGTCACGTGTGCGGTAAACAGTTCCGCGCGCCCAACGGACTCCAGCGACATCTCACCGAGACGCACGAACGACTGCGCCGGTACACATGCCAGATTTGCCACAAATCCTTCGCGAACTCGCAAAACCTCAAACAACACATGAGAATACACACGGGTGAGAGACCTTTCGTGTGCTCCCACTGTGGTAAGCGATTCACACAGAGTGGCTCGCTACATGTGCACCTCAAGACCCACAGTGCCACGCTCCCGCACGCTTGCCGGGACTGCGGCGCGAAGTTTCGTATGCGCTCCGGACTGACGCGGCACCGCCTCAAACACACCGGAGAGAGGCCGCACGTCTGCCGGCATTGCGGAAAAGGATTCAGACAAAAACATGAAATGAATGCGCACGCGCTCACGCACTCGGACAGCAAGCCGCACGTGTGCACCGTCTGCGGAACCGCCTTCCGCCAACGACGAGCGCTCCGCCATCACTGCAAACGACTACATGACAGCAAGCCCGCGGAAGACGCGCACGGCTACAACAACGCGATCAATTACATTCTGACATTGAACTCCAACAGCGAGACTAGCGAGTGCAGCATTTGCGGCAAGTCGGTGCCTCGCGCGAGTAAAGCGCGGCATAGACGTGCGCACGAAGCAGCGGGCACCCAACGCTACCGCTGCAGCGTGTGCGGGTGTGCCTTCTCAGACGGCGGCAACCTCGCTCGCCACGTTCGTGCCCTGCACGCCGCGCGTCGACCTCACGCGTGCCCACTCTGCCGCCGGACTTTCACACGCGCCGCTCACCTTGCCGACCACTTGCGTTCACACGACGATCGCAGGGATTACGTGTGCCACGTGTGCGGGAAAGCGTCTAAAACCGGCGCCGGACTGCGCTCACACCGCCGCGTGCACGCTGAAGAGTTCGAATTCGAGTGCCCGGCGTGTTCGGCGCGTTTTAAAACAGGCCGTCAGTTGCGCGCGCACGCCTCCGTACACACGGGTGAAAGGCCATACGCCTGCACCTGTGGAGCCGCCTTCCGCCTGCGCGCTCAGCTCACCAGACACGAGCGGACTCACACACGGACGAAAACAACCGCCGACTGA

Protein sequence:

>DPOGS212848-PA
MNGGTASDQLDELDIKCSGETHENLQDSPNICRICATVTDLVIPIFEGEGLQNNLAEKIHKHLPIKVSVQDVLPQVVCYQCSSTLLAWHELVQCCQQADQALRQQAAHRERKANEVTGTTSDPKALSLLTSSVRGVLSDYCQMLNINQETSDICYVCQECDGHPVSSSIENLSEHLQLVHKEVLISDELFGSDDLGEPAPQKELPNYSCPFCESMFSSPTRLIFHLNCHLEVCIDDGVYCCDQLFDNKTSFVSHLQSRHVRKVIESSYVCKSCGLTAGDLAELQKHINDNHPEAEDRYEKGKTEGSPKCQKFIPAVCSECNKTFSNKYNMLVHMRNHFGPASRFACGKCNKTYKSQGSLIYHHKVVHEGQLKFVCSSCGEAFPSRAARDVHARLHTGQRPFSCQYCGKAYRAKNTLYRHIDMHLNIRKYACNFCDRKFRKSTHLKCHLRTHERTTWIPSPDPYFLKRREFKIELNEPTHSNDTEPVTAHRSRGSDSDEDDKPLAEFAAGKPSDIYRNFYRALTKFRDHYVQHEIKTDRCSDLANSSDSEEERNCEDLDPTQYDDLSNSNMRKDKMNEETRLELSQVQTKINGKTYFICKICDKKLSSSHTYIFHKRIHTGERPCICHVCGKQFRAPNGLQRHLTETHERLRRYTCQICHKSFANSQNLKQHMRIHTGERPFVCSHCGKRFTQSGSLHVHLKTHSATLPHACRDCGAKFRMRSGLTRHRLKHTGERPHVCRHCGKGFRQKHEMNAHALTHSDSKPHVCTVCGTAFRQRRALRHHCKRLHDSKPAEDAHGYNNAINYILTLNSNSETSECSICGKSVPRASKARHRRAHEAAGTQRYRCSVCGCAFSDGGNLARHVRALHAARRPHACPLCRRTFTRAAHLADHLRSHDDRRDYVCHVCGKASKTGAGLRSHRRVHAEEFEFECPACSARFKTGRQLRAHASVHTGERPYACTCGAAFRLRAQLTRHERTHTRTKTTAD-