Monarch geneset OGS2.0

DPOGS206322
Transcript: DPOGS206322-TA (2364 bp)
Protein: DPOGS206322-PA (787 aa)
Genomic position: DPSCF300082 - 430270-437050
RNAseq coverage: 679x (Rank: top 19%)
Annotation
Heliconius: HMEL012589; 0.0; 74.30%
Bombyx: BGIBMGA005268-TA; 0.0; 76.90%
Drosophila: cg-PJ; 1e-161; 42.94%
EBI UniRef50: UniRef50_A8DYD2; 2e-159; 42.94%; Combgap, isoform J n=25 Tax=Diptera RepID=A8DYD2_DROME
NCBI RefSeq: XP_001959482.1; 3e-162; 43.09%; GF12033 [Drosophila ananassae]
NCBI nr blastp: gi|194754399; 6e-161; 43.09%; GF12033 [Drosophila ananassae]
NCBI nr blastx: gi|328788324; 0.0; 50.07%; PREDICTED: zinc finger protein 84-like [Apis mellifera]
Group
Gene Ontology: GO:0003676; 1.8e-11; nucleic acid binding
KEGG pathway:
InterPro domain: [234-257] IPR013087; 1.8e-11; Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL16411; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206322-TA
ATGTGTATGCCAACGCCGAGTAATGTTTCGGGCTTCGGCTACTCGTGGGGTTTCACGAGTAACGCAGATCTGACTAAGTGTGAAGTTGAGACACCCAGCAGCCTGAGCTATACCTTGGGGAATACGGTTTCTCCAGAAACAACATTGACTGTTATGTCTCACAAAACTCCACAAGTGCAAGACAAAGTCGGTGCAGTAGCAGTAGCGGTAACAAATGCAGCAACTCAAGTCACAAGGGTTGTCACAGCTGGAACACCTGTGACCGCTAGGAGGGCCATGTTTGTGTTAAATGAAGCCCCCACACACACTATCAATAGAGTCTCACAGCCAAACCAGACTGCAACTATACAGACTACACATTTATCGGCAAAACCATTCATGACACCAATTGGTCCTATTCAGTTGACTGCAGAGGAATGTAATGAGATTTTGATGAAACGGGCCCTTCAAGCACAAGGTATAGCCACACCCGTGATAGATGCTTCACAGTTGAATCATTCTATATTAAACGGAAGTCTGAAGAGTCTCGCCGAGGCCACTGCCCAGCAGTCACAGGCTGAATCAATACAGACAACTGTTCACCAACATCACTTGCTTAATACTAAACAGGAACCGGGTACAGCGAATAATTCACCAAAAACGGTGTATTCACAGACAGACATGATGACTAACACTGTGATGACGGTGCCACCAGTCAAGGAACGTCCATATTCATGTGAAGAGTGCGGGAAATCTTTTCTTCTGAAGCATCATCTCACAACACATGCAAGAGTTCATACAGGAGAGAGGCCTCATGTGTGTGGTCATTGTGGGAAAGCGTTTGCAAGGAAACATTGCCTCAATACACATCTACTGTTACATTCTGCTGACCGGCCATATAGATGTCATGAGTGCAAAATGGCATTCACACTCAAACACCATCTCGTCACACATTCACGAGTACACAGTCGCGAGCGGCCGTTCGTGTGCGGCGAGTGCGGTCGGGGGTTCCCCTTGAAGCGACATCTCGTGACGCACAGCAAGTACCACGCGGGGGAGAGACCCTTCGTGTGCGGGGACTGTGGGGAGAGCTTTGCACAGAAGGAACATTTAGTGATGCACAGCCGTTTCCACGGTTCATTGTCACCGTTCGTGTGTCCGGACTGTGGAGTCGCTTTCGCCAGGAAGTTCCAACTCGTCAATCACGGCCGAGTACACGGTAGAGTACCACACGCCTGCCCCGTCTGTGGAAAAGAATTTCTACAGAAGAGGACCTTAGTCGCGCATCTGAAAATTCACACCGGAGAAGGAGCGGTAGCTTGTTTGGAGTGTGGCGAAGCTTTTAAATGTAAACCCGAGCTACAGCAGCATTTGAAAATAACGAGGCATTGTCTCACTGCGACCCAACAGCAGCAGATAATTGGTGCTGATGGTCAAATTATCACTGAGAAGGCATCACCAGGATATGTGTGTCCTGAATGCGGCTCTTGTTTCAATACAAAGGAGGCGTTGTCACTCCACGTGAGGTTGCACGCGGGCGACCGGACGTGTGTCACTGATCTGTGCGCACTCACCGCCGCACTACAGCCCAGTCTGGTGCATCACAATCAACCCATTCAGATAATATCGTCCAACCCCAGTAACGTAGTTCACACACAGGTAATTGCAACGAACCATCACATGACGACGCCACGACCGAAATTACATTTCTGTGTCGACTGCGGTAAAGGATTTGCAGCGAAACACGGCTTATTGGCTCACCATAAACGGCATCCAGATAGCAGTTGTACCCTTCGGACGCACGTGTGTGATCAGTGTGGGAAAGCTTTCTTCCAGAAAAATCATCTCATGCTGCATCAAAGACAACACATGGACCTACCGCCTAGAAATTCACAAGCACAACAACAAGCAACACAACAAGCACAACAGCAAGCGGCCCAACAGGCGGCACAGCAAGCGGCGCAACAGGTAGCACAACAAGTACAGCAGCAGGTGCAGCAACAGCAACAACAGCAGCAGCA
GCAGCAGCAACAACAGCAGCAGCAGCATCAGGTGCAGCACCAGAACCTCGAGATGGAACAGCAAGATCACCAGCAACCGACTCACATACAGGTTGTAACGAATCGTTCGGGTCAGGTGATCGGTAACCAGATAGTGGTGGGCACGGTCGGCGGGCGACGCGCTCTGCTCGGCTCTCCGCTACACATCGTCACCAGGGACGGCAAACAGCTCGCCTCGCAGATTGGACACAAGAGACACCACGCCGGCGACGTCAAGGAGGTCGGCGGGCTGGTGCTGTCGGGTCGGGGGGCGACCGGGCTCATCAAGTACGAGATATCGATACCGCCGCCGGCTTCCGTGGTACAAGTACAGCAGCTAGACTGA

Protein sequence:

>DPOGS206322-PA
MCMPTPSNVSGFGYSWGFTSNADLTKCEVETPSSLSYTLGNTVSPETTLTVMSHKTPQVQDKVGAVAVAVTNAATQVTRVVTAGTPVTARRAMFVLNEAPTHTINRVSQPNQTATIQTTHLSAKPFMTPIGPIQLTAEECNEILMKRALQAQGIATPVIDASQLNHSILNGSLKSLAEATAQQSQAESIQTTVHQHHLLNTKQEPGTANNSPKTVYSQTDMMTNTVMTVPPVKERPYSCEECGKSFLLKHHLTTHARVHTGERPHVCGHCGKAFARKHCLNTHLLLHSADRPYRCHECKMAFTLKHHLVTHSRVHSRERPFVCGECGRGFPLKRHLVTHSKYHAGERPFVCGDCGESFAQKEHLVMHSRFHGSLSPFVCPDCGVAFARKFQLVNHGRVHGRVPHACPVCGKEFLQKRTLVAHLKIHTGEGAVACLECGEAFKCKPELQQHLKITRHCLTATQQQQIIGADGQIITEKASPGYVCPECGSCFNTKEALSLHVRLHAGDRTCVTDLCALTAALQPSLVHHNQPIQIISSNPSNVVHTQVIATNHHMTTPRPKLHFCVDCGKGFAAKHGLLAHHKRHPDSSCTLRTHVCDQCGKAFFQKNHLMLHQRQHMDLPPRNSQAQQQATQQAQQQAAQQAAQQAAQQVAQQVQQQVQQQQQQQQQQQQQQQQQHQVQHQNLEMEQQDHQQPTHIQVVTNRSGQVIGNQIVVGTVGGRRALLGSPLHIVTRDGKQLASQIGHKRHHAGDVKEVGGLVLSGRGATGLIKYEISIPPPASVVQVQQLD-
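
The two records above can be checked against each other: a 787 aa protein followed by one stop codon corresponds to 787 x 3 + 3 = 2364 bp, which matches the listed transcript length. The Python sketch below illustrates one way to run that check on FASTA text. It assumes the standard genetic code (NCBI translation table 1) and writes stop codons as "-", as in the protein record above. The names read_fasta and translate are hypothetical helpers introduced for illustration; they are not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def translate(nucleotides, stop_symbol="-"):
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    seq = nucleotides.upper().replace("U", "T")
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


if __name__ == "__main__":
    # First six codons of DPOGS206322-TA, copied from the record above.
    print(translate("ATGTGTATGCCAACGCCG"))  # prints MCMPTP
```

To check the full record, pass the nucleotide FASTA text to read_fasta and compare translate(records["DPOGS206322-TA"]) with the DPOGS206322-PA sequence. Because read_fasta joins wrapped lines, a sequence that is split across several lines is handled as one record.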