Monarch geneset OGS2.0

DPOGS200510
TranscriptDPOGS200510-TA1410 bp
ProteinDPOGS200510-PA469 aa
Genomic positionDPSCF300450 + 20790-25651
RNAseq coverage293x (Rank: top 38%)
Annotation
HeliconiusHMEL0176306e-12862.72% 
BombyxBGIBMGA001706-TA7e-9039.31% 
Drosophilapita-PA4e-2122.81% 
EBI UniRef50UniRef50_UPI000202705A1e-2433.82%UPI000202705A related cluster n=1 Tax=unknown RepID=UPI000202705A
NCBI RefSeqXP_002730468.17e-2130.77%PREDICTED: zinc finger protein 345-like [Saccoglossus kowalevskii]
NCBI nr blastpgi|3443082523e-2536.60%PREDICTED: hypothetical protein LOC100661788 [Loxodonta africana]
NCBI nr blastxgi|3017811481e-2826.63%PREDICTED: zinc finger protein 91-like [Ailuropoda melanoleuca]
Group
Gene OntologyGO:00036766.9e-08nucleic acid binding
GO:00056342.1e-05nucleus
GO:00082702.1e-05zinc ion binding
KEGG pathway 
InterPro domain[401-434] IPR0130876.9e-08Zinc finger, C2H2-type/integrase, DNA-binding
[437-457] IPR0227551.8e-06Zinc finger, double-stranded RNA binding
Orthology groupMCL26468 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200510-TA
ATGGAAACTCATCATAACTTAGTTGAGATCGAATATAAACCGGAGTTGGACTCGCCTAAGACGATATGCCGCTGTTGTCTCTCGAGCGACCGACGGGTTGTGAAAATTGAGAATTACCGTGAACTTTTCGTTGAACTGGCTGATATAATAGTGTCAGATTCCGATGGCCTCCCTCAGTGGCTCTGCTGGGAGTGCAGTTGTCTTTTGCTGAAAAGTGTACGATTCAAGCAGAAAGTGCTTAAAGCCCATCTCACACTTTACAACTATCATAGTAGATGTGCTCCGTTCCCCATAGACGGCCAGGATCCTGAATTGACTAAGTACGCCAACCCGCAACTGAGCAGCTCGGCCACCCTGGTCATAGACAATGTCAAGACAAAAACGGGATACCATAAAGTTTTGGAGCATGAGAAAATCCACTTCATGCCGCCCTCGGACGATACTCACTTCCTGGATGAAGAGATTCCTCTCAAACTGGAGACGGAAGATGATGTGCCACTACAGGAGATACGAGAAAATAAATTGAAAGACCTGGCACTAGACGGCTTCATGACAGAATCTAATCAAGAGAAGAAAGAAAAGACTAAGAAAACGAAAAAGAAAGTTAAGACAAGAGTCACAGATAACATAAAGACAGAGAATGAGCTGGAGAGCGACACACAAGAGGCTAAAACACCAAAGAAAAGAAACCTTCGCAGAACTATAGACATAGACGAGAACAAGATACGAGTCATCACACTGGACCCGGCTGAACAGCTGAGGCAGAGACAGGCAGAGAACGAGGCCACACTGAAGTTCCCGTTCCAATGCCACCTGTGCTTCAAAGGTTTCAACTATGAAGAGAAGCTCAAGAACCACATGTTCAAACACAGCCCGGCTCGTGGCAGTTACAAGTGTGAAGTGTGCAGTATGTATCTACCCACACCGTACTCGGCATCTGTCCACGGTCTCACTCACACACTGAGGTATGAGTGTGTACAGTGCGGGAGACGGATGACAGACAGGCTGGCTATAGTTAATCATTACAGATCTCAACACGAGGGGGTGCTCTCGGTCTACACTTGTCACATATGCGGGAAAGTATCGAATAATGATAAGACTCACCGCGGTCACATGAGGAACCATCATTCTGGCTCCAGAGCCGTCTGCAGGGAGTGCGGCAAGAGCTTCGTCAACAACGACTCCCTCGCCGAGCACATGCTCATTCACCAGGGCATCAAGAACTACGAGTGTCCGGAGTGCGGCAAGAAGTTCCGCACGAGGAATCAGATCAGGCACCACCTGGTCAAACACAGCGACCACAAGGAGTTCTACTGTGTGGAGTGTGACGTCAGGTTCAAGTCAGCTCACACCCTCCGCCAACACTTGAAAAGGACGACGAAACACAAAGACAAGAAGAGCCTCAAGTGA

Protein sequence:

>DPOGS200510-PA
METHHNLVEIEYKPELDSPKTICRCCLSSDRRVVKIENYRELFVELADIIVSDSDGLPQWLCWECSCLLLKSVRFKQKVLKAHLTLYNYHSRCAPFPIDGQDPELTKYANPQLSSSATLVIDNVKTKTGYHKVLEHEKIHFMPPSDDTHFLDEEIPLKLETEDDVPLQEIRENKLKDLALDGFMTESNQEKKEKTKKTKKKVKTRVTDNIKTENELESDTQEAKTPKKRNLRRTIDIDENKIRVITLDPAEQLRQRQAENEATLKFPFQCHLCFKGFNYEEKLKNHMFKHSPARGSYKCEVCSMYLPTPYSASVHGLTHTLRYECVQCGRRMTDRLAIVNHYRSQHEGVLSVYTCHICGKVSNNDKTHRGHMRNHHSGSRAVCRECGKSFVNNDSLAEHMLIHQGIKNYECPECGKKFRTRNQIRHHLVKHSDHKEFYCVECDVRFKSAHTLRQHLKRTTKHKDKKSLK-