Monarch geneset OGS2.0

DPOGS214461
TranscriptDPOGS214461-TA1416 bp
ProteinDPOGS214461-PA471 aa
Genomic positionDPSCF300441 + 63806-66745
RNAseq coverage140x (Rank: top 55%)
Annotation
HeliconiusHMEL0044291e-6650.19% 
BombyxBGIBMGA011248-TA3e-1428.06% 
DrosophilaCG9650-PH3e-1136.90% 
EBI UniRef50UniRef50_C3XT905e-1835.51%Putative uncharacterized protein n=2 Tax=Branchiostoma floridae RepID=C3XT90_BRAFL
NCBI RefSeqXP_002738554.11e-1631.76%PREDICTED: zinc finger protein 64-like, partial [Saccoglossus kowalevskii]
NCBI nr blastpgi|2608230146e-1737.24%hypothetical protein BRAFLDRAFT_71734 [Branchiostoma floridae]
NCBI nr blastxgi|2608111931e-2133.01%hypothetical protein BRAFLDRAFT_66809 [Branchiostoma floridae]
Group
Gene OntologyGO:00036764.3e-09nucleic acid binding
KEGG pathway 
InterPro domain[17-82] IPR0066124.3e-09Zinc finger, C2CH-type
Orthology groupMCL35022 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214461-TA
ATGAGTTTGTCCAAGAACGGCTTCGCCGGAGCACCCTTTAACGAACGCACTGAAGGGCGAAGTCCCTACTTTATTTTCCCGGCGAGCAAGACGCTGCTGCGGAAGTGGCTGGACGTGACGCCGACCAAGGGAAGGATCACCATCGACTCGGTCATCTGTCATCAGCACTTCAAGGAGGACGAGTACGACTTCATCCGTGGGAAGACCAGGCTCAAGGCGAAGGTCGTCCCGAGCGTGTTCGATGTGAAAAGTCGCCGTCGCCCCAAAGAGAAGGCAACACAGAACATTGTCGACATCAATGACGAGATACCCGCGGCCACCATAGACAAGAACACGTTGAAGACTGTAAGCACCATAGACAAGAATATAGAGACGGAAAACAGTAAACACCATGAACCGAACGGAGATATTATCGAACGACTAGTAGCAAACTCGTCACAAGACAACGAAGCACACAAAGACATAGAAGACATTATAACGAACTATCAAATAAAACAGATAAGACCTATAAACAGGGAAACGGAAACGGAAAAGGAAGGAGAAGTAGAGAAAGAGGGAGAAGAAAGAAGAGAAAAAGAGATACAGAACGAGGAGAATGACGTGGTGACGATAGAAGACGCCGCGCCCGTCTACATAGAAGTTAACGTAGACAAAGGTAGCGACGTGGCCGGCGACTGTATGATGGTGCTGGAGAGCGTCCAGTGCGAGGTGGACCCCGGCCTGTTCGAGGAACAGGACAACGACCACGACCTGGACAGAGACTCGGATGTCATCGACCTTGGAGAGAGGAAGGAGGATCCTATAAGTCTGCTGACGTCCAGCGACGAGGACGAGGTCGTCATAGAGGAACCTCACATCGACACTGTGGAGGTGTCCGACGGAGACTCGGAGCACGACCTCGAGGAGGAGGACGATCTGCCGCTGGTGAAGCTGGTGCCGCACGGACACCGGAACAGGAAGTGGCCGCTCTACCAGTACTACTGTGTGGAGTGCGGCTTCACCACCGACGACAGGACGGAGTACAAGAAGCACAGGAGCGATCACACCACCGTCCTGGAGGTGTGCCAAGTGTGCGGCTACACGACGGCCAGCAAGGCGCAGTTCGGCAGACACAAGAGGAAACACAAAGACGAGAAGAAATACAAGTGTCACCTGTGCGACTACAGGGCGAGGCACAACATGAGCCTCATATACCACCTCAAGTCGCACGAGCGGGTCATAGTGAACGGCAAGGACGGGTACCAGTGCAGCAAGTGTAGCTACCGGAGCAACGTGAAGAGCAGCCTGGTGAGGCACGTCAGAATGTGCGGGGGCAGGTTCGCCTGCGAGGGCTGCGACTACAAGACCAAGAGGGAGAGCGACCTGCGGAAACACCGGCTGCGGAGACACGCCGCCTCCAGGAGAATACACAAGTGA

Protein sequence:

>DPOGS214461-PA
MSLSKNGFAGAPFNERTEGRSPYFIFPASKTLLRKWLDVTPTKGRITIDSVICHQHFKEDEYDFIRGKTRLKAKVVPSVFDVKSRRRPKEKATQNIVDINDEIPAATIDKNTLKTVSTIDKNIETENSKHHEPNGDIIERLVANSSQDNEAHKDIEDIITNYQIKQIRPINRETETEKEGEVEKEGEERREKEIQNEENDVVTIEDAAPVYIEVNVDKGSDVAGDCMMVLESVQCEVDPGLFEEQDNDHDLDRDSDVIDLGERKEDPISLLTSSDEDEVVIEEPHIDTVEVSDGDSEHDLEEEDDLPLVKLVPHGHRNRKWPLYQYYCVECGFTTDDRTEYKKHRSDHTTVLEVCQVCGYTTASKAQFGRHKRKHKDEKKYKCHLCDYRARHNMSLIYHLKSHERVIVNGKDGYQCSKCSYRSNVKSSLVRHVRMCGGRFACEGCDYKTKRESDLRKHRLRRHAASRRIHK-