Monarch geneset OGS2.0

DPOGS212617
TranscriptDPOGS212617-TA1158 bp
ProteinDPOGS212617-PA385 aa
Genomic positionDPSCF300245 + 160697-165500
RNAseq coverage9x (Rank: top 85%)
Annotation
HeliconiusHMEL0024755e-12662.94% 
BombyxBGIBMGA005212-TA6e-7152.28% 
DrosophilaCG42399-PB1e-2335.29% 
EBI UniRef50UniRef50_E2AE591e-2830.23%Uncharacterized protein KIAA0423 n=1 Tax=Camponotus floridanus RepID=E2AE59_CAMFO
NCBI RefSeqXP_396813.33e-2933.93%PREDICTED: similar to CG4648-PA, partial [Apis mellifera]
NCBI nr blastpgi|3287821812e-2833.93%PREDICTED: hypothetical protein LOC413368 [Apis mellifera]
NCBI nr blastxgi|3071802592e-3129.56%Uncharacterized protein KIAA0423 [Camponotus floridanus]
Group
Gene OntologyGO:00054883.2e-09binding
KEGG pathway 
InterPro domain[154-369] IPR0160243.2e-09Armadillo-type fold
[217-369] IPR0119891.9e-07Armadillo-like helical
Orthology groupMCL18836 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212617-TA
ATCTCGCCACCAGCAAGAGATATTGTACTGGAACCCTTTGTTGCGGTTCATCGAGATTCGGGCAATAACGAAGAGAAGCAAGAAAAAGAAGAAACTGAAAATGAAAATGAAAAAGAAACCGAAAAACTATTTACAACTGAAACTGTTAATGATAAAAATGAAGAAGAGAATGTACAGCGATCCCGGACACCCTCGATACAGATTGAAGATATTTCTGAAAAGTCTATTGCATCTTCAGAGACAGCTGCGGAAACATCGTCTGTTAAATCCATAATAAAAGAAACTCAAGAAAATTCTCATCATGATACTGAGCGGAGTCCAGAAACTAAATCAGCAAGTGCAAAATCATCACCAGGAGCAACAACTCCATCCAAAAACAGCGTTACATCAAATATTTCCTCAGCTTCGATGAGATCTCAGAATGCCACACCGGAACCACCAAGTCTAGTAATAGAAAAGCCACCCGACCGTGACGTTCGTAGCTCCTTAGCAGAATGCATGATACCAGCGCGGCATGAAGACTGGGAAACGATAGTGTCAAGTCTTATTGAAACCGAAAGACTGGCAAAAGACGAAATCGCCAGAGCTCCAGCCGCTAGCTGGAGGGCAGCGACTCGCAGTGTTGCTGCTCACGTTCGATCATTAAGATCCAGAGTGGCTAGAGCCGCTTGTTCAACAATGGGTGCTTTATTTGAGAATCGAGGAAGATGTCTAGATCCAGAGTTGGAAGAAGCCACCAGTGCTTTATTGGAAAGATGTGCTGATGTTAATCGGTTTCTGAGGGCCGACGCAACCACGGCGCTTGGACGGGTCGCTTGTGGTGGCAATTGTGCTCGTGCTGGCGTAGCGCTCGTTAGAAGAGGGGCTTCTCACAGAGCCGGACCAGTACGCGCTGCAGCTGCACAAGCACTCACAAAACTAGTTAGACAACAAGGCTCCTCTCGTATACTAGATCTACCCATAGAACCTCGCGTTGTAATTTTGCGTGCTGCTGGAGAACTGCTAGGAGATGCAAGTCCTGAAGCCCGGTTACATGCAAAGCACCTCTGTCTCGCACTTTCAGAAGATGTGCGTTTCCGGCAAATGCTAAAGGACGCTATGCCACTCAGCCGATATCGAGCGATAGAAAAATATGTAGATAAATTGCGTTGCCGGTAA

Protein sequence:

>DPOGS212617-PA
ISPPARDIVLEPFVAVHRDSGNNEEKQEKEETENENEKETEKLFTTETVNDKNEEENVQRSRTPSIQIEDISEKSIASSETAAETSSVKSIIKETQENSHHDTERSPETKSASAKSSPGATTPSKNSVTSNISSASMRSQNATPEPPSLVIEKPPDRDVRSSLAECMIPARHEDWETIVSSLIETERLAKDEIARAPAASWRAATRSVAAHVRSLRSRVARAACSTMGALFENRGRCLDPELEEATSALLERCADVNRFLRADATTALGRVACGGNCARAGVALVRRGASHRAGPVRAAAAQALTKLVRQQGSSRILDLPIEPRVVILRAAGELLGDASPEARLHAKHLCLALSEDVRFRQMLKDAMPLSRYRAIEKYVDKLRCR-