Monarch geneset OGS2.0

DPOGS210398
TranscriptDPOGS210398-TA1113 bp
ProteinDPOGS210398-PA370 aa
Genomic positionDPSCF300291 + 64555-65667
RNAseq coverage597x (Rank: top 21%)
Annotation
HeliconiusHMEL0130072e-16885.56% 
BombyxBGIBMGA008284-TA1e-17586.29% 
Drosophilafne-PB6e-5937.41% 
EBI UniRef50UniRef50_F4W7V72e-6340.31%ELAV-like protein 2 n=5 Tax=Acromyrmex echinatior RepID=F4W7V7_ACREC
NCBI RefSeqXP_394166.36e-6038.97%PREDICTED: similar to found in neurons CG4396-PA [Apis mellifera]
NCBI nr blastpgi|3838650341e-6339.75%PREDICTED: ELAV-like protein 2-like [Megachile rotundata]
NCBI nr blastxgi|3838650346e-6339.90%PREDICTED: ELAV-like protein 2-like [Megachile rotundata]
Group
Gene OntologyGO:00001661e-24nucleotide binding
GO:00036767.6e-24nucleic acid binding
GO:00037235.5e-08RNA binding
KEGG pathway 
InterPro domain[285-368] IPR0126771e-24Nucleotide-binding, alpha-beta plait
[289-362] IPR0005047.6e-24RNA recognition motif domain
[15-30] IPR0023435.5e-08Paraneoplastic encephalomyelitis antigen
Orthology groupMCL25459 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210398-TA
ATGGAGAATGGCGGCGGCGACGCTAATTCCAATGTGTCACCCACCAAACTTATGGTAAACTATATTCCAGAATTGATGACGCGAGATATGATGTATGCCCTCTTCTCCGCCATGGGTAAAATAGAAAGCTGTAAATTAATAGCAAATAGAGGATACGGATTCGTAGAATACGAGAAACACGAGGACGCCGAGAAGGCGCGAGCCGCATTCAACGGCCTGTTGATGCAGGGCAAGACATTAAAGGTGTCCTTCGCGCTTCTGAACCCCGAGAACAAGGTCAGCCACAAACCGGACACAGAGTCGAATCTTTATATAAGTAATCTACCTCCAGACATGACACTAGAGCGGTTAAACATGTTATTCGGTCAGTTCGGAAACATAACTAACAGTCGGGTGTCACAAGGTATCGGATTCGTGTGCTACGAGACGCGGCAGCAGGCTGAGGACGCGATAACGCACATGAACGGTCAGTCACCGATTCCTGGCGCGGGCGCCATCGTGGTGAAGTTCGCCAACAAACCCAGCGCCAACAAGAACGCCCCGCGGCCCATTCAGAGGGTTGGCGTGAGCGCGGCGCCGCTGGCCTACAACGGTGCGGCGGCCGTGTTCAACGGTTCCACGCCGGCCTTCAACGGCACCAACCTGAGCGCGTTCGGAGCGCGGCCCGCCACCGCGTTCGCACCCGGCTTCCCGCAGGCGTCGCCGCCGCCTCTGCTGCCGTCCCCGGGTAAGGCACTGCCCTTCATCAACAAGGGCCAGCAGCGCTTCAACCCCATGGCTGCCACTAACCACAGTCCGTTACCGTTGTTAGGCGCCCCGGCCAGTCCGGTGCCGCTGCTGGGCGCCCCGGCCGCCCCTCAGACCACGGTGTACGTTTACAACGTGGGCGAGGATACCGAGGAACTGGCGCTCTGGCAACTGTTCGGTCCCTACGGTGCCATAGACTCCATTAAAGTGATCAAGGACCCCGAGACGAAGAAAAATAAGGGGTTCGCGTTCGTTAACATGCGGGAGTACGACGAGGCCGCTATGGCCATCCAAGCTCTGAACGGATACACGCTCAACGGCCAGGTCCTCTCGGTTAGCTTTAAGACTCAGAAGAGAAGTAATTAG

Protein sequence:

>DPOGS210398-PA
MENGGGDANSNVSPTKLMVNYIPELMTRDMMYALFSAMGKIESCKLIANRGYGFVEYEKHEDAEKARAAFNGLLMQGKTLKVSFALLNPENKVSHKPDTESNLYISNLPPDMTLERLNMLFGQFGNITNSRVSQGIGFVCYETRQQAEDAITHMNGQSPIPGAGAIVVKFANKPSANKNAPRPIQRVGVSAAPLAYNGAAAVFNGSTPAFNGTNLSAFGARPATAFAPGFPQASPPPLLPSPGKALPFINKGQQRFNPMAATNHSPLPLLGAPASPVPLLGAPAAPQTTVYVYNVGEDTEELALWQLFGPYGAIDSIKVIKDPETKKNKGFAFVNMREYDEAAMAIQALNGYTLNGQVLSVSFKTQKRSN-