Monarch geneset OGS2.0

DPOGS215628
TranscriptDPOGS215628-TA1050 bp
ProteinDPOGS215628-PA349 aa
Genomic positionDPSCF300041 - 1985589-1989381
RNAseq coverage6000x (Rank: top 2%)
Annotation
HeliconiusHMEL0059429e-12884.49% 
BombyxBGIBMGA003688-TA0.083.67% 
DrosophilaCG2145-PA9e-4536.67% 
EBI UniRef50UniRef50_F6IA340.085.96%P102 protein n=2 Tax=Obtectomera RepID=F6IA34_HELVI
NCBI RefSeqXP_001606738.15e-5040.45%PREDICTED: similar to GA15266-PA [Nasonia vitripennis]
NCBI nr blastpgi|3346831510.085.96%P102 protein [Heliothis virescens]
NCBI nr blastxgi|3346831512e-17785.96%P102 protein [Heliothis virescens]
Group
Gene OntologyGO:00167889.8e-47hydrolase activity, acting on ester bonds
KEGG pathwaygga:4268885e-29 
 K08014 (RAPGEF3, EPAC1)maps-> Leukocyte transendothelial migration
    Long-term potentiation
InterPro domain[83-238] IPR0189989.8e-47Endoribonuclease XendoU
Orthology groupMCL20406 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215628-TA
ATGAAGCTGGCGCTAATTCTCCTTTTCTGCGTCGTTGCCAGCCAAGCTGATGATATAGCATCAGCTGCTGGCCAGATCTTCAACAGCATTCTCCCTAACCTGATAAGTAATTCAGTCACTGGTCAACAAGGTAACACCGCTACGAACACCTTGCAGCAAATCGGTACCGTTGTGGGTGGGGTCGTCGACTACGCCAAGAAGAAGAGTTACGAAGAAATGTTACGCCAAGTACAAGACTCCACTAGTGACGAAGACCTGCTTCGCGTCAGCGAGGAAATGTTTAATGCGGACGTCAATAATGCATTCAACTACATCCAGGTCAATTTGCAAGGCAAAACCAGCCCCATGTCTAAGAACGACGAAGCGCAAACACACCTCCTGAACGTGCCAGAGAATGTTTGGAATGGACCAACTATCCGACCATTCGTGGCGCTCTTTGACAATTATCACAAAAATGTTATACGACCAGAATTCGTGACACCTAATGAGGAAACGGAGCAGATAACATTTATCAATACTATACTAGCGACGGGACCCATGAGGAGTTTAATGTCTTTGCTCGTCAGTAAAGGACTCAATCAAATGAACGAATACAATGAACAAGTTGAATTATTGAAGAAGATCTGGTTCACGAAGTACGCACGACACTGGACCGGGCTGTGCAAGTGTAGCTGTGCCTTCGAAAATATCTTCATGGCAGAGCTCAAGTCCAATGATGTTTTAGGTTTACACAGCTGGTTATTCTTCGCAAAACGCGAGCTCGACAATAAAGCCAACTATTTGGGATACATTAACAAACTCGACCTTTCCGGAAAAGGGATGATTCTAAAACAACACTCGGTTCTCAGCGAAACGAAGGACGCGCCGGAAATAACAATGTTTGTGGGAACTTCCCCCGAGCTTGAAGTAGCTTTGTACACGCTGTGCTTCATGGCGAGACCAAACCGTCCCTGTAATCTGCGCTACAATAATATCCCATTCAGCATTCAAACAAAAACAATAAAGGCTGAAAACCTCGTGGTCATTGACACCGCATATCCAGTTTTTTAA

Protein sequence:

>DPOGS215628-PA
MKLALILLFCVVASQADDIASAAGQIFNSILPNLISNSVTGQQGNTATNTLQQIGTVVGGVVDYAKKKSYEEMLRQVQDSTSDEDLLRVSEEMFNADVNNAFNYIQVNLQGKTSPMSKNDEAQTHLLNVPENVWNGPTIRPFVALFDNYHKNVIRPEFVTPNEETEQITFINTILATGPMRSLMSLLVSKGLNQMNEYNEQVELLKKIWFTKYARHWTGLCKCSCAFENIFMAELKSNDVLGLHSWLFFAKRELDNKANYLGYINKLDLSGKGMILKQHSVLSETKDAPEITMFVGTSPELEVALYTLCFMARPNRPCNLRYNNIPFSIQTKTIKAENLVVIDTAYPVF-