Monarch geneset OGS2.0

DPOGS202367
TranscriptDPOGS202367-TA1083 bp
ProteinDPOGS202367-PA360 aa
Genomic positionDPSCF300104 + 107176-113679
RNAseq coverage9x (Rank: top 85%)
Annotation
HeliconiusHMEL0042494e-5967.72% 
BombyxBGIBMGA013754-TA3e-5169.92% 
DrosophilaCG4914-PA4e-1652.31% 
EBI UniRef50UniRef50_E9IRT31e-1835.61%Putative uncharacterized protein (Fragment) n=2 Tax=Myrmicinae RepID=E9IRT3_SOLIN
NCBI RefSeqXP_001845724.13e-1650.70%serine protease [Culex quinquefasciatus]
NCBI nr blastpgi|3640236272e-3175.95%seminal fluid protein CSSFP038 [Chilo suppressalis]
NCBI nr blastxgi|3640236271e-3175.95%seminal fluid protein CSSFP038 [Chilo suppressalis]
Group
Gene OntologyGO:00038246.9e-09catalytic activity
KEGG pathway 
InterPro domain[162-199] IPR0090036.9e-09Peptidase cysteine/serine, trypsin-like
[32-332] IPR0161816.3e-08Acyl-CoA N-acyltransferase
Orthology groupMCL19870 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202367-TA
ATGGCGAGAAATGCGGAAGCTGAGGTGGCAAAAATGAAAATGCTAGAAGAACGCATCAAGGCGCCATCTATCTGGGGACGTGTGCCCTGCGGCATCAGGTTCGAAGATCTGCAAGACAGGAGCTACGAGGATGCTGTCAGATTATTGAAAAACCATTTCTTATCAGAAGAGATAACCTACAGATCCGTGAAGATATCCAACGAGAAAGAGGCGACTGATGAATTCGTCCACAATGTTAGGATCTGGATGAAAGACAAAATGTCCATAGCAGCGGTCAAAGAGGGAACAAATAAACTAGTTGGAGTTCTGATCATGAGGATACAGGAGAAAACCGACCGCTACCTCAAACCATCTCAGTGTGTCCTCGGTGATTTATTGCGTACGAAGCGTGGTGTGTATTCGAAGAATTTCTTCGGTGGTGTTTGGGGCAACCGACCACCACTACTTGAAGCGGGCCAGGCCAAGACTACGTGCACATGTAAATGTGGCGAAAGAAATGAAGTCTCCCGCATCGTAGGGGGTGAGGAGGCTGGTGTCAATGAGTTCCCTTGGGTTGCCAAAATGACATATTTTAAAAAGTTCTACTGCGGCGGCGCATACTCCCGTACATTTAGTCACGTTAAAATAACACACAGCCCTCAATACACGCAAGTGATTACGTTTTATAGAGAAATTGAGAAGGGTGCTAATTTGTTTGAGAAACTTGATGTGAAGCGTTACCTTAAGATATACGTGCTGGCATTGAAAGCTAGATATCGGCATAGGGGTATAGCAAAGGAAATGTTAAAGGCTGCGATAGGGTTGAGTGAAAGTGCCAACGTTCCCGCTATATCTGGCATCTTCACGACGGGTAGAGGCCAGCAGATCGCTGAAGAACTAGGCTTTGAGAAGTTCAACGAATTATATTACATAAGATACATCATAAACGACGAGATAGTTTTCTGGGACACCGGTCTAGGTAATTACGGTGCCGCTCTCATGGCGTACAGGATACCAACGGTAGTTGAGCCCCAGGAACTTCAGATGCAGCCGTCATCACGGTTCAACATACAACAGCCAGATGATGATGATGATGATGATTGA

Protein sequence:

>DPOGS202367-PA
MARNAEAEVAKMKMLEERIKAPSIWGRVPCGIRFEDLQDRSYEDAVRLLKNHFLSEEITYRSVKISNEKEATDEFVHNVRIWMKDKMSIAAVKEGTNKLVGVLIMRIQEKTDRYLKPSQCVLGDLLRTKRGVYSKNFFGGVWGNRPPLLEAGQAKTTCTCKCGERNEVSRIVGGEEAGVNEFPWVAKMTYFKKFYCGGAYSRTFSHVKITHSPQYTQVITFYREIEKGANLFEKLDVKRYLKIYVLALKARYRHRGIAKEMLKAAIGLSESANVPAISGIFTTGRGQQIAEELGFEKFNELYYIRYIINDEIVFWDTGLGNYGAALMAYRIPTVVEPQELQMQPSSRFNIQQPDDDDDDD-