Monarch geneset OGS2.0

DPOGS213691
Transcript          DPOGS213691-TA    1302 bp
Protein             DPOGS213691-PA    433 aa
Genomic position    DPSCF300219 + 347020-351034
RNAseq coverage     106x (Rank: top 60%)
Annotation
Heliconius        HMEL021072          7e-98     70.61%
Bombyx            BGIBMGA010348-TA    7e-174    73.20%
Drosophila        CG32483-PA          2e-116    50.12%
EBI UniRef50      UniRef50_Q9W0N8     2e-110    47.74%    CG3344 n=26 Tax=Diptera RepID=Q9W0N8_DROME
NCBI RefSeq       XP_002047100.1      8e-118    49.54%    GJ13239 [Drosophila virilis]
NCBI nr blastp    gi|195376639        2e-116    49.54%    GJ13239 [Drosophila virilis]
NCBI nr blastx    gi|195586740        1e-116    48.64%    GD13517 [Drosophila simulans]
Group
Gene Ontology      GO:0006508            1.8e-175    proteolysis
                   GO:0004185            1.8e-175    serine-type carboxypeptidase activity
KEGG pathway
InterPro domain    [17-431] IPR001563    1.8e-175    Peptidase S10, serine carboxypeptidase
Orthology group    MCL10434              Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213691-TA
ATGTTTGTTCTAAAAATATTATCAGTTTTAACAACGGCATTTTATGTCAACGCTCAATTCGGAGATTACAAACAAGATTTCGGGTATGTGACTGTGAGAGATGGAGCACACATGTTCTATTGGATGTACTACACCACCGCTAATGTATCAAATCACACTGAGAGACCTCTCATCGTGTGGCTGCAAGGTGGACCCGGTGGTTCCTCTACAGGCATTGGTAACTTCGAAATACTTGGTCCACTTGATGAGAATTTGCAAGAAAGAAACTACACTTGGGTGAACAATTTTAACGTCATCTTCGTCGATAACCCCGTTGGTACAGGTTTCAGCTACGTCGATGATCCTATTTATCTGACAACAACAAATGATCAAATAGCTCTCGACTTTGTCGAATTGATGAAAGGGTTTTATCGATCTAATCCTGAGTTCGAAGAAGTGCCACTTTATATTTACGGACAATCATACGGCGGAAAGATGGCGATTGACATGGGTATCCGAATGCGTGAGGCTGAAATTGCTGGAACGATAAAATCGAATTTAAGAGGAATAGCTATGGGCAACGCTTGGATAAGTCCAGTAGATTCCACTCTCACTTGGGGACCACTGCTGCTAGCTGCTGGTCTGGTTGATCAGACTGGATACGAACAAATACAAACATCTGCACGAGAGACTCAACGGTTATTCAACGAGGGATTATATCTAGGTGCAACTGCTCAATGGTCTGCGACCCAAACGGCTGTTTTACAAGCAACAACTAGAGTGGACTTCTACAATATATTAACTAAAAATCCAGTGCCCCAGACGTTTGATAATGAATTAGAAAAACTTATGCTGCCAGATAGTTTTTATGGAAAATCGAGAAGATCAAGAAATACTCTTAATACCTTAATGAACACGAGAGTGAAGGAAGCTCTTGGAATCCCAGCCAATGTCACCTGGAGTGCGTTGTCAAACAGTGTATTTCACGCTCTCAGGACGGATTTTATGAAACCGGTGACCGAAAACATCGAGAAGCTTCTAAACGAAACGGACATTATTATCACAAAGTACAATGGAAACCTCGACCTCATTTGTAGTACCACAGGTCAAATCTTATGGGTGGATCGTCTTCGCTGGCAGGGAGCTGAAGGCTACAAAAACGCTACCCGCCATCCAATCTGGATCAACAATCGATTGGAAGGATACTACAAATCCTACAGAAACTTTCGTTTCTTCTGGATAAATCTAGCCGGACACAGTGTACCTAGAGACAATCCCGCGGGAAGCAGCGCCTTCCTACGTGACATGACTTCATTTGGCTAA

Protein sequence:

>DPOGS213691-PA
MFVLKILSVLTTAFYVNAQFGDYKQDFGYVTVRDGAHMFYWMYYTTANVSNHTERPLIVWLQGGPGGSSTGIGNFEILGPLDENLQERNYTWVNNFNVIFVDNPVGTGFSYVDDPIYLTTTNDQIALDFVELMKGFYRSNPEFEEVPLYIYGQSYGGKMAIDMGIRMREAEIAGTIKSNLRGIAMGNAWISPVDSTLTWGPLLLAAGLVDQTGYEQIQTSARETQRLFNEGLYLGATAQWSATQTAVLQATTRVDFYNILTKNPVPQTFDNELEKLMLPDSFYGKSRRSRNTLNTLMNTRVKEALGIPANVTWSALSNSVFHALRTDFMKPVTENIEKLLNETDIIITKYNGNLDLICSTTGQILWVDRLRWQGAEGYKNATRHPIWINNRLEGYYKSYRNFRFFWINLAGHSVPRDNPAGSSAFLRDMTSFG-
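
The transcript and protein records above can be cross-checked against each other: 1302 bp corresponds to 433 codons plus one stop codon (3 x 434 = 1302). The following is a minimal sketch of such a check, assuming the standard genetic code and this page's convention of writing the stop codon as a trailing "-". The names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G base order
# (first base varies slowest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to be "-",
    matching how the protein record on this page ends).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First seven codons and last three codons of DPOGS213691-TA.
print(translate("ATGTTTGTTCTAAAAATATTA"))
print(translate("TTTGGCTAA"))
```

Passing the full DPOGS213691-TA sequence to `translate` and comparing the result with the DPOGS213691-PA record is one way to confirm that the two entries agree.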