Monarch geneset OGS2.0

DPOGS213692
TranscriptDPOGS213692-TA1347 bp
ProteinDPOGS213692-PA448 aa
Genomic positionDPSCF300219 + 352819-357686
RNAseq coverage97x (Rank: top 62%)
Annotation
HeliconiusHMEL0210724e-7552.11% 
BombyxBGIBMGA010349-TA1e-8745.98% 
DrosophilaCG32483-PA4e-4326.08% 
EBI UniRef50UniRef50_B9W4M61e-4831.11%Putative uncharacterized protein Cc1F03.4 n=2 Tax=Apocrita RepID=B9W4M6_COTCN
NCBI RefSeqXP_975315.14e-5331.70%PREDICTED: similar to CG3344 CG3344-PA [Tribolium castaneum]
NCBI nr blastpgi|2700019725e-5231.70%hypothetical protein TcasGA2_TC000887 [Tribolium castaneum]
NCBI nr blastxgi|2700019728e-5231.36%hypothetical protein TcasGA2_TC000887 [Tribolium castaneum]
Group
Gene OntologyGO:00065084.4e-66proteolysis
GO:00041854.4e-66serine-type carboxypeptidase activity
KEGG pathway 
InterPro domain[54-448] IPR0015634.4e-66Peptidase S10, serine carboxypeptidase
Orthology groupMCL30744 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213692-TA
ATGCGTAAAGTAACTATTTTTATTTTAGTTGCAATAGGTCTTGTTGCTGTTGCCGGGCTTGGCGTTCTGATTTGGTGGTTGGTGTCCTCTGATGATCCGCCCTATGAAACAATAGAATTGAAGGGAGAAAACATAGGCAATGTCAGCTATACGGCAGCATTCACAAGAGTACGTGGCAAAGGTGACGTCTTCTGGTGGTTTTACCCAACCCTTGCCGAGACCCCAACTAAAAGACCTCTGTTGTTATGGTTCCATGGCGTCACAGGACTCCCAGCAAGCTTCCTCGCCAACTTCGGCATGTTCGGACCGTACGACGTGCATCTCACTAAGAGAAATGATTCCCTGGTAAATGACTATAATTTACTATTTGTCGATGCCTCCATTGGGACCGGCTTCAGTACAGCCGAATCAGAAGATAGAGATCTCCCAAGTTTAGATGAAAATGTTGAGAGTTTGTGGAGAATGCTTCAATCATTTTACGATGTACACAACGAGTACCGGGAATCTCCTATTTACTTATGTAGCATGGGAGACGGATCGCAGTTAGTGATCCCTTTAGTTACAAAGTTAGCTATGGAAGATAACGTGTCTGATCAAATCAAAGGAGTTATTTTAGGCAATCCAGTTATTTCACCAGCCTTAGCGTTAACGAAACTTGGATATTATTTGGAGGAACTCGCTTACATCGATGGGAGAGGTAGAACAGAAATTGAAAGTTTTTCAAACCTTACGTACTCGCTTGTTCAATCAGAATCTTTTGAGCGGGCGTTTGATCAGTTTTCATCCATAGACAATTTTGTTAATGACAACGCCGGAGCCGTTAGCGTCAATTTAAACTATATTGTTGAAAAACTTACGCGCGAATCAAATAGGGATTACTTTGGACAAAACAATTACGTAAACAGAATACTTGGTTTAAGTCAAAACGCTTCTGTCTTTATGGACACTGTAGTTAGACCAGCCCTAGGTATCTCAAATGAGATAAGATACGACGGACAACGGGAAAAAGCTATTCAAGCGTTTAAGAGCTCTTATATGAAACCTATTGTTCATGCAGTTGAGCACATTCTTAATGAAACAAATGTGAACGTTACTATTTATAACGGCAACTTGGATGCGGTTTCTAATACTCCAGGTCAGTGGGAGTGGATCCGAACTCTGAATTGGCAAGGTCAAGAAGAATTCCTAAATCAGACAAGGAGGCCAATGGTATTAAACGGGTTGCTGGAAGGTTATTCCAGAATAACCGATAAACTGCGATTCTATTGGATAAACGTAGCTGGACTTATGGTTCCACTGGAAAATCCTGTCGCTTTTAAGTCACTTTTGCATTTTGCAACGTCGTGA

Protein sequence:

>DPOGS213692-PA
MRKVTIFILVAIGLVAVAGLGVLIWWLVSSDDPPYETIELKGENIGNVSYTAAFTRVRGKGDVFWWFYPTLAETPTKRPLLLWFHGVTGLPASFLANFGMFGPYDVHLTKRNDSLVNDYNLLFVDASIGTGFSTAESEDRDLPSLDENVESLWRMLQSFYDVHNEYRESPIYLCSMGDGSQLVIPLVTKLAMEDNVSDQIKGVILGNPVISPALALTKLGYYLEELAYIDGRGRTEIESFSNLTYSLVQSESFERAFDQFSSIDNFVNDNAGAVSVNLNYIVEKLTRESNRDYFGQNNYVNRILGLSQNASVFMDTVVRPALGISNEIRYDGQREKAIQAFKSSYMKPIVHAVEHILNETNVNVTIYNGNLDAVSNTPGQWEWIRTLNWQGQEEFLNQTRRPMVLNGLLEGYSRITDKLRFYWINVAGLMVPLENPVAFKSLLHFATS-