Monarch geneset OGS2.0

DPOGS213051
TranscriptDPOGS213051-TA768 bp
ProteinDPOGS213051-PA255 aa
Genomic positionDPSCF300016 - 1326965-1328294
RNAseq coverage252x (Rank: top 42%)
Annotation
HeliconiusHMEL0150821e-8962.06% 
BombyxBGIBMGA013962-TA2e-5750.70% 
DrosophilaCG5255-PA2e-5242.73% 
EBI UniRef50UniRef50_E3WWU22e-5546.29%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3WWU2_ANODA
NCBI RefSeqXP_001661388.12e-6145.53%serine-type enodpeptidase, putative [Aedes aegypti]
NCBI nr blastpgi|1571282963e-6045.53%serine-type enodpeptidase, putative [Aedes aegypti]
NCBI nr blastxgi|1571282968e-6145.53%serine-type enodpeptidase, putative [Aedes aegypti]
Group
Gene OntologyGO:00038242.3e-83catalytic activity
GO:00042521.4e-79serine-type endopeptidase activity
GO:00065081.4e-79proteolysis
KEGG pathway 
InterPro domain[19-254] IPR0090032.3e-83Peptidase cysteine/serine, trypsin-like
[29-249] IPR0012541.4e-79Peptidase S1/S6, chymotrypsin/Hap
[57-72] IPR0013142.2e-14Peptidase S1A, chymotrypsin-type
Orthology groupMCL14547 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213051-TA
ATGTTAAAGATCATTATATTTTTCTGTTGTATCTATGTAGCAAAAGGTTTTGTGTTGCCGCATTTCATCGATAGGCCTGAACCTTGGATCGTAGGAGGAGAAGATGCCCCTGGAGGTTCTGCACCTCACCAGGCTTCGTTACGGAGCCTATTCAACTTTCACTTCTGCGGAGGATCCATCATAAGCAACAGATGGATTTTGACAGCTGCCCACTGTACTTTAGGGGAATCGAGTTTTACGATGAAGGTGGTTGTTGGAACTAACAGTTTGACAAACGGTGGAAATTCCTATTCCGTAGATAAAATAATAATACATGAAAACTTTAGCTACAGCGAAATTAAAAATGACGTCAGCGTTATTAAAGTTGCTAAGGATATAATATTTAATGAGCTTGTACAACCAATACAATTGCCAGACGCAAACACGCTTGGAGGTGCCAATCTTACCCTGACAGGATGGGGCACGACATCGTATCCTGGTAGCAGCCCCGACAAGCTGCAGGTCATAAAACTGCTATCACTAAGCGATGAAGATTGTCGTGACATCTACAGTCACGTTGATGGACCGGATGTGGATAGCACTCAGATATGTTCATTCACCAAGCAGGGAGAAGGTGCCTGTCATGGTGATTCCGGAGGCCCGCTGGTGGAAAAAGGCAAAGTAGTGGGTATAGTGTCGTGGGGCATGCCGTGCGCCAGGGGATACCCTGATGTCTTTACACGAGTATTTGCTTACAAAGACTGGATTATAGAAAACACTTCTGAATGA

Protein sequence:

>DPOGS213051-PA
MLKIIIFFCCIYVAKGFVLPHFIDRPEPWIVGGEDAPGGSAPHQASLRSLFNFHFCGGSIISNRWILTAAHCTLGESSFTMKVVVGTNSLTNGGNSYSVDKIIIHENFSYSEIKNDVSVIKVAKDIIFNELVQPIQLPDANTLGGANLTLTGWGTTSYPGSSPDKLQVIKLLSLSDEDCRDIYSHVDGPDVDSTQICSFTKQGEGACHGDSGGPLVEKGKVVGIVSWGMPCARGYPDVFTRVFAYKDWIIENTSE-