Monarch geneset OGS2.0

DPOGS212658
TranscriptDPOGS212658-TA942 bp
ProteinDPOGS212658-PA313 aa
Genomic positionDPSCF300198 - 260473-262915
RNAseq coverage148x (Rank: top 54%)
Annotation
HeliconiusHMEL0109991e-8969.23% 
BombyxBGIBMGA013658-TA3e-11468.80% 
DrosophilaCG11192-PB7e-2933.20% 
EBI UniRef50UniRef50_A1ZBU19e-2733.20%CG11192 n=13 Tax=Drosophila RepID=A1ZBU1_DROME
NCBI RefSeqXP_002034682.17e-2934.62%GM19789 [Drosophila sechellia]
NCBI nr blastpgi|1953360941e-2734.62%GM19789 [Drosophila sechellia]
NCBI nr blastxgi|1953360945e-2734.62%GM19789 [Drosophila sechellia]
Group
Gene OntologyGO:00038241.8e-52catalytic activity
GO:00042524.3e-38serine-type endopeptidase activity
GO:00065084.3e-38proteolysis
KEGG pathwaydpo:Dpse_GA218793e-21 
 K01312 (E3.4.21.4, PRSS1, PRSS2, PRSS3)maps-> Neuroactive ligand-receptor interaction
InterPro domain[35-296] IPR0090031.8e-52Peptidase cysteine/serine, trypsin-like
[64-288] IPR0012544.3e-38Peptidase S1/S6, chymotrypsin/Hap
[80-95] IPR0013142e-06Peptidase S1A, chymotrypsin-type
Orthology groupMCL26106 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212658-TA
ATGGTTACTCGAAATATTTTAATTTTCTCTTCGTTTTTTATTTACTTACATTTTAATAGTAGTTATATTTTAAGTGCGACCACAGCTAAGTCACAACTAGAATGTGAATTTCCACCAAAGCAAGAAGAAGACTTTACTCACATAAGAAGGAAAAGATTAACAACAGAGTCAGGACTTGTCTGCTTAGAAGATTATCCTTACACTGTGTCCATACGATTGCACGGGAAGCATTGGTGCGGCGGCGCTATAGTTGGTCTTCAATGGATACTCACAGCTGCTCAATGTTTTGACTATGTTACTATAGAAGAAGTAACAGTTCGCATGGGGTCGGTGTTTAGAGACTTCGGCGGTAAAATGGTATCAGTTATTGAAATCCAACGTCATCCAGAGTACAAGATGGATCAATATTATCCCGAACATAACTTGGCTTTAATTAAGATAAACATCAAAATAGAGGTGAGTAGCAGGATCCAAACGGTATCACTTGCAACGACTGATGCTGAAATTCCATCAATGTTTGGAGAGGTTGTTGTCACTGGATATGGCTCTGTGGTGACAGGTCAAATAAGAGAAGGAGTGAATCAAGAACTTCGTCGCATGATAGTTAAACAAGTTCCTCGGTCGGATTGTCGTGTTGTATATGGCAATGACTATCACATACAGCACGATCACATGTGCTTGGAAAGTGTTGTCAAAGGAGTAGCTTTGTGTGCGGGCGACACAGGCGACCCTGCGGTACATTTCAGCGGTATCAATAGGGCAGGCACGTTGTTTGGGATAGCTTTGTTTTCTGGTACTGAGGAATGCGCTAACAAACACAAACCCGGCGTCTATGCCAAGGTCTCCCTTAGCCGTGCCTGGATCAATGGAATTTTAAATGAAAATGAACGTTTTGAAAGTGCTGATAGTGTTGAAAATGCAATACCGATTGATATGGACTAG

Protein sequence:

>DPOGS212658-PA
MVTRNILIFSSFFIYLHFNSSYILSATTAKSQLECEFPPKQEEDFTHIRRKRLTTESGLVCLEDYPYTVSIRLHGKHWCGGAIVGLQWILTAAQCFDYVTIEEVTVRMGSVFRDFGGKMVSVIEIQRHPEYKMDQYYPEHNLALIKINIKIEVSSRIQTVSLATTDAEIPSMFGEVVVTGYGSVVTGQIREGVNQELRRMIVKQVPRSDCRVVYGNDYHIQHDHMCLESVVKGVALCAGDTGDPAVHFSGINRAGTLFGIALFSGTEECANKHKPGVYAKVSLSRAWINGILNENERFESADSVENAIPIDMD-