Monarch geneset OGS2.0

DPOGS206573
TranscriptDPOGS206573-TA1446 bp
ProteinDPOGS206573-PA481 aa
Genomic positionDPSCF300108 - 285007-290411
RNAseq coverage65x (Rank: top 67%)
Annotation
HeliconiusHMEL0153739e-1344.94% 
BombyxBGIBMGA013049-TA1e-0926.22% 
Drosophilagd-PA5e-1121.93% 
EBI UniRef50UniRef50_B0WMN21e-1229.91%Transmembrane protease n=2 Tax=Culex quinquefasciatus RepID=B0WMN2_CULQU
NCBI RefSeqXP_001656391.15e-1427.16%hypothetical protein AaeL_AAEL013140 [Aedes aegypti]
NCBI nr blastpgi|1700446794e-1229.91%transmembrane protease [Culex quinquefasciatus]
NCBI nr blastxgi|1571359272e-1329.92%serine protease, putative [Aedes aegypti]
Group
Gene OntologyGO:00038242.5e-31catalytic activity
GO:00042527.7e-17serine-type endopeptidase activity
GO:00065087.7e-17proteolysis
KEGG pathway 
InterPro domain[263-481] IPR0090032.5e-31Peptidase cysteine/serine, trypsin-like
[267-476] IPR0012547.7e-17Peptidase S1/S6, chymotrypsin/Hap
Orthology groupMCL34660 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206573-TA
ATGAAGTTGGTATTAATAGATATATTACTTGTGATATTTGTTAAAGTTATTCCAGTTTATAGTGAGTGTGATACTTATTTTGGATGCATTCAGGACACATTGTTTGGATCTTTTATAAATTTAAATGCTACAGAAACTCAAGTTAGTGACATAGATGTTCCAGGTTTCGAGAATGTTAATTTAGAGTCAGATAGTGTTGATACCATTTTGAAACATATATCTGATTTATTTGAAGAATTCGTTGCTGACTATAATTTGAGTTCACAGTTCGACGTAGTTCGTAAGAATAGTAATAAAACATCAATATTTGTTAAAACCATTGAAACTGATAAATTTAACGAAACCTTTGTGAAAAATATAGAAGATAAGAAAAACACTAATGTGACACATTCAAATACGACGAATAAATCAAAGGATGCTGAAGAAAATACTGAGTTACCAACTAAAGAACATTTTCGGGTTATTAAAATGAACGAAAAAAATAAAACAAAAAAATTAAAAGTGGAAACACTCCACAAACCGGTTTTAGATGTAAATGAATTGTCAAAAAAAGTGACGGAAAACGAGATATTGGATATAGATTCAATAGATTACGATTTAATTGATAATAGCACTGATTACCCAATAATTGATTATATAGATCTGTACAAGGATACTACTAACGAAGATGTAAATATGAGAACAACTGTTTTGACATATCCATTAGATGATACCGTTGGTATACAAACAACTACGCTAAACAGTTATATAAACGAGGAAATAAACAATAAAGTTTACACAAAAGACTGCAGTAATACTACTGATATCAAAGAAAATATATTTCCATGGGTAACGGCGATATTTATTAAAAACGAAACAAGTAATCAGTTTGATTACTATTGTGATGGAGCGTTATTATCAGATACAATTATAATAACAGCTGCAAGTTGTATTCAGAAGCCAAACGCTACAGCAGATGACATACTAGTCATATTAGGAAAGAGATCTCTTCGTGACATTACAAACGACGAAATGATTTCCAAGATAAAAGAAGTACATGTACACGACAATTTCACTGGTTCGAATCACGACATAGCAATCATTGAAATGGCAGAACCGGCGACATTAAGTGATCGCATCCAACGCGCACGTCTTAGCGGGGATCGGGGCAAATCTGCAACAACCGGCTGGGCTATTTCTGGTACTCTAACGCTAATTCCTTTTGAAGACGAACATAAAGATTGTAACGAAACGTTACCTGAAAAAACATTTTGTGCTGTCTACGGAAATGATGTTAGCGTCTGTCCAAGTTACGGAGGTCTGTATGCTACCAGAGGAATAGACGGGTGGTTTCTTCGAGGTATACGTAGAGCCGATCCTACTAATAAAGATTTTTGTGTAAATCGATCTTTGGTTTACACGGACTTAAAATACTACAGTGATTGGATAAATATGTATGTTAAATAA

Protein sequence:

>DPOGS206573-PA
MKLVLIDILLVIFVKVIPVYSECDTYFGCIQDTLFGSFINLNATETQVSDIDVPGFENVNLESDSVDTILKHISDLFEEFVADYNLSSQFDVVRKNSNKTSIFVKTIETDKFNETFVKNIEDKKNTNVTHSNTTNKSKDAEENTELPTKEHFRVIKMNEKNKTKKLKVETLHKPVLDVNELSKKVTENEILDIDSIDYDLIDNSTDYPIIDYIDLYKDTTNEDVNMRTTVLTYPLDDTVGIQTTTLNSYINEEINNKVYTKDCSNTTDIKENIFPWVTAIFIKNETSNQFDYYCDGALLSDTIIITAASCIQKPNATADDILVILGKRSLRDITNDEMISKIKEVHVHDNFTGSNHDIAIIEMAEPATLSDRIQRARLSGDRGKSATTGWAISGTLTLIPFEDEHKDCNETLPEKTFCAVYGNDVSVCPSYGGLYATRGIDGWFLRGIRRADPTNKDFCVNRSLVYTDLKYYSDWINMYVK-