Monarch geneset OGS2.0

DPOGS212294
Transcript          DPOGS212294-TA    1944 bp
Protein             DPOGS212294-PA    647 aa
Genomic position    DPSCF300077 + 900256-909335
RNAseq coverage     8x (Rank: top 86%)
Annotation
Heliconius        HMEL014950                8e-70    44.17%
Bombyx            BGIBMGA012478-TA          1e-09    32.90%
Drosophila        MP1-PC                    6e-10    22.31%
EBI UniRef50      UniRef50_UPI00022B476D    6e-14    22.66%    UPI00022B476D related cluster n=3 Tax=unknown RepID=UPI00022B476D
NCBI RefSeq       XP_001603793.1            3e-16    24.74%    PREDICTED: similar to polyserase-IA protein [Nasonia vitripennis]
NCBI nr blastp    gi|194668847              6e-16    25.38%    PREDICTED: transmembrane protease serine 9 [Bos taurus]
NCBI nr blastx    gi|194668847              3e-14    25.16%    PREDICTED: transmembrane protease serine 9 [Bos taurus]
Group
Gene Ontology     GO:0003824    5.8e-28    catalytic activity
                  GO:0004252    1.4e-22    serine-type endopeptidase activity
                  GO:0006508    1.4e-22    proteolysis
KEGG pathway 
InterPro domain    [429-623]    IPR009003    5.8e-28    Peptidase cysteine/serine, trypsin-like
                   [212-422]    IPR001254    1.4e-22    Peptidase S1/S6, chymotrypsin/Hap
                   [221-236]    IPR001314    3.8e-06    Peptidase S1A, chymotrypsin-type
Orthology group    MCL34936 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212294-TA
ATGTACCCTATCAGAAGCTTAAGGTCTTTGTCGGGCAAGTTTGCTGTAGCAGGTAATTTAAGAAATGTAGATTTCTACCCACTTGTAGAATCAGATACAACTGGCCAATGGAGAAAATTAAGAAAAGTAGTTTATCCATGGACCTATACATTCCCTAGAGATGATATTGCTATTGTGTTCCTACGTTCTCCATTTATCTACAACAGCTTTGTAAACTATGTACCTATTGCAAGTAAATCAGTTGACTACCAGGGAAAGTGCCTGGTGTCCGGATATGGGCGAATATCCCAGAAGGACTCGTCGGATAAACTTCTTTTAGCCCAATTAGATCTAATCCCAATGCGAGCGTGTAACAGAAAACACCGGCGGAATATGCGAAGATTCGTTTGCACTTCCAGTTTATTCACAGATGTTGGAAAGGGTGACTCTGGTGGACCGTTGGTTTGTTCAAACACTGGTGACCCAAATGAAGAGCCTGGGAAAGGAGTTCTTATCGGTGTAGTGAGTGGACATCGGTACGGAGCAGGCTCATTCTTTACCCGAGTCTCATCTTACTACAAATACGTTAAAAGGAACAAATCGAATAAATTACACTTCAAATACAGCGTTGTTGCCTTCAAATCACTGATAAGACGACCCAGACTTTACAATACTTTTTGTGGAGGCGTCATCATGACACCGACAAAATTGCTCTCCGCAGCTCACTGTTTTGTAACCAAGGGCAATTTTTGTCAGAGATTAATATATAAGGGGGGAATATTAATATCTATGAGGAATAAATATGCTGTAGCAGGTCATTTGAAAAATGTAGATTTTCGCCCCTTTATAGACTCGAACTCCAATGGACAGTGGAGAAGATTGAGAGGAGCTAGCTATCCATCGACTTATAAATTCCCCAGAGATGATATTGCTATTGTGTTCATACGTTCTCCATTTAATTTCAATAACTTCGTCAACAATATACCTATTGCAAGTACACTAGTTGACTACGAAGGTAAATGCCTGGCGTCCGGATTTGGACGAATATCCCAAAAGAAATCATCGGATAAACTCCTTTTGGCGGAACTAGAACTAATACCTATGAAAGAGTGTGACAGAAGGCATCGACAAAATATGAGAAAATTCGTTTGCACGTCCAGTATAGTGTCGGATGTTGATAAGGGTGATTCTGGTGGACCCTTGGTGTGTACAAACACTGGTGATCCAAACGAAGAGTTAGGAAAAGGAGTTCTTATCGGTATAGTTGCTTTCAAATCAGTAGTTCAACGGCCCAGACTTTGTAAAACCTTTTGTGGAGGTGTCATTATGACACCAACGAAGTTGCTTTCTGCAGCTCATTGTTTTGTGGAGAAGGGCAATATTTGTCAGAGACTATTATATGGCACGGGATCCTTAAGATCATTAATGGACAAGTATGCCGTAGCAGGTAATTTAAGAAATACAGATTTTCGCCCCCGTGCGGACTCGAATAATCAAGGACAATGGAGAAAATTAAAAAGAGTTGTTTACCCAAAAACCTATAAATTCCCCAAAGATGATATTGCTGTAGTGTTCCTACGTTCTCCGTTTATTTATAACAGCTATGTCAACTATGTACCTATTGCAAGGAAATTAGTCGACTACCACGGAGAGTGTCTGGTGTCCGGATTTGGACGCATTTCTCATAAGGCTTCATCGGATAAACTTCTATTGGCGAATTTAAAACTTATGCCAATGAAAGGTGATTCTGGTGGACCGTTGGTGTGTGCAAACACTGGTGATCCGAATGAACAGCTTGGGAAAGGAATTCTTGTCGGTATAGTGAGTGGACATCGGTACGGATCAGGCTCATTCTTTACCCGAGTCTCATCTTATTACAAATATATACAACTTAGCAAATCAAATAGATTACACTCTAGAATTAGCATCGTTATAATAATACAAACAATAATATTGCTGTTTTGA

Protein sequence:

>DPOGS212294-PA
MYPIRSLRSLSGKFAVAGNLRNVDFYPLVESDTTGQWRKLRKVVYPWTYTFPRDDIAIVFLRSPFIYNSFVNYVPIASKSVDYQGKCLVSGYGRISQKDSSDKLLLAQLDLIPMRACNRKHRRNMRRFVCTSSLFTDVGKGDSGGPLVCSNTGDPNEEPGKGVLIGVVSGHRYGAGSFFTRVSSYYKYVKRNKSNKLHFKYSVVAFKSLIRRPRLYNTFCGGVIMTPTKLLSAAHCFVTKGNFCQRLIYKGGILISMRNKYAVAGHLKNVDFRPFIDSNSNGQWRRLRGASYPSTYKFPRDDIAIVFIRSPFNFNNFVNNIPIASTLVDYEGKCLASGFGRISQKKSSDKLLLAELELIPMKECDRRHRQNMRKFVCTSSIVSDVDKGDSGGPLVCTNTGDPNEELGKGVLIGIVAFKSVVQRPRLCKTFCGGVIMTPTKLLSAAHCFVEKGNICQRLLYGTGSLRSLMDKYAVAGNLRNTDFRPRADSNNQGQWRKLKRVVYPKTYKFPKDDIAVVFLRSPFIYNSYVNYVPIARKLVDYHGECLVSGFGRISHKASSDKLLLANLKLMPMKGDSGGPLVCANTGDPNEQLGKGILVGIVSGHRYGSGSFFTRVSSYYKYIQLSKSNRLHSRISIVIIIQTIILLF-