Monarch geneset OGS2.0

DPOGS204175
Transcript: DPOGS204175-TA; 2196 bp
Protein: DPOGS204175-PA; 731 aa
Genomic position: DPSCF300034 - 28800-75903
RNAseq coverage: 82x (Rank: top 64%)
Annotation
Heliconius: HMEL007771; 1e-96; 83.17%
Bombyx: BGIBMGA005091-TA; 7e-162; 84.85%
Drosophila: amon-PA; 0.0; 80.00%
EBI UniRef50: UniRef50_G5ECN9; 0.0; 70.63%; Prohormone convertase 2 n=17 Tax=Bilateria RepID=G5ECN9_CAEEL
NCBI RefSeq: XP_308012.4; 0.0; 81.00%; AGAP002176-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|347967315; 0.0; 81.00%; AGAP002176-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|347967315; 0.0; 81.00%; AGAP002176-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology: GO:0004252; 3e-57; serine-type endopeptidase activity
Gene Ontology: GO:0006508; 3e-57; proteolysis
KEGG pathway 
InterPro domain: [1-730]; IPR015500; 3.8e-280; Peptidase S8, subtilisin-related
InterPro domain: [296-564]; IPR000209; 3e-57; Peptidase S8/S53, subtilisin/kexin/sedolisin
InterPro domain: [560-706]; IPR008979; 3.3e-46; Galactose-binding domain-like
InterPro domain: [611-697]; IPR002884; 5.3e-29; Proprotein convertase, P
InterPro domain: [21-95]; IPR009020; 1.1e-19; Proteinase inhibitor, propeptide
Orthology group: MCL14988; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204175-TA
ATGATGTGGTTGCAATGCTTGGCTGCGACGCTGCTGGTTGCCGCGGCCGCTGGGGAAGTCTTCACGAACTCGTTCCTCGTCCGTTTCAAGAGATCCGTCGGCCACGAGGAGGCCCACGAGCTCGCTCTGCAGCACGGCTTTGACAACCTTGGCACGGTTCTTGGCTCCGAGACCGAATGGCATTTCGCACACCGCGGACTGCCTCAAGCTCGGCCGAGGCGATCCATTGCACACGCCCGACTTCTGAAGAGGCATCCACTGGTTCACACAGCGGTGCAGCAGACTGGGTTCAAGCGAGTCAAACGTGGGTTCCGTCCACTTCGCTTACCAGAGACTGCACCAGCACCGGCCGAGCCGCGTGATCCCTACTTCCCCCTCCAATGGTACTTAAAAAATACCGGTCAAAATGGCGGCAAGCCCAAACTTGATCTGAACGTAGAGGCCGCCTGGGCCCAAGGTTATACCGGGGTAAATGTCACCACTGCCATCATGGACGATGGGGTTGATTACATGCATCCAGATTTAAAATATAATTATAACGCAGAGGCTTCTTATGACTTCAGTAGTAACGATCCGTTCCCATACCCTCGTTATACAGACGATTGGTTTAACAGTCACGGCACGAGGTGCGCTGGTGAGGTGGCGGCAGCTCGTGACAACGGCGTCTGCGGCGTGGGGGTCGCCTACCACTCGAAGGTCGCTGGTATCAGAATGTTGGATCAGCCGTACATGACGGATCTTATAGAAGCCAATTCCATGGGACACGAGCCCCACAAGATACACATCTACAGCGCTTCATGGGGCCCCACAGATGATGGGAGAACAGTCGATGGGCCCAGAAACGCTACTATGAGAGCTATCGTCAGAGGTGTTAATGAGAACGCAGAGGCTTCTTATGACTTCAGTAGTAACGATCCGTTCCCATACCCTCGTTATACAGACGATTGGTTTAACAGTCACGGCACGAGGTGCGCTGGTGAGGTGGCGGCAGCTCGTGACAACGGCGTCTGCGGCGTGGGGGTCGCCTACCACTCGAAGGTCGCTGGTATCAGAATGTTGGATCAGCCGTACATGACGGATCTTATAGAAGCCAATTCCATGGGACACGAGCCCCACAAGATACACATCTACAGCGCTTCATGGGGCCCCACAGATGATGGGAGAACAGTCGATGGGCCCAGAAACGCTACTATGAGAGCTATCGTCAGAGGTGTTAATGAGGGTCGGAACGGTCTTGGCAACATCTACGTTTGGGCGAGTGGTGACGGTGGTGAAGATGATGACTGCAACTGTGACGGATACGCAGCATCCATGTGGACTGTGTCCATTAATAGCGCAATAAACGACGGTCAGAATGCGCACTACGACGAGTCGTGTTCCTCGACGTTAGCGAGTACGTTCAGTAACGGGGCTAGAGATCCCAGTACTGGAGTAGCTACAACCGACCTGTATGGGAAGTGTACGGCGACACACTCCGGAACCTCTGCAGCCGCGCCAGAAGCAGCAGGTGTCTTCGCCTTGGCTTTACACGCGAATACACCTAGAGAAATAGACAGTGGAAGTATTGAAGTTGACACTAAAGAGAAATTTCAGGGCCGTTTCCATTGGACTATGAACGGAGTGGGTCTGGAATTCAATCACCTGTTTGGTTTCGGTGTCCTTGACGCCGGAGCTATGACAGCACTCGCCGCAAATTGGCGATCTGTGCCGCCGAGATACCATTGCGAAGCCGGCTCAGTCGACACTCACACCGAACTCCCATCGGAAGGCAGTATCACACTCCAGATAGACACGTCAGCATGCGCGGGCACCCCTAGCGAGGTTCGGTATTTGGAGCACGTGCAAGCTGTTGTCAGTGCCAACGCTACCAGACGAGGGGATTTGGAACTCTTCCTCACCAGCCCTATGGGAACCAAATCGATGATCCTGAGCAGGCGTGCAAATGATGACGACAGCCGTGACGGTTTTACAAAGTGGCCGTTCATGACTACGCACACTTG
GGGAGAGTATCCGCAAGGAGTTTGGTCTTTGGAGGCTAGATTCAGCAGCCCAGGTCGATCTGGTTGGTTGCGAGGCTGGTCACTGGTGCTCCATGGCACACGAGCGCCGCCATACGCTCAGCTCCAGCCGCAAGACCCTCGCTCCAAGCTAGCAGTCGTCAAGAAGGCACACGAAGACAACGCTATCAACGACTAG

Protein sequence:

>DPOGS204175-PA
MMWLQCLAATLLVAAAAGEVFTNSFLVRFKRSVGHEEAHELALQHGFDNLGTVLGSETEWHFAHRGLPQARPRRSIAHARLLKRHPLVHTAVQQTGFKRVKRGFRPLRLPETAPAPAEPRDPYFPLQWYLKNTGQNGGKPKLDLNVEAAWAQGYTGVNVTTAIMDDGVDYMHPDLKYNYNAEASYDFSSNDPFPYPRYTDDWFNSHGTRCAGEVAAARDNGVCGVGVAYHSKVAGIRMLDQPYMTDLIEANSMGHEPHKIHIYSASWGPTDDGRTVDGPRNATMRAIVRGVNENAEASYDFSSNDPFPYPRYTDDWFNSHGTRCAGEVAAARDNGVCGVGVAYHSKVAGIRMLDQPYMTDLIEANSMGHEPHKIHIYSASWGPTDDGRTVDGPRNATMRAIVRGVNEGRNGLGNIYVWASGDGGEDDDCNCDGYAASMWTVSINSAINDGQNAHYDESCSSTLASTFSNGARDPSTGVATTDLYGKCTATHSGTSAAAPEAAGVFALALHANTPREIDSGSIEVDTKEKFQGRFHWTMNGVGLEFNHLFGFGVLDAGAMTALAANWRSVPPRYHCEAGSVDTHTELPSEGSITLQIDTSACAGTPSEVRYLEHVQAVVSANATRRGDLELFLTSPMGTKSMILSRRANDDDSRDGFTKWPFMTTHTWGEYPQGVWSLEARFSSPGRSGWLRGWSLVLHGTRAPPYAQLQPQDPRSKLAVVKKAHEDNAIND-