Monarch geneset OGS2.0

DPOGS200898
TranscriptDPOGS200898-TA1485 bp
ProteinDPOGS200898-PA494 aa
Genomic positionDPSCF300066 - 93546-99956
RNAseq coverage541x (Rank: top 23%)
Annotation
HeliconiusHMEL0133990.072.93% 
BombyxBGIBMGA000549-TA1e-16658.08% 
DrosophilaCG9953-PA6e-12947.30% 
EBI UniRef50UniRef50_E3WUW37e-12848.80%Putative uncharacterized protein n=6 Tax=Endopterygota RepID=E3WUW3_ANODA
NCBI RefSeqXP_972061.15e-14752.02%PREDICTED: similar to thymus-specific serine protease [Tribolium castaneum]
NCBI nr blastpgi|910788581e-14552.02%PREDICTED: similar to thymus-specific serine protease [Tribolium castaneum]
NCBI nr blastxgi|910788581e-14352.12%PREDICTED: similar to thymus-specific serine protease [Tribolium castaneum]
Group
Gene OntologyGO:00065081.4e-188proteolysis
GO:00082361.4e-188serine-type peptidase activity
KEGG pathway 
InterPro domain[50-491] IPR0087581.4e-188Peptidase S28
Orthology groupMCL13418 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200898-TA
ATGCATCGAATATTAAGTTCTATATTATTTTTATTTTTACTTACAAGTTTCGTTCAATGTGGTGTCAACTTTAGACTGGGACGGAGTAAACATGGGAATTTAGGAGCACCTGTTGGAGCTGATAAAGACAGTTTGCCTCCAAACAAATATTTCTTACAGAAATTGGATCACTCCAGTCCAACAGACCAGAGATATTGGGAACAGAGATATTTCGTAAATGAAAGTTTTTATGATTTCAACAATCCCGGACCGGTTTTTCTGATGATTGGTGGCGAAGGTACCGCTGATCCTCGATGGATGGTTAAAGGAACATGGATTGATTATGCAATTCATTTTAAAGCGCTCTGCATTCTACTGGAACATCGCTACTATGGACAAAGTCGGCCAACTATGGACCTCAGTGTTAAAAACCTTCAATACTTATCATCATATCAAGCGCTGGCTGATTTGGCTTACTTTATAAATGCTATGAATAATAAATACAAGTTCAATAAAGATGTTAAATGGGTGGTATTTGGAGGTTCATATCCTGGTTCTCTGGCGGCTTGGATGAGACTTAAATACCCTCATTTGGTACATGCCGCTGTCTCTTCCAGCGGACCGCTTGTGGCCAAAGTAAATTTTATGGAATATTTCCAAGTGGTAGTGAATGCTCTTCGTGAGAAGACTGGCGGTGAAGAATGCGTCGGGCAAGTGAAGTTGGCTCACAAACAGATCCAAGAAATTATCAAAACTGACCCCGCTACTATTGAACGAGAGTTTAGAGTTTGCGAACCTTTTTCCAAAGCATCTCAGAATGACATGAAGAACTTTTATAACTCCATCGCCGATGACTTTGCGGATTTAGTTCAATATAATGAGGATAATCGTATCAGCGGTGACAAGATGTATAAGAATCTTACAATCAACTCGGTGTGCGATATGTTGACGGAGCCAGGTGGTAAACCAGCATTCAAGAAGCTAGCGGCCTACAATTCAATAGTACTGAATAAATCCAATCAAACCTGCTTGGACTACGGTTATGACAATATGATAAAAGAAATGAGAAACATCAGTTGGGGATCGGAAGGAGGCCGTCAATGGATGTATCAGACCTGCACAGAGTTTGGGTTTTATCAGACCTCCTCCAGCGAGATAGAAGTATTCGGAGACTTCTCACTGGAGTTCTTCATACAACAGTGCAAGGACGTCTTCGGCAGCAAATTTAACGATGCTTTTATAAACGATGCGGCCAAATGGACGAACAGTGACTACGGTGGATTGAACATACCGGCTAAGAGAGTTGTGTACGTGCACGGCTCGATAGACCCATGGCACGCCCTCGGCATGACCACCACTGAGGAAAACGACGCTCCAGCTATATTCATTAGAGGTACGGCCCACTGCGCGAACATGTACCCTGCAAGCAAGAACGATAATCCGGGGCTGGTGTCGGCGAGGATGGAGGTCCGCAGCTACCTGGAGTCCTGGCTCGGGATGCCCTGA

Protein sequence:

>DPOGS200898-PA
MHRILSSILFLFLLTSFVQCGVNFRLGRSKHGNLGAPVGADKDSLPPNKYFLQKLDHSSPTDQRYWEQRYFVNESFYDFNNPGPVFLMIGGEGTADPRWMVKGTWIDYAIHFKALCILLEHRYYGQSRPTMDLSVKNLQYLSSYQALADLAYFINAMNNKYKFNKDVKWVVFGGSYPGSLAAWMRLKYPHLVHAAVSSSGPLVAKVNFMEYFQVVVNALREKTGGEECVGQVKLAHKQIQEIIKTDPATIEREFRVCEPFSKASQNDMKNFYNSIADDFADLVQYNEDNRISGDKMYKNLTINSVCDMLTEPGGKPAFKKLAAYNSIVLNKSNQTCLDYGYDNMIKEMRNISWGSEGGRQWMYQTCTEFGFYQTSSSEIEVFGDFSLEFFIQQCKDVFGSKFNDAFINDAAKWTNSDYGGLNIPAKRVVYVHGSIDPWHALGMTTTEENDAPAIFIRGTAHCANMYPASKNDNPGLVSARMEVRSYLESWLGMP-