Monarch geneset OGS2.0

DPOGS213175
TranscriptDPOGS213175-TA1407 bp
ProteinDPOGS213175-PA468 aa
Genomic positionDPSCF300114 - 246171-249947
RNAseq coverage1x (Rank: top 94%)
Annotation
HeliconiusHMEL0173587e-2130.46% 
BombyxBGIBMGA007365-TA8e-13450.90% 
DrosophilaCG8213-PC3e-0824.51% 
EBI UniRef50UniRef50_B5DZK01e-0624.51%GA24898 n=5 Tax=Coelomata RepID=B5DZK0_DROPS
NCBI RefSeqXP_002018518.13e-0724.51%GL17749 [Drosophila persimilis]
NCBI nr blastpgi|1984591595e-0624.51%GA24898 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastxgi|3800136766e-0724.90%PREDICTED: uncharacterized protein LOC100869093 [Apis florea]
Group
Gene OntologyGO:00038243.2e-18catalytic activity
GO:00042524.6e-06serine-type endopeptidase activity
GO:00065084.6e-06proteolysis
KEGG pathway 
InterPro domain[32-275] IPR0090033.2e-18Peptidase cysteine/serine, trypsin-like
[43-271] IPR0012544.6e-06Peptidase S1/S6, chymotrypsin/Hap
Orthology groupMCL30495 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213175-TA
ATGATGAATAGAATTAACATATTCCTAATTATCTTTACTCTATCAAATCATCTCAGTGGTTTGTTCGGTCGGATAATATGGGAGGATATAAGATGCTCGTCAGACAGAGAGCAGGCGGTGCCGGAGGGCAAGCTGATGGACATCGGACGGTTCCCTTGGATAGGTGTCGTGCAACATTCATTCTATCTAGGAGGCAAAACAAGATTTGCAGTGACGAGTGCAGTACTGGTACATCCAAAGTACGCGATCGCCTCCGCCGAAGATATATCTAGGATTGATCCTGACACATTGCTGAACAACACGAAGTTCATACTATGGCAGAGCACGAGCAACAAGTTTTCTATGAACGTAGAAGAATACTACCTCCCCTCGGAGTTCACAGAGGGCGTCACGCTCGCCAGCATAGCAATGCTCCTTCTACACATAGACGGCATAGGCGGAACTGTTGCCCCCGTCCTGCCTATCTGCATGCCTATTCCTGGGACGATGCATACATATGACAATTTGTATGCCATCAGAATGAATGAAGATGATGGCGAAATACAAAAAGAAGTCCACAATATGCATTACGTCGTAAATCAAGAGTGCGAAGAATTTTACTACAAAAATAAGCTGAACTACAAGAAGATGTCTCCCTCGAGCAGTATATGTGCCGCGACTGGAGAACAACAGACATGTGTATGGGATGCTGGAGTAGCTCTCATCACCAGGCAGCCCTGGGGATATTGGAAGCTGTTGGGTTTCAGTGTACGAGGACCGGGTTGCAGCGCACCCGCCCGGTTCCTGCCCATCCAACACTATCTGTCCTGGCTAGAAGGGATCATCATCAACGTAGACCGGAGATCAAACAACGAGGACAAAATCTTGACATTTAGAAGAGTGTCCCCGATCGAACTAATAATGTATGAGGGTAAGATCCTTCAACCTAAAGAGTTCGGTATATGTGAACGCAAGATAAGAGGCAACGTGGTGTACAAAGACAGCACTGAACTGCTGATCAATAAGAACTTCGCACAAGGATTCTTCTTCCTGTCAGTCGCTCAAGTAGTGGAGGTGCTGTGTGTTACAATAATCCTGGAGGTGTCGGCGAGGACCAACGCCGCTATTTGGATGGAACATCACTGTCACCGCGGCCTCCTGGGCGACAGTATGAAGTCTACCACGTTCAAGGATTACAGCGCTCGTCAATGCTTCGTGTACTTTAAAACCGAAGCCTATGTAGAATTCCGCTTCTATTTCTCATTCAAGGCATCCTTAGAGGTGACGTTATATGGTAAGGAGGAGAGTCCCAAAATAATTCCAAAACCATGGGTATCTCTTGAAAACACTTACCCGTGGCGGCCGACATACGACTACTTCAGGCAGGCTTCGTTCATGCCTCATTACGCTTGGTGGTGGTCAATGTGA

Protein sequence:

>DPOGS213175-PA
MMNRINIFLIIFTLSNHLSGLFGRIIWEDIRCSSDREQAVPEGKLMDIGRFPWIGVVQHSFYLGGKTRFAVTSAVLVHPKYAIASAEDISRIDPDTLLNNTKFILWQSTSNKFSMNVEEYYLPSEFTEGVTLASIAMLLLHIDGIGGTVAPVLPICMPIPGTMHTYDNLYAIRMNEDDGEIQKEVHNMHYVVNQECEEFYYKNKLNYKKMSPSSSICAATGEQQTCVWDAGVALITRQPWGYWKLLGFSVRGPGCSAPARFLPIQHYLSWLEGIIINVDRRSNNEDKILTFRRVSPIELIMYEGKILQPKEFGICERKIRGNVVYKDSTELLINKNFAQGFFFLSVAQVVEVLCVTIILEVSARTNAAIWMEHHCHRGLLGDSMKSTTFKDYSARQCFVYFKTEAYVEFRFYFSFKASLEVTLYGKEESPKIIPKPWVSLENTYPWRPTYDYFRQASFMPHYAWWWSM-