Monarch geneset OGS2.0

DPOGS201126
TranscriptDPOGS201126-TA1035 bp
ProteinDPOGS201126-PA344 aa
Genomic positionDPSCF300137 + 446006-454368
RNAseq coverage413x (Rank: top 29%)
Annotation
HeliconiusHMEL0075176e-5738.44% 
BombyxBGIBMGA014404-TA5e-1130.84% 
DrosophilaCG40160-PD3e-0825.30% 
EBI UniRef50UniRef50_D6WCY82e-0627.22%Serine protease H59 n=1 Tax=Tribolium castaneum RepID=D6WCY8_TRICA
NCBI RefSeqNP_001037053.14e-1029.54%clip domain serine protease 11 [Bombyx mori]
NCBI nr blastpgi|2897396372e-0928.39%large serine protease [Glossina morsitans morsitans]
NCBI nr blastxgi|1700672733e-0924.50%elegaxobin-2 [Culex quinquefasciatus]
Group
Gene OntologyGO:00038241.7e-21catalytic activity
GO:00042521.9e-12serine-type endopeptidase activity
GO:00065081.9e-12proteolysis
KEGG pathway 
InterPro domain[99-330] IPR0090031.7e-21Peptidase cysteine/serine, trypsin-like
[122-314] IPR0012541.9e-12Peptidase S1/S6, chymotrypsin/Hap
Orthology groupMCL34383 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201126-TA
ATGGACATTGGACGCGCATTCAAGATGCTGGTGTTTAGCAAGGTCGTTATCATCTTGTGGGTTGCCGTGGAAGTGGTCGCTGCCCAAGCAGACACTACCTCGGATATAGACTTAACGCCACTTTTAACAGACCACTTCGGAAAGTATGTGAAATGCACTACCAAGCATGGTAAAGAGGGTCAGTGTGTCAACGAGAGATTCTGCAATCAAGATCACGTTATTTTACCAGGAGGTGACATTGAAGACATCGATCTTAGAGAAAAACACGTTTGTGTTGGAGAGCTGGACTGGTGTTGCTCGGTTGAATTATTGAACACACCCACACAAGCCAAGACTACACCTAATGATAGGTGTCTAGCGACAAACGATGAGGAGAGTTCGTGGACTGTGGCGCTGTACAAGTTGCAACCAGAAAATCCACAATCATCGCGTCTCTTCTGCGCCGGTGTCCTCATCTCTTCAAACGTGGTGCTCACTTCGGCGACCTGTCTTCTAGCAGCACGTAGCCACAAACTGTATGTTCACGCGCCCGCCTCAACCGTTAAGAAGAATTACACGGTCCAATACCGCGTAGCCCATAAAGATTATAATTCTGGTACACATGTACATGACTTCGGTATTTTGGTGTTGGAGAAAAACGTTGAATGGGGCGACGAAAAGCCAAGGAGTGCCTGTCTCGACTTCATCACCAAATTACAAGGGGACTGTTTAGCAACAGGATTCAGTAGTGACTTTGATGTTGCGACGACATTATTGACCGTCGAGCAGAAGGGTTGTCGTCCCAAACAAGATCCAGGCGACGTGTGTTGTGGCACGAACGTCAGTGACTCTGACGATTGCATCATAACCCCCGGCGCGCCAGTGCTTTGCCTGTCTAGTGACAAAGTGCTGACGGTAGTTGGAGTATCAAGGTCGGTGTGTAAAGATAACAAAAGAGTCAGCATCGGAGAACTGTCTTCAGTCAAAACGTGGCTTCAGAGTGAATTACAGAAAATTAATCTCCAACAAAGTGTGTATACTTATAAAAAGTTATGA

Protein sequence:

>DPOGS201126-PA
MDIGRAFKMLVFSKVVIILWVAVEVVAAQADTTSDIDLTPLLTDHFGKYVKCTTKHGKEGQCVNERFCNQDHVILPGGDIEDIDLREKHVCVGELDWCCSVELLNTPTQAKTTPNDRCLATNDEESSWTVALYKLQPENPQSSRLFCAGVLISSNVVLTSATCLLAARSHKLYVHAPASTVKKNYTVQYRVAHKDYNSGTHVHDFGILVLEKNVEWGDEKPRSACLDFITKLQGDCLATGFSSDFDVATTLLTVEQKGCRPKQDPGDVCCGTNVSDSDDCIITPGAPVLCLSSDKVLTVVGVSRSVCKDNKRVSIGELSSVKTWLQSELQKINLQQSVYTYKKL-