Monarch geneset OGS2.0

Gene: DPOGS201116
Transcript: DPOGS201116-TA (1305 bp)
Protein: DPOGS201116-PA (434 aa)
Genomic position: DPSCF300137 + 206501-208614
RNAseq coverage: 5x (Rank: top 88%)
Annotation
Heliconius: HMEL017984, 1e-27, 29.73%
Bombyx: BGIBMGA013665-TA, 4e-28, 28.78%
Drosophila: (no entry)
EBI UniRef50: (no entry)
NCBI RefSeq: (no entry)
NCBI nr blastp: gi|332241388, 7e-07, 27.33%, PREDICTED: kallikrein-15-like isoform 3 [Nomascus leucogenys]
NCBI nr blastx: gi|124808185, 3e-06, 24.88%, transcription factor with AP2 domain(s), putative [Plasmodium falciparum 3D7]
Group
Gene Ontology:
GO:0003824, 7e-25, catalytic activity
GO:0004252, 7.9e-06, serine-type endopeptidase activity
GO:0006508, 7.9e-06, proteolysis
KEGG pathway: (no entry)
InterPro domain:
[60-202] IPR009003, 7e-25, Peptidase cysteine/serine, trypsin-like
[63-129] IPR001254, 7.9e-06, Peptidase S1/S6, chymotrypsin/Hap
Orthology group: (no entry)
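
The genomic position field of this record can be split into scaffold, strand, and coordinates. The following is a minimal Python sketch. The function name `parse_position` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state. Under that assumption the locus spans 2114 bp on the scaffold, longer than the 1305 bp transcript, which would be consistent with the gene containing introns.

```python
import re

def parse_position(field):
    """Split a 'SCAFFOLD STRAND START-END' field into its parts.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record). Returns (scaffold, strand, start, end, span).
    """
    m = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", field.strip())
    if m is None:
        raise ValueError(f"unrecognised position field: {field!r}")
    scaffold, strand = m.group(1), m.group(2)
    start, end = int(m.group(3)), int(m.group(4))
    span = end - start + 1  # inclusive span on the scaffold
    return scaffold, strand, start, end, span

print(parse_position("DPSCF300137 + 206501-208614"))
```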
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201116-TA
ATGCATTCCACTGAACATAATATGGATTTTATTGAGGATTTTTTGAAAAAAATTGATAAACTCACAATCAACGACTTCTTGGATAAAGAATCTGATGTGTTCATACAAAAAACTAGAAGGTTCGAAAATAATCGATATTATGACTTAAAAATATGGAGTGAAAAACCAAAGAGACGATTATACAAAGGCGAGGCGACCGGCATCAATGACTACCCATTCATAGTGAGCGTTCACGTCAAGGAAGAATTCAGTTGCTCTGGAAACATCATCAGTAAAGCTTTTGTACTGACATCAGCTTCCTGTCTACAAACAATGGAAAAACACACGTATTTTAGAACAAATAGAAATATACGAAGGATTGATTTTGACGATAGAAAAAGAAAAGTTCTGGAGAGCTCACATCAAATCGCCCTTCTAGGTTGGGGGTCGAAACTTAAGGACGACGGTGGCCCTGCCATTGTGAACGGTATACTAGTTGGAGTTGTCAGCTTCGCTCCAACGATTTGCGGGGCGCGGAACTCACCGACTGTGTTTACAAAACTGAGCTGCTATAGTAAATGGATTCGAAGCATTATTAATGAAGACGCCTCAATGCAATGGAACACTACAACTGAAAAAGAAACCTTATCCCAATCAAATTATACTCGAAACGTTCATGTTAAAAAGAATGATAACGAAAATAATTTAAAAAATACAGGGAACAAAGACTTCGAGTCTTACATCCTCCGTGGACAAAATGATTCTATAAAAAAATTAAACAAAAAATTACCATACGAGATCAAGTTGAAGGTCAACAACAAGGATAAAATAAAGAGCACGATGAAATACAACAAAACTCATGTTGACCAAAGAAATGATCAAAGAAGTGTTGTAGAATTACCAAACAATTACAACTATTATAGTGTTGAAAGAGATTATTCAGACGAAGAGGGATTGAGCATAGAAGATATTAAAGAGAATACACTAAACCAAGATAACACAACGACCCGAGACGCTGAAACACAATCTGATCGTAAAACTGATGTACAGAAAGAAAAAGTCATAGAGGAACATTTTATTAATATTCTTGAGAACATTTTAAAGATATCGACTAAAAAGCAATTTAAAGATTTCTTACGGCGAGTTACTTTAAGTTTGAAATCGACTCTGGCAACACAAACTCACAAACGTAATATAAGAAATGGCGACACTTTAATAGATAGAAAAAGACAAACAAGCTCGACTAAAGCTATGTCCTCGACCTCGACCACGGCAAAGCCCTTAAATATTTACAAAAAAATTATTGACTTCTTCACCAAAAACTGA

Protein sequence:

>DPOGS201116-PA
MHSTEHNMDFIEDFLKKIDKLTINDFLDKESDVFIQKTRRFENNRYYDLKIWSEKPKRRLYKGEATGINDYPFIVSVHVKEEFSCSGNIISKAFVLTSASCLQTMEKHTYFRTNRNIRRIDFDDRKRKVLESSHQIALLGWGSKLKDDGGPAIVNGILVGVVSFAPTICGARNSPTVFTKLSCYSKWIRSIINEDASMQWNTTTEKETLSQSNYTRNVHVKKNDNENNLKNTGNKDFESYILRGQNDSIKKLNKKLPYEIKLKVNNKDKIKSTMKYNKTHVDQRNDQRSVVELPNNYNYYSVERDYSDEEGLSIEDIKENTLNQDNTTTRDAETQSDRKTDVQKEKVIEEHFINILENILKISTKKQFKDFLRRVTLSLKSTLATQTHKRNIRNGDTLIDRKRQTSSTKAMSSTSTTAKPLNIYKKIIDFFTKN-
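
The relationship between the two sequences above can be checked by translating the transcript with the standard genetic code. The following is a minimal Python sketch. The function name `translate` is hypothetical, the use of the standard code for this nuclear gene is an assumption, and the stop codon is written as "-" to match the trailing character of the protein record. A 1305 bp coding sequence holds 435 codons, that is 434 amino acids plus one stop codon, which agrees with the 434 aa length listed for the protein.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"   # codons starting with T
         "LLLLPPPPHHQQRRRR"   # codons starting with C
         "IIIMTTTTNNKKSSRR"   # codons starting with A
         "VVVVAAAADDEEGGGG")  # codons starting with G
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1, codon by codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)

# First eight and last four codons of DPOGS201116-TA.
print(translate("ATGCATTCCACTGAACATAATATG"))
print(translate("ACCAAAAACTGA"))
```

Passing the full DPOGS201116-TA sequence to `translate` and comparing the result with the DPOGS201116-PA record is one way to confirm that the two entries correspond.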