Monarch geneset OGS2.0

DPOGS215680
TranscriptDPOGS215680-TA1242 bp
ProteinDPOGS215680-PA413 aa
Genomic positionDPSCF300041 - 955538-956779
RNAseq coverage289x (Rank: top 38%)
Annotation
HeliconiusHMEL0133991e-6635.20% 
BombyxBGIBMGA003579-TA3e-12552.43% 
DrosophilaCG3734-PA2e-6432.69% 
EBI UniRef50UniRef50_E2BVA86e-7436.49%Putative serine protease K12H4.7 n=2 Tax=Formicidae RepID=E2BVA8_HARSA
NCBI RefSeqXP_623676.27e-8240.77%PREDICTED: similar to CG3734-PA [Apis mellifera]
NCBI nr blastpgi|1107491791e-8040.77%PREDICTED: putative serine protease K12H4.7-like [Apis mellifera]
NCBI nr blastxgi|3504228948e-8039.81%PREDICTED: putative serine protease K12H4.7-like [Bombus impatiens]
Group
Gene OntologyGO:00065085.8e-122proteolysis
GO:00082365.8e-122serine-type peptidase activity
KEGG pathway 
InterPro domain[1-405] IPR0087585.8e-122Peptidase S28
Orthology groupMCL17333 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215680-TA
ATGAGATACTTTGAAAATGTTCTATATTGGCAAGAGAATGGACCAATCTTTGTGTTCTTGGGAGGAGAAAGCGCATCGTCACCTCAATGGACTAGATTCGGAATAATCCATGAACTGGCAAAAGAATCACAAGGAGCAATGTATGTAACAGAACATAGATATTACGGCGAAAGTAAACCTAAGAACTTGACAAAAGAAGATCAATTCAAATATTTAAGTTCAAGACAAGCTCTTGCTGATATAGCAAAGTTAATACACTATTTAAAATTGTTACCCATGTATAAAAATTCTAAGGTAGTTGTAATTGGAGGATCATATGCAGGTAACTTAGCCGCGTGGATGAAGGTTTTATATCCAGATTTAGTCGACGCGGCTGTAGCAAGCAGCGCACCAGTCTTAGCAAAAAAAGATTTCTTTGAATATCTTGAAAAAGTTACTGAGGATTATGAAACTTATGGAACTCACGGATGCTCGGACAAGATAAAAAATATATTCGACAGATTTCATCAGCTATTGCAGAGCTCTGAAGGCATAAAACAGTTAAAAAAAGAAGAGAATATTTGTGATTCATGTGATATGAGTGTGATAGAAAACCAGGCCGTATTCTTTGAGGTTAAGACAAGTATATTTATGAGTAATTCTCAATATGGATCAACAAAAACTATTAAACAGCATTGTGAAAAATTAAGCGACGTAAGTTATGATACTAAATCTTTAACGGACAACTCTATGTTACCAATCATATATTCTGAGAAGTTAAACTGTTATGATTATGACTTTAACAGAATGATCCAAGTGATGAAAAGTAATGACGATTTGTTTTGGATATATCAAACATGCACTGAATTTGGCTATTATCAAACAACGAATTCGAAGGCACAGATCTTTAAAAACATCCCATTGGAGTTTTACATAAAAATATGTACTGAAATGTTTGGCAATGATTTTAACGAAACAAGAGTGGATCAGGCAGTAAAAAACACGAATAAACTGTATGGAGGATTAAACCCAAATGTGACAAAGGTGGTATTTTCAAATGGCAACCTAGATCCTTGGAGTACGATAGGTGTTTTAGAGGGCTTATCCTACGACGCCCCAGCAGTAGTTATTCCAAGGTCTACTCACTGCGCCGATTTACTTCCTATTTTTGAACCTGACAATGAAGAATTGAAAGAAGCAAGAAAACACATCAAGTATTTAATTAAGAAGTGGATAGGAATAGATGAATACTTAACTTCATAA

Protein sequence:

>DPOGS215680-PA
MRYFENVLYWQENGPIFVFLGGESASSPQWTRFGIIHELAKESQGAMYVTEHRYYGESKPKNLTKEDQFKYLSSRQALADIAKLIHYLKLLPMYKNSKVVVIGGSYAGNLAAWMKVLYPDLVDAAVASSAPVLAKKDFFEYLEKVTEDYETYGTHGCSDKIKNIFDRFHQLLQSSEGIKQLKKEENICDSCDMSVIENQAVFFEVKTSIFMSNSQYGSTKTIKQHCEKLSDVSYDTKSLTDNSMLPIIYSEKLNCYDYDFNRMIQVMKSNDDLFWIYQTCTEFGYYQTTNSKAQIFKNIPLEFYIKICTEMFGNDFNETRVDQAVKNTNKLYGGLNPNVTKVVFSNGNLDPWSTIGVLEGLSYDAPAVVIPRSTHCADLLPIFEPDNEELKEARKHIKYLIKKWIGIDEYLTS-