Monarch geneset OGS2.0

DPOGS204415
TranscriptDPOGS204415-TA1128 bp
ProteinDPOGS204415-PA375 aa
Genomic positionDPSCF300002 - 579472-584615
RNAseq coverage204x (Rank: top 47%)
Annotation
HeliconiusHMEL0062605e-12964.80% 
BombyxBGIBMGA007720-TA6e-11963.24% 
DrosophilaSpn4-PB1e-5436.66% 
EBI UniRef50UniRef50_Q5MGH25e-11459.09%Serpin 1 n=14 Tax=Obtectomera RepID=Q5MGH2_LONON
NCBI RefSeqNP_001037021.14e-12561.33%serine protease inhibitor 2 [Bombyx mori]
NCBI nr blastpgi|1562548369e-12562.13%serpin-2 [Spodoptera exigua]
NCBI nr blastxgi|1562548362e-12662.13%serpin-2 [Spodoptera exigua]
Group
Gene OntologyGO:00048677.7e-122serine-type endopeptidase inhibitor activity
KEGG pathwaycqu:CpipJ_CPIJ0147192e-56 
 K13963 (SERPINB)maps-> Amoebiasis
InterPro domain[4-368] IPR0002157.7e-122Protease inhibitor I4, serpin
[3-374] IPR0237963.8e-109Serpin domain
Orthology groupMCL10132 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204415-TA
ATGGATAACCAATCCCTATCTTCATCAGTCGCTAAATTCTCTGCCAAGTTCTGTAATGAGTTGAATCAATCTCAGAGTGTTGTTGCATCTCCACTTTCGGCCAAATTTTTATTGGCTCTTCTTACCTTGGGATCTGAGGATCCTGCTCATTCAGAACTACTTTCATCGCTGGGTATTTCTTCTGATGATGAGATTCGCTCATCATTCAAATCTCTGTCACAAAACCTTCTCTCCATCAAAGGAGTTACCCTCAATGTAGCTAACAAGGTGTATATTAAGGAAGGAGACTATGATCTCAATGAGGATCTTAAGAAGGATGCAGTCTCGGTGTTCAATGCTGCCTTTGAAAAGGTTGATTTTTCTCAAAGTAAAGCTGCTGCCAACCTTATCAACAAATGGGTGGAAGACCAGACAAATAATAAAATCAGGAAACTCATCCCGGCTGATAGTCTCAATGCTGGCACCAGTCTTGTGCTTGTTAATGCAATTTATTTCAAGGGCCCCTGGAGAAGTCCATTTGACCCTTTAAATACGAGTGACCAACCATTCCACATCAGTCCTTCAGAGACGGTAGATGTTCCTATGATGTACAAAGAGGATGACTTCTTCTACTCAGAAAGCAAGGAATTGAATGCTCAGCTGCTGTGTCTGGAATATGTGAAGTCTAAAGCCAGTATGTTGATAGTTCTACCAGAGAAGATTGACGGTCTCAACGAAGTCCTCGCCAAGCTGGCTGATGGGTACGACTTGATCGGTGACGTCAGAAATATGTTCAAAAAGGAAGTCCAAGTTACAATACCGAAGTTCAAGATAGAGACTGAAATCGATCTCGCCGAGTTATTACCCAAACTGGGCATTCAGTCAATCTTCGACCAAAATAACTCCGGCTTGACAAAAATCTTGAATAACTCGGAGCCGCTCTCTGTGTCGAAGGCTGTACAGAAGGCCTTCATCGAAGTCAACGAGGAGGGCGCTGAGGCGGCCGCCGCTTCCGCCATGGTGATGGTGGGATGTTGCCTTACACTTGACGAACCTCAAGTGATTAAGTTCACAGCTGACCGTCCGTTCTTCGTGGCCATCATCTCCAACGAGACCATTTACTTCACAGCCACTTACCGCGGAAACTGA

Protein sequence:

>DPOGS204415-PA
MDNQSLSSSVAKFSAKFCNELNQSQSVVASPLSAKFLLALLTLGSEDPAHSELLSSLGISSDDEIRSSFKSLSQNLLSIKGVTLNVANKVYIKEGDYDLNEDLKKDAVSVFNAAFEKVDFSQSKAAANLINKWVEDQTNNKIRKLIPADSLNAGTSLVLVNAIYFKGPWRSPFDPLNTSDQPFHISPSETVDVPMMYKEDDFFYSESKELNAQLLCLEYVKSKASMLIVLPEKIDGLNEVLAKLADGYDLIGDVRNMFKKEVQVTIPKFKIETEIDLAELLPKLGIQSIFDQNNSGLTKILNNSEPLSVSKAVQKAFIEVNEEGAEAAAASAMVMVGCCLTLDEPQVIKFTADRPFFVAIISNETIYFTATYRGN-