Monarch geneset OGS2.0

DPOGS202453
TranscriptDPOGS202453-TA1194 bp
ProteinDPOGS202453-PA397 aa
Genomic positionDPSCF300174 + 28740-32235
RNAseq coverage1216x (Rank: top 10%)
Annotation
HeliconiusHMEL0173044e-9154.52% 
BombyxBGIBMGA009953-TA7e-9854.91% 
DrosophilaSpn4-PB4e-5634.81% 
EBI UniRef50UniRef50_P229222e-10453.32%Antitrypsin n=73 Tax=Ditrysia RepID=A1AT_BOMMO
NCBI RefSeqNP_001166850.12e-10754.08%antitrypsin isoform 3 precursor [Bombyx mori]
NCBI nr blastpgi|13781295e-12559.34%serpin 1 [Manduca sexta]
NCBI nr blastxgi|13781295e-13059.34%serpin 1 [Manduca sexta]
Group
Gene OntologyGO:00048674.2e-108serine-type endopeptidase inhibitor activity
KEGG pathwaydgr:Dgri_GH198622e-58 
 K13963 (SERPINB)maps-> Amoebiasis
InterPro domain[1-396] IPR0237969.1e-114Serpin domain
[31-392] IPR0002154.2e-108Protease inhibitor I4, serpin
Orthology groupMCL10132 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202453-TA
ATGAAGCGTTTGATATTGTTTTTATTTTCCTTAGCCATTATGGCAACGGCTAGTGAAAGGAGTTTGGATAAAATCCTTCTTGATAGTGATAACAAATTCACAGCAAAAATGTTTACTGAAGTAATAAAAGCCCAACCAGGAAAAAGTGTAGTGTTATCTGCTTTTTCGGTTCTGCCACCTCTTGCTCAGCTTGCTCTAGCATCTGTTGGGGAATCACACGATGAACTTCTTGATGTAATTGAAATGCCAAACGACAACATTACCAAAGCAGTATTTTCAAAAGCAAAAACAGATTTGAGATCAGAAAAAGGAGTAACCCTTAAGATGGCAAGCAAAGTCTACGTGGCTGAGAATTATGAATTGAACAATGACTTTGCAGACCTTAGCCGAGACGTTTTTGGATCTGAAGTTGCCAACATTGACTTTTGCAAAAGCGAAAACGCCGCTAAAAAAATGAATCAATGGGTTGAAGATGAAACAAATAATCGTATTAAAGACCTAGTAGATCCCACATCCCTAGATGCTAATACCAAAGCTGTATTAGTTAACGCAATTTACTTTAAGGGTGCATGGAAAACTCCTTTTGAAAAGGAAAGAACAACCGACAGAGATTTCCATGTGAGCAAAGAAAATGTTATCAAAGTCCCCACCATGTACAATTCAGACACCTTTTATTACATTGATAGTAAGGAACTCGACGCACAGGTATTGGAACTTAAATATGAAGGAGAAGATTGTGCTCTGTATGTTGTCCTTCCACATGAAATAGATGGCATTAATAAATTGGAAGAAAAGCTCAGAGACCCATCATTATTGGAAAGTGTTATAAGTACTATGTTTGCATCAGAAGTTGAAGTATATCTCCCTAAATTCAAAATCGAAACAACGATTGAACTCAAGGAAGTTCTGAAAAATATGAATGTAAGGCGATTATTTAGTTCTGGCGAAGCTAGACTCGACAATCTCCTAAAGTCCGTTAGTGATTTGTACATTAATGACGCTAAGCAAAAGGCATTTATTGAAGTCAACGAAGAAGGTGCTGAGGCCGCAGCAGCTAATGAATTTGTTGGCTTTGCAAGCAGTTTGGTGACTTATGAACCCCCACTACCAGTTTTTGATGCTGATAGGCCATTTGTTTTTGTTGTAAAGAAAAGTAATATGCCTTTGTTTACCGGAGTTTATGCTGGAAACTAA

Protein sequence:

>DPOGS202453-PA
MKRLILFLFSLAIMATASERSLDKILLDSDNKFTAKMFTEVIKAQPGKSVVLSAFSVLPPLAQLALASVGESHDELLDVIEMPNDNITKAVFSKAKTDLRSEKGVTLKMASKVYVAENYELNNDFADLSRDVFGSEVANIDFCKSENAAKKMNQWVEDETNNRIKDLVDPTSLDANTKAVLVNAIYFKGAWKTPFEKERTTDRDFHVSKENVIKVPTMYNSDTFYYIDSKELDAQVLELKYEGEDCALYVVLPHEIDGINKLEEKLRDPSLLESVISTMFASEVEVYLPKFKIETTIELKEVLKNMNVRRLFSSGEARLDNLLKSVSDLYINDAKQKAFIEVNEEGAEAAAANEFVGFASSLVTYEPPLPVFDADRPFVFVVKKSNMPLFTGVYAGN-