Monarch geneset OGS2.0

DPOGS214588
Transcript        DPOGS214588-TA   4323 bp
Protein           DPOGS214588-PA   1440 aa
Genomic position  DPSCF300050 - 363687-373761
RNAseq coverage   411x (Rank: top 29%)
Annotation
Heliconius      HMEL004036        0.0  63.07%
Bombyx          BGIBMGA005119-TA  0.0  72.62%
Drosophila      CG5639-PA         0.0  35.69%
EBI UniRef50    UniRef50_D6WS96   0.0  44.00%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WS96_TRICA
NCBI RefSeq     XP_967522.1       0.0  44.00%  PREDICTED: similar to CG5639 CG5639-PA [Tribolium castaneum]
NCBI nr blastp  gi|91087471       0.0  44.00%  PREDICTED: similar to CG5639 CG5639-PA [Tribolium castaneum]
NCBI nr blastx  gi|91087471       0.0  44.24%  PREDICTED: similar to CG5639 CG5639-PA [Tribolium castaneum]
Group
Gene Ontology    GO:0004867  4e-21    serine-type endopeptidase inhibitor activity
                 GO:0005576  2.7e-08  extracellular region
                 GO:0030414  2.7e-08  peptidase inhibitor activity
                 GO:0004857  3.2e-07  enzyme inhibitor activity
KEGG pathway 
InterPro domain  [740-848]   IPR000716  3.5e-21  Thyroglobulin type-1
                 [961-1017]  IPR002223  4e-21    Proteinase inhibitor I2, Kunitz metazoa
                 [728-770]   IPR008197  2.7e-08  Whey acidic protein, 4-disulphide core
                 [139-183]   IPR011061  3.2e-07  Proteinase inhibitor I14/I15, hirudin/antistatin
                 [132-182]   IPR018112  1.1e-06  Proteinase inhibitor I15, antistasin
                 [153-178]   IPR004094  1.6e-06  Proteinase inhibitor I15, antistasin-like
Orthology group  MCL13520    Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214588-TA
ATGGCGCCGAAAGCCGTTAGGGTGGCGTTGTTCTGCGCTTGCCTAGTCATACTACAAGTGAGCGCCGAATTGAAGGGCCGTTGTCCGGCTGACGAGGAAACCTGTCCTCCTCGTGCGACGCCATGTAATGACGACAACGACTGCGGGCATCAGATCTGTTGCAACACCTCCTGTGGGCGATCCTGTGTGGAGCCGCTCTACACCGGATGTGAGAACATAAAGCTGTCTTCGGAGCGAATATCCCGTGCTCTGGCTGCTGAGAACACACGCAGCGGGCGAGGTGTGATGAGGTCGCTGCGATCTCCTCGCTGCAAGGTCTCTGATGGAGAGTTCGAAGAAATACAATGTGATAACGAGATCATAAGCTCGTGTTGGTGCGTGGACGCCGCAGGCTTTGAGGTGCCGGGTACCCGCGCTCCCGCAGCGGGTTTAGTGAACTGTTCACGAACAGCGCCCTGCGCGGCGCACACTTGCCGCATGCTGTGTCCACTCGGCTTCGAACTGGATCCTAACGGCTGTCCGCTCTGCAAGTGTCGCGACCCTTGCTCCACCATCACCTGTCCCAACCAGCTATCCTGTCAGTTGGAAGAGATGCCGTGCTTACGTCCACCCTGTCCCCCAGTACCCACTTGCAAAAGGGGTCGCAGCCTCCAAAACATATGTCCGGTAGGCGAGCCACTTTTCATATCGGAAACGAGACGTCCATTCCTGTGCGGTACGGATCCAGGGAAACCGAACTGCCCGCCTCTGTATAAATGCCTCGTCGAATCTGGCAACGACTACGGCGTCTGCTGTCCAGCGTCACTTGAACTACAAAAGGCCGGTACCTGTCCCGCTCCGAAGTCTTCTGGAATGGACTGTGGGACTCCCTGCGTTCATGACCTCGAATGTCCGTCAATGCAGAAATGCTGCGACGGTGCTGAATGTGGGAGACATTGCGTTCTGCCCCACAACGTCACTATCTGCACTCAGCAGAAAATGCTCGCTGAATTACTGGTTGTTAGCGAGAAAGAAGGTAGAGGATACGTGCCGCAGTGCACGTCTGACGGGTCCTTCCTGTCAAGACAGTGCTCGCGGAACGGGCTCGTGTGTTGGTGTGTAGACACAGACGGCAATAAACTCCGAGGCTCTATGGGACCGTCGGAAACCGTGAAGTGTTCTGCCAAACCCCATCCAGCTCGTACTGGTGCTAGAAGTATTAGTTCCTGTGCGAGGGCCCTCTGCGCCGGGGTCTGCGAGTACGGCTACAAGACCGGCGGCGACGGGTGTCCGAGCTGCGAGTGTGACGATCCCTGTGCTGGGTTCCCCTGCGCAGAGGGAGAGGAGTGCGTGCGAGTCCGGGACGCTGATTGCTCTGGAGAGCTTTGCACTGGTTATCCTGTCTGCCGTCCTAAAATCTCGTATGAGAATCCGTGCTCTGTGGGTGTACCGGCGACGGACGAGCGCGGGGCGGTGTTAACTTGCAGGGAAGGGGGTGAGTGCGGGGAGGGACACAGCTGCACCCGCGGGGGGAGACACGGGCCAGCCGTCTGCTGTCCGCAACCGGATACTGACACGGATAATACCACTGAACCGGAAATACTCGAGATCAACTTCGAAGCGTGTGGTCCGGAGGCTGAAGCGCTCTGCGGAGTGAACTCCACATCCAGCTGCTCGGACGGTGTTTGTGACGGGGACCTGGAGTGCTGCGTGACGGCGGGCTGTGGGCCTGTTTGTGTGGACCAAGACAAATTAAGACTACAGACCGACATTGTTGACGATACGCCCTCTATGTGCGAATACCTCCGAGACTTCGACGAAAAGATGGAAGGTACGGTGGACGGCATGAAGCTGGCTCTTCCGGCGCCGAGCTGCAACCCAGACGGCAGCTTCACGCCGCAGCAGTGTGCCGGCGGACGGTGCTGGTGCGTCGACTCCTTCGGCACTGAAATACCTGAAACGAGCACCAACAACGCATCCGCCGTGGACTGCGACAAGGTGCGGTCGGAGCTTTCCTGCCT
CGAACTGACGTGTCGCATGGGTTGCGACTACGGCTTCGAACTGGGCTCCGGGCGCTGTCCCACTTGTAAATGCCGCGACCCTTGCGCCGGCGTCTCGTGCCCCACGGGCCGGGCCTGCGCCCTCGTAGATGTAGCCTGCGACGCGGATTACTGCCCTCCGGTACCTGCGTGTCTTCCGCGGAAGTCAGGTCAGTGTCCGTATCTGGTGCCGTGGACGGGGTCGTGTGAATGGTCGTGTCGCTCGGACGCGGAGTGCGCCGGTGACGCGAGGTGCTGCGCCACGGGCTGCGGAACAGCCTGTGCCGAGCCGCTGAGACAGACTGGCTGTCAACAGAGACGTGCTCTTGCTTTACACACGGCTGCGGAAAGCGGAAACCCACCCTCGTGGTCGTGGGTCCCTCGCTGTAAGGAAGACGGCTCGTATGAAGGCATCCAGTGCAGAGGATCCACCAACATCTGCTGGTGCGTGGACGGCGTTGGCAATGAGATCCCCGGCACTCGTACAAACAACTCTTCACCAAACTGCACCGCGCCAACTCAGTGCCCAGACCCTAAGTGCGATGAACAGGCAATGTGTCCTCACGGCCGGGAATTAAACGAGAAAGGTTGTCCAACGTGCATCTGTAAGGACCCGTGCGCTGATGCCAAATGTAGAGAAGACGAGACCTGCGAGCTGGTGCCTTTAGAATGCGAGGGTGAAACATGTCCACCGTTGGCCCGCTGCTCGCCGTCTCCTCAGTGTCCGTCAGGGGAGCCTCTCCTGGCCCCCGGTGGCGGAGCCCTTCCCTGTGGCCCCCGCGCCGCCGCCTGCCCCTCTACACACGCCTGTCGGTTCGCTCCCCACGACGCCAAACCAGCCGTCTGCTGTCCAAAGCCTCGAACTGTGTGTTTGGAGAATAAAGACGAGGGTATATGCGAGGGGTCAGGTCTGAACGTGACGCGCTGGCATTTCAACTCGGCTAAGAACAGATGCGAGCGTTTCCTGTACCACGGCTGCTCTGGGAATCACAACAACTTCCGGACCAAGGAAGAGTGCAATGCCGTCTGTCCCGTGTTAAGTCCATGCGAGAGACTACGCGAGAAAAACGAAGCAACCGCCTTGAGGTATGGAAAGGGAACCTTCATACCGGCGTGCGAGGAAAGTGGAGCTTGGCAGTCCGTGCAATGTATGGCGCATATTGACGTCTGCTGGTGCGTGAACGCTCGTGGCGAACCGCAAAAGGGCTCGCTTCTCCGCGGAGGGAAGCCGTCTTGCAACTTCCGACAAGCACGGAAATGGATACGACGCGACCCGCTGGATGAAAAAGACAGAGCTGATGAAGTATTAGAAGAACTGATCAGGCAGATGACAACATATAGAGTAGATGATTTCGAAGAACAAGATGAAGAAGATTCCATAGAGCTGGAGGCTGAACACCGTGAAGGCAATGATCTCCAGGATGTGTCCAGCGAAGACAGCTCCGTGCTGTCGGAAGTCGTGGTCCCGAAACTGGCGGAAACTATACGGAAGACACACCCGGTGCTGGTGACGCCGGTGTCAGAACAAACTGGTCTTAAGACAAAGTGTCAACTGATGCAGGAAGAAGTTGATAATGGTGGTGACGGCTACCGTCCTCGCTGTCACCCTGACGGATCGTTCGCTGCACGTCAGTGTGGAAGAAATCGGTGCTGGTGTGTAGACGCCGCGGGACGGACGCGACACGACACCACACATGCCGACCCTTGCGAGGTCACCCAAATAGAGTCCGCTCTGCTAGAGTTGGAATTGATCGGTACAGAGGAAGACGGAAAGAAGACTCAGAATCTTCTCACAACGAAGCTATCAGCACTAGGTGTTCGAGTGCCAGTGACTATGACAAGAGAAAAGGGCGTGGTGAGGCTGCGGGCGGTGTTGCCAGGGTCAAGGGCCGCTGACGTGGTCTATCAGTTGGAAGCACAGGTGAAGAAGGAGAAACTTTTAAACGCCAACAAATCTGAAGATGGAGTGCTTGGAGCTGATG
TTATTCGTAGCGAGTACCGCCTCGCGCCGCCGCGCACGCTGCAGAGAGAGATACTCAGCGAGTCGACGGTGTCGGCTGCTACGTCGTATCACACAGCTCTGATCGTTCTAGCGGCCACCTCGGCGTTCATCATCAGCGTGCTCTGTGTGCTGGTGATGTTGTACCGCGCACGTCTGCAGCGAGAGCCGCATAAAGCTGAACGCTTCCTGCCTCCCGCACCGCCTGTGTACGTCCTATCGGCGGATGAGAAAGCTGAACTGGCGAGAGCGCTACACGCTCCACCAGCCCCGGTACCGCCAGCGAACGCTGATGAAAGAGTGTAA

Protein sequence:

>DPOGS214588-PA
MAPKAVRVALFCACLVILQVSAELKGRCPADEETCPPRATPCNDDNDCGHQICCNTSCGRSCVEPLYTGCENIKLSSERISRALAAENTRSGRGVMRSLRSPRCKVSDGEFEEIQCDNEIISSCWCVDAAGFEVPGTRAPAAGLVNCSRTAPCAAHTCRMLCPLGFELDPNGCPLCKCRDPCSTITCPNQLSCQLEEMPCLRPPCPPVPTCKRGRSLQNICPVGEPLFISETRRPFLCGTDPGKPNCPPLYKCLVESGNDYGVCCPASLELQKAGTCPAPKSSGMDCGTPCVHDLECPSMQKCCDGAECGRHCVLPHNVTICTQQKMLAELLVVSEKEGRGYVPQCTSDGSFLSRQCSRNGLVCWCVDTDGNKLRGSMGPSETVKCSAKPHPARTGARSISSCARALCAGVCEYGYKTGGDGCPSCECDDPCAGFPCAEGEECVRVRDADCSGELCTGYPVCRPKISYENPCSVGVPATDERGAVLTCREGGECGEGHSCTRGGRHGPAVCCPQPDTDTDNTTEPEILEINFEACGPEAEALCGVNSTSSCSDGVCDGDLECCVTAGCGPVCVDQDKLRLQTDIVDDTPSMCEYLRDFDEKMEGTVDGMKLALPAPSCNPDGSFTPQQCAGGRCWCVDSFGTEIPETSTNNASAVDCDKVRSELSCLELTCRMGCDYGFELGSGRCPTCKCRDPCAGVSCPTGRACALVDVACDADYCPPVPACLPRKSGQCPYLVPWTGSCEWSCRSDAECAGDARCCATGCGTACAEPLRQTGCQQRRALALHTAAESGNPPSWSWVPRCKEDGSYEGIQCRGSTNICWCVDGVGNEIPGTRTNNSSPNCTAPTQCPDPKCDEQAMCPHGRELNEKGCPTCICKDPCADAKCREDETCELVPLECEGETCPPLARCSPSPQCPSGEPLLAPGGGALPCGPRAAACPSTHACRFAPHDAKPAVCCPKPRTVCLENKDEGICEGSGLNVTRWHFNSAKNRCERFLYHGCSGNHNNFRTKEECNAVCPVLSPCERLREKNEATALRYGKGTFIPACEESGAWQSVQCMAHIDVCWCVNARGEPQKGSLLRGGKPSCNFRQARKWIRRDPLDEKDRADEVLEELIRQMTTYRVDDFEEQDEEDSIELEAEHREGNDLQDVSSEDSSVLSEVVVPKLAETIRKTHPVLVTPVSEQTGLKTKCQLMQEEVDNGGDGYRPRCHPDGSFAARQCGRNRCWCVDAAGRTRHDTTHADPCEVTQIESALLELELIGTEEDGKKTQNLLTTKLSALGVRVPVTMTREKGVVRLRAVLPGSRAADVVYQLEAQVKKEKLLNANKSEDGVLGADVIRSEYRLAPPRTLQREILSESTVSAATSYHTALIVLAATSAFIISVLCVLVMLYRARLQREPHKAERFLPPAPPVYVLSADEKAELARALHAPPAPVPPANADERV-
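
Length check:

The lengths listed in this record (4323 bp transcript, 1440 aa protein) can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the transcript is coding sequence only, with no untranslated regions, and that the trailing "-" in the protein sequence marks a single stop codon. The function name coding_length_consistent is hypothetical.

```python
def coding_length_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """Check whether a transcript length matches a protein length.

    Assumption: the transcript is pure coding sequence, so it should
    contain 3 nucleotides per residue plus one 3-nucleotide stop codon.
    """
    return transcript_bp == 3 * (protein_aa + 1)


# Lengths as listed in this record: 4323 bp and 1440 aa.
print(coding_length_consistent(4323, 1440))
```

Under those assumptions, 3 x (1440 + 1) = 4323, so the two listed lengths agree. This is consistent with the nucleotide sequence ending in the stop codon TAA.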