Monarch geneset OGS2.0

DPOGS202912
TranscriptDPOGS202912-TA5385 bp
ProteinDPOGS202912-PA1794 aa
Genomic positionDPSCF300126 + 353370-367491
RNAseq coverage708x (Rank: top 18%)
Annotation
HeliconiusHMEL0117740.081.19% 
BombyxBGIBMGA004193-TA0.074.56% 
DrosophilaMcr-PA0.055.62% 
EBI UniRef50UniRef50_Q9VLT30.055.62%LD23292p n=19 Tax=Endopterygota RepID=Q9VLT3_DROME
NCBI RefSeqXP_397416.20.058.40%PREDICTED: similar to Macroglobulin complement-related CG7586-PA, partial [Apis mellifera]
NCBI nr blastpgi|3227884580.059.02%hypothetical protein SINV_07362 [Solenopsis invicta]
NCBI nr blastxgi|1953880100.056.39%GJ20388 [Drosophila virilis]
Group
Gene OntologyGO:00056153.4e-52extracellular space
GO:00055768.1e-25extracellular region
GO:00048662.8e-16endopeptidase inhibitor activity
GO:00055155.7e-07protein binding
KEGG pathway 
InterPro domain[1189-1540] IPR0089307.2e-55Terpenoid cylases/protein prenyltransferase alpha-alpha toroid
[1243-1536] IPR0116263.4e-52A-macroglobulin complement component
[1615-1759] IPR0090488.1e-25Alpha-macroglobulin, receptor-binding
[683-815] IPR0116252.2e-21Alpha-2-macroglobulin, N-terminal 2
[954-1044] IPR0015992.8e-16Alpha-2-macroglobulin
[228-318] IPR0028907.4e-15Alpha-2-macroglobulin, N-terminal
[885-928] IPR0021725.7e-07Low-density lipoprotein (LDL) receptor class A repeat
Orthology groupMCL15769 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202912-TA
ATGAATCTCAAAGTGATATCAATATTATTATTTATAATAACAAATGCAACATGGGCTCAAGACTATTCATCTCAAACATACAAACCTGACTACCAAGTGAACCCAACAAACAATTATCTCAACACCAACGATCCAAAATATGACAATCCGTATGATCCTAACAAAAGATTCTCAAATCGTAATCAATACGATATAAATCGTAATCCATACAGTACTGAGCAAAGATACAATCAGTTCGACCGTAATGACCTAGGGACCAATCAGAACCCATATAAGACGGATACCGGAAGACAGACTTACGACACCAGTAGAACATACAGTACTTCATACACGGGAGCCGAGTTGGAACATGATAGTGTTATTATAAATGAAGCCACCTATTTCATTGTTGCATCTCGAATGATCCGGCCGGGGCAAATATATAAGATCTCAGCTAACATTCTGAAAGCTCGTCTCCAGATGACGATCAGAGCATCCATCTCATGTAACGGTGTGGAAATAGCTGACGTTATAGAGAGAGTGAAGGAAGGTGTTCCTGAGGTTTTGAATATGAGGGTGCCCGCTACAACGGTGCCGGGAGACTACAGGCTGCGCGTGGAAGGTCTCTACCTGGACGATCCATTTGGAGGGAGAGCTTTCGTTAACGAAACAAAATTGACTTTCTCACAGAGATTCATGACCATCTTCATACAGATGGACAAGCCGGTGTACATGCAGTCGCAGACTGTTAAGTTCCGTGTGATTCCAATCAACACGGAGTTGAAGGCTTTCGGCCGAGCCATCGACGTCTTCATACTGGATCCCAACAAGCGTATCATGAAGAGGTGGCTCTCCAGACAGAGTAATTTTGGTACGGTGAGCCTGCAGTATCCCCTGTCGGACCAGCCTGTGTTCGGTGACTGGACGGTGAGGGTGGAGGCGCTGGGGCAGAGAGAGGAGGCCAGCTTCCTCGTGGAGGAGTACTATCAGACCAGGTTTGAGGTGAACGTGACTATGCCGGCCTTCTTCTTCAACACGGACGGGTTCATCCACGGTCGCGTGATGGCCAACTACACGTCGGGCGCTCCTGTGCGGGGCAACCTCACTCTTCGCGCCACCCTGCGGCCGGTGCCGTCGTACAGGCCGAGACAGCAACACAGATACAACACACAACGTCCGAACTACAATCCCTACTTATACAACGAGACGGAGCAGTACGACCAGGGACAGGAGGATAGACGCTGGCAGCTCAACCACCGACCCGGGCAGCCCAACCAGCTCGACCATCCCGACTGGTGGTATGACACGCAGAAGGTCGTCTCCAGGGTGTTTAACTTTGATGAGGAGTTCCCGTTCTGGATGCCGAAACCAGACCCTGTTGAGATCGCACAAATTAATATGAACATCAACAACAACAACATGAACAACATGAACAACATGAACAACTACCCAACAACTACATCATATAATAATTATTACAACAATCCCTACAACGACAAGCTGCCTTATTTAAGATTTTTCAACGGCACGTACGACTTCAAGTACCCGATGTCGGAGCTGGCGCAGCTGGTGCCGTCGTTGGAGGGCGTGGAGGTGATCGTGACGGCGACCGTGGGCGACCCGTTCCTGGACGAGGTGGTGGAGGGCTACAGCGTCGCCCGCATCTTCAACTCCTCGCTGGCCGTCACCTTCCTCGGCGGGGAGCTGCAAGTCTTCAAGCCGCACATGCCTTTCGATGTTTATATGGCGGTGTCTTTCCACGACGGGTCTCGTCTCCCGTGGTGGCTGGCTCGGGGCGCGGTTCTGTCCGTGTCGGCGTCGCTGGAGGGCCGCGGCGGTGCCATCGAAGTGCCTCGGCCGACGCTGGTGCCGGGGGACACCGCCGTGTGGCATCTCAAACTAGACCTTTATAAGCTCCTCAAACTGGACAACGACCCCAACTACCGCGAGGTGTTGAACGGCATCACCGGAGTCCGTCTGTCGGCGTCGCTGGTGGACGCTTCGGGAGGCCGAGCCACCGCTGACGCACACTTCGTCGCCCACCAGAGCCCTAACCATCACCACCTCCGAGTGTCCACATCAACTACAGACGCCAGGGTGGGCGAGTACATCGTCTTCCACGTGCAAAGTAACTTCTACATGGAGTCGTTCAACTACGTCGTCATGTCCAAGGGAATCATTCTGACGAGTGGACAAGAAATGATGCAGGAGGGTGTGCGTACATTTTCGGTGGGCGTGAGCGCTGAGATGGCTCCCGTGGGCACGCTGGTCGTGTGGGCGCATCGCAGACGAGCGCTCGTTCTGGCCGACTCCATCACATTCCCCGTTAACGGCATTTCCACTAACAACTTCACAGTGCACATCAACAACCGTAAGCACCGTACAGGCGAGCGCGTGGAGGTGGCCATCTACGGTGAGCCGGGGGCGTACGTCGGCCTGTCCGGCATCGACCACGCCTTCTACACCATGCAGGCCGGCAACGAACTCACCTACGCTAAGGTGTTATCGAAGATGTCTCATTTCGACGAGTCCACTAACGGCACGTTCTCGTACACGTGGCGCTCGCACTTCGGTGACGCGGACGAGCTGGTGTACTTCCCCTCCGCCAGCTTCGGCATCGACGCCAACAGGACATTCGAATACGTGGGTCTGATAGTGCTGAGCGACGTACCGGTGTGGCGGCGGCCGTCCCCCTGCAACTCGTCCTTGGGTCTCGGCGAGTGTCTGGACGGCTCCTGCTACCCTCTCGCCAAGCGGTGTGACGGCAGCCCGGACTGCGCCGACCGCACCGACGAGGCTAACTGTCAGCGCGAGGACTCATTCGAGCTGTCCCACTTCCGCAAGTTCCGGTTCAACCGCATCCAGCGCCAGTACGACAACGCGTGGCTGTGGCGGGACGTGAACATCGGCCCGCACGGCCGGTACGTGTTCACGGCGGACGTGCCCGCCGCGCCCGCGCACTGGACGATCTCCGCCTTCGCCATGTCGCCCACCAGCGGCATCGGCATGCTGGCCAAGCCGCTACACTACGTGGGTATCTTGCCGTTCTTCATGAAGCTGGAGGGTCCCGACAAGTGTCGCCAGGGGGAGCAGCTAGGCCTGCGTGTGGCGGTGTTCAACTATCAGCCCCAGGACATCGAGGCGGTGGTCGTGCTGGCCAGCTCACCCGACTACAAGTTCGTGCACGTCGAGGAGAATGGCATCGTCCGCTCGTACAACCCCCGCACGTCTTTCGGCGAGCACCAGTTCTTCGTGTACATCAAGGCGGGCGACGCGGCCGTGGTGCACATCCCCGTGGTGGCGGCCCGCCTCGGCACCATCAGGGTGGACGTGCTGGCCTCCACGCTGCAGGGACAGCACCGAGAACACAAATACATCACCGTAGAGGCGGACGGTATACCTCAGTACCGCCACCAGTCGGTGCTGCTGGACCTGTCCAACCGCGCCTACGTGTTCCAGTACATGCACGTGAACGTGACGGAGACTCCCATCATCCCGTACGAGGTGGACCGCTACTACGTGTTCGGCTCCAACAAGGCCAGGATCTCCGTAGTAGGAGACGTCGTGGGCCCGCTCTTCCCCACCATGCCCGTCAACGCCACCAGCCTGCTGGACTTACCGATGGATTCTGCGGAACAGAACATGTTCAGTTTCGCGGCCAACATGTACCTGACGCTGTACATGCGGCTCATCAACCAGAGGAACCGCACGCTGGAGAAGGAGGCCTTCTACCACATGAACATCCTGTATCAGAGACAGCTGTCCTTCATGAAGGCGGACGGATCCTTTGGACACTTCAGGAGTGACTGGAACCAGTCGTCCTCGAGCGTGTGGTTGACATCATTCTGCGCTAAGATATTCCAGGAAGCTTCCTTCAATGAATGGGAGAACTACATTTACATCGATCCCGATGTGATATCTATGGCCGTGTCCTGGATCCTGGAGCGCCAGGCTCCGTGGGGCGCCTTCATGGAGGTGCATAGGGCGGCCGACCGGAACGCTAACAGCACCATCGTCGTGCCGCTCGACTCACACCTGTACCGAGACGTTGGCAACCAGCTGCAGGGAGTCAAGGCGAACAACTCTATAATACTACAGCGGAATATAACGCTCACGGCGCAGGTCGTCATCACCCTGGAGTCTGTCAAGAACCTCAAAGACTTCGGGATACGAGAGGGTCTGTCGGCTCGCGTGTCGACGGCGCAGCAGCTGGGAGTGTCGTGGTTGGAACGGAACCTGCAGCTCGTCCACGAGTTCGGCTCCCCGCTGACGATGGCGCTCCTCTCGTACGCGCTCACCGCGGCCAAGGCGCCCTCCGCCGAGCACGCCTTCAGATTACTCAAGAGGCATCAGCGATCCGAGGGTGGTCTGGTGTACTGGGGCAAGGAACCCGTGCCAGCTCCTCCATATAAAATGGAAAACCAGAAGCCCTTCCTGCTTCCTCGGCTGCCCTACAACTACGACTCCAATAACATCGCCGCCACGGCCTACGCTCTGCTCGCCTGCATGGACCATCAGGACAACAACGAACCAATCGTTATGTGGCTTAATTCACAACGTCTCAAAGATGGCGGCTGGGCTTCCACACAGGACACGTACATAGCCCTCCGCGCGCTGATCGAGTACACCAACAGGAAGCGGTTGCGTGACGTGAGCTCGCTGGCGGTGAGCGTGGAAGCCGTGGCGCTGAGCGGCGAGCCCAGGACGCTCACCGTCAGGAATCAAGACCTCGCTACGCTACAAACGGTCGATATTCCCAGCGCTTGGGGTACAGTGAAGGTGACAGCTCGCGGCGCGGGGTACGCCATCCTTCAGATGTCCGTCCAGTACAACGTGGACACGCCGCGGTTCCAGACACAGCCCCCGCGCCCGGCCTTCGCCCTCCTCGCGCGCGCTCACTACTACGGCCGGAATCAGTCGCACATACACTACCAGACATGCGCCAGCTGGACTCACCTGTCGGAGAGTCCGGTGTCGGGGCTGGCGGTGCTGGAGGCAGCTCTGCCGACCGGCTACATGATACAACAGCAACGACTTGACGGATACGTCATGAGTGGAACAGTTCCCAAACTACAGCGAGCCAGATACCAGCCCGGGAAGATTCTGTTCTACTTTGATTACTTGGATCACGAGTTGACCTGTGTGAACTTCACCATCGAGCGCTGGTACCCCGTCGCCAACATGTCGCGATACCTCCCCATACGAGTGTACGACTACTACGCACCCGAACGTTTCAACGAGACGATATTCGACGCTCTCCCCACTTACTTATTGAACATCTGCGAGGTGTGCGGCTCCTCGCAGTGTCCGTACTGCGCCATCTACAACGCCGGGGCTCCGCTGACCGCCTCCGCGCTCCTGCTACTAGCGACCGTCGCCGCGATCATACGGAACGTACGAACGTAA

Protein sequence:

>DPOGS202912-PA
MNLKVISILLFIITNATWAQDYSSQTYKPDYQVNPTNNYLNTNDPKYDNPYDPNKRFSNRNQYDINRNPYSTEQRYNQFDRNDLGTNQNPYKTDTGRQTYDTSRTYSTSYTGAELEHDSVIINEATYFIVASRMIRPGQIYKISANILKARLQMTIRASISCNGVEIADVIERVKEGVPEVLNMRVPATTVPGDYRLRVEGLYLDDPFGGRAFVNETKLTFSQRFMTIFIQMDKPVYMQSQTVKFRVIPINTELKAFGRAIDVFILDPNKRIMKRWLSRQSNFGTVSLQYPLSDQPVFGDWTVRVEALGQREEASFLVEEYYQTRFEVNVTMPAFFFNTDGFIHGRVMANYTSGAPVRGNLTLRATLRPVPSYRPRQQHRYNTQRPNYNPYLYNETEQYDQGQEDRRWQLNHRPGQPNQLDHPDWWYDTQKVVSRVFNFDEEFPFWMPKPDPVEIAQINMNINNNNMNNMNNMNNYPTTTSYNNYYNNPYNDKLPYLRFFNGTYDFKYPMSELAQLVPSLEGVEVIVTATVGDPFLDEVVEGYSVARIFNSSLAVTFLGGELQVFKPHMPFDVYMAVSFHDGSRLPWWLARGAVLSVSASLEGRGGAIEVPRPTLVPGDTAVWHLKLDLYKLLKLDNDPNYREVLNGITGVRLSASLVDASGGRATADAHFVAHQSPNHHHLRVSTSTTDARVGEYIVFHVQSNFYMESFNYVVMSKGIILTSGQEMMQEGVRTFSVGVSAEMAPVGTLVVWAHRRRALVLADSITFPVNGISTNNFTVHINNRKHRTGERVEVAIYGEPGAYVGLSGIDHAFYTMQAGNELTYAKVLSKMSHFDESTNGTFSYTWRSHFGDADELVYFPSASFGIDANRTFEYVGLIVLSDVPVWRRPSPCNSSLGLGECLDGSCYPLAKRCDGSPDCADRTDEANCQREDSFELSHFRKFRFNRIQRQYDNAWLWRDVNIGPHGRYVFTADVPAAPAHWTISAFAMSPTSGIGMLAKPLHYVGILPFFMKLEGPDKCRQGEQLGLRVAVFNYQPQDIEAVVVLASSPDYKFVHVEENGIVRSYNPRTSFGEHQFFVYIKAGDAAVVHIPVVAARLGTIRVDVLASTLQGQHREHKYITVEADGIPQYRHQSVLLDLSNRAYVFQYMHVNVTETPIIPYEVDRYYVFGSNKARISVVGDVVGPLFPTMPVNATSLLDLPMDSAEQNMFSFAANMYLTLYMRLINQRNRTLEKEAFYHMNILYQRQLSFMKADGSFGHFRSDWNQSSSSVWLTSFCAKIFQEASFNEWENYIYIDPDVISMAVSWILERQAPWGAFMEVHRAADRNANSTIVVPLDSHLYRDVGNQLQGVKANNSIILQRNITLTAQVVITLESVKNLKDFGIREGLSARVSTAQQLGVSWLERNLQLVHEFGSPLTMALLSYALTAAKAPSAEHAFRLLKRHQRSEGGLVYWGKEPVPAPPYKMENQKPFLLPRLPYNYDSNNIAATAYALLACMDHQDNNEPIVMWLNSQRLKDGGWASTQDTYIALRALIEYTNRKRLRDVSSLAVSVEAVALSGEPRTLTVRNQDLATLQTVDIPSAWGTVKVTARGAGYAILQMSVQYNVDTPRFQTQPPRPAFALLARAHYYGRNQSHIHYQTCASWTHLSESPVSGLAVLEAALPTGYMIQQQRLDGYVMSGTVPKLQRARYQPGKILFYFDYLDHELTCVNFTIERWYPVANMSRYLPIRVYDYYAPERFNETIFDALPTYLLNICEVCGSSQCPYCAIYNAGAPLTASALLLLATVAAIIRNVRT-