Monarch geneset OGS2.0

DPOGS206797
Transcript        DPOGS206797-TA  4635 bp
Protein           DPOGS206797-PA  1544 aa
Genomic position  DPSCF300001 - 4550876-4557741
RNAseq coverage   836x (Rank: top 15%)
Annotation
Heliconius        HMEL025047        0.0     70.92%
Bombyx            BGIBMGA000591-TA  0.0     60.73%
Drosophila        CG14964-PB        3e-132  47.11%
EBI UniRef50      UniRef50_Q8T104   0.0     58.52%  Projectin-like protein n=1 Tax=Bombyx mori RepID=Q8T104_BOMMO
NCBI RefSeq       NP_001108467.1    0.0     58.52%  projectin-like protein [Bombyx mori]
NCBI nr blastp    gi|169234692      0.0     58.52%  projectin-like protein [Bombyx mori]
NCBI nr blastx    gi|169234692      0.0     59.01%  projectin-like protein [Bombyx mori]
Group
Gene Ontology     GO:0005737  1.9e-173  cytoplasm
                  GO:0008235  1.9e-173  metalloexopeptidase activity
                  GO:0004177  1.9e-173  aminopeptidase activity
                  GO:0019538  1.9e-173  protein metabolic process
                  GO:0030145  1.9e-173  manganese ion binding
                  GO:0005622  1.3e-80   intracellular
                  GO:0006508  1.3e-80   proteolysis
                  GO:0005515  8.6e-14   protein binding
KEGG pathway      aga:AgaP_AGAP001952  2e-130
                  K11142 (LAP3) maps-> Glutathione metabolism
                                       Arginine and proline metabolism
InterPro domain   [1054-1544]  IPR011356  1.9e-173  Peptidase M17
                  [1223-1533]  IPR000819  1.3e-80   Peptidase M17, leucyl aminopeptidase, C-terminal
                  [305-429]    IPR008957  6.2e-27   Fibronectin type III domain
                  [326-414]    IPR013783  5e-22     Immunoglobulin-like fold
                  [1060-1191]  IPR008283  3.5e-20   Peptidase M17, leucyl aminopeptidase, N-terminal
                  [242-318]    IPR013098  1.4e-15   Immunoglobulin I-set
                  [322-403]    IPR003961  8.6e-14   Fibronectin, type III
Orthology group   MCL10324  Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206797-TA
ATGGGAAACCACTCATCCTCCCACGTCGCTCACAAGTCTAGGAAGAACGTTCACTGGAAGTCTGCAGGACGGCCAGGAGTGCCGGGTAAGCCAGAAATTATTCCATTTCTCTCCGATGAAGAACCCAACGCTATAACACTGAAATGGAACCCACCGAGCCACGACGGCGGGGCACCTCTACAAGGTTACCAGGTAGAATGTAACCGTTTAGGTTCACCGGACTGGGTGCGAACTGCTCCACCAGTGGTATGGCGACCGGAATTGCTCCTAAGCGGCCTCGAACCACCTCATCGTTACCAATTTCGTGTGACAGCAATCAATGCCGTAGGTCGCTCTGATTACAGCGAGTTATCTGATATCCTTACTGTCAATGCAAACCGCACATCACAAGAACCACCTTTATTCCTACAACATGTAGAAAATGTCACAGCTCTTGAAAATGACAAGACAGAATTTCGTGTTTCATTTACGGGTACTCCTGTACCAACAATAGCATGGTTTAAGGATGACTATGAAATATTCAGCAGTAGACGGACCGCAATATCAACCACAGATTCAACAAGTATCCTTATTTTTCATCAAACAATTGCTAGTGATGAGGGCGAAATCAAATGTACGGCAACTAACCGTGCCGGCCATGCTGTGAGTAAAGGAAGGTTATCATTAGAAGCCGCCCCTAAATTACGATATCCTCGACAGTACGAAGACGGACTGCTATACGAAATAAACGAAACAGTTTTCTTGAAAACGGCAATAGTAGGAAAACCAACTCCTGTTGTAGAGTGGCGACATGACGGGCAACCTATAAGTATAAATGAACGTATACAAATAACAACCACACCAAAGTTTTCTATGTTAAAAATATTATCAGCTAGACGAAGTGACCGTGGTGAGTACCAAGTTCATGCAAAGAATAATATAGGAGAAGATACAGCATCCTTCTTAGTGACGATAACCGCACCTCCTGATCCGCCCCGAAATGTATCGGTGGCCCGGCAGGTAGATAAGTCGGTAACTTTGAATTGGGAACCACCTGAAGATGATGGAGGTTGTCGTATCGGTAACTACGTAGTTGAGTATTATCGCAGCGGTTGGAATGTGTGGTTGAAGGCAGTTACCAGTCGTAAAACGAGCATCACGCTTTTCGATCTTATTGAAGGCAGCGAATATCGATTCCGTGTCAAAGCAGAAAGCCCATACGGAATGAGCGCTCCTAGTGTTGAATCTACGCCAGTTAAAATACCAGGGCGGGCTGTGGATATGGAATTTTTGGCGGTAGAGTCTAAAATAATAAACGAGGCGATGTTAAAAGAAGGAGGAGAAGCGATGCCAATCTCTCCAACTCCAAGACGGAAAAGATTGTCCGCGCCGGCTGATTCAACTCTTTTAGAAGAACCCTCGCCCGTACCACTAAGAAAAAAACCTGCACCTTCCAAGTCCGAAGCTACAAGCAACGAGTTTATGTTAGTGCTTTACCCAGACGCTAAATCAGATGACAAAACTGAAAAACGGAAATCGTTTCAGCTGGATTTAGAAGACGCGTTATCGCCCCCACCTATTTCGTTATCAGCCCCTGAGTTAAGTTCTAGATCTACGTTGCCTTTCAAAACTCTACGAAATGCTGTCAGTTCCACGGAGCTTTTACACGAACGAGCAATGGCTCGCTTCTACAAAGCCGTTGCCATGAAAGAAGAACAAAGTAAACAAATGCAAAAAAACATTCCCAGTAATGCCTTTGAAAACAACAATGCAATCAACGGTCATACTTCAAAACTCAAACATGATATAGTTAAAAATGCACCCTCTCTGAATATGTCTATAAAGAGTGACAGTGTAGAAGAAAAAGAACTTGAAAATTCAAAGTTTCAAATCCGTCAAGATTCCATAAACTCTGAAAAATGGCAACAGATGTCCTTCGATGAAGACTACACGGCTAGTACAGTTTCCACCGATGGCGATTATTCAGAAGACGAAGACGTTAGCCTTACTGAAGAGATTCA
GAGAGAAAAACAATTACTAGAAGAGGAAGCCACCTATAACCCTAGAAACAAAAACATTCACAACACATCTCAGTCAAAAATATCTTTAGAAGAAGAAGATCACAATGACGAGCTTATTCCACTTTCACCACTACCAGATCCCAATTTTGTACCAAAACCTATTTTAAAGAAACGGGAAAATACAGAACCAATCAATTTTACATCAATAGAAACAAAAAGTGATTTAAAAGAAAGTAATGAAATATTAACAAAAACTCAAAAGGAAAACAAGAGAACATTATTTCAGAAATTGACGAAGCAAAAACCATTTCATTTTCCGAAAATTCTTAACAAGAAAGACACAAATATATCAGAGGAAGCAATGCAGACGAACAAAGAAAAGCCTATTAATAAAGTAGAACCCATAGATGACAAATTAGGCGATGAAGGAAAAACTGTCATTGATTACTATGGCAATATTGTAAAAGAGTACGGAAGTGCTAAGAAGTCCAATACCCCTCTATACCTAAATACAGAAGATCTTAAATATGTTGCTGAACAGCAATACAAAAATGATCTCCAATCATCTAAAGTCTTAGATCACAAAGACACAGTTATTAAACCGGCTCAAGATAAAAAAGTTACAACAACAAAAACTAAAACAAAAATCAGGGCCAAACCATTAAACAAAAGTTCAAAAAAAGAAAATTTGGATAAACAAACGGAAATAACTGAAAACAAAGTACACCTGTCAAAAGCTCAAAATTGTAATCAAAAATCTGAAATCAAAAATGTTGTGCTTAAAACAACAGAAAGAGCTACAATCGTCATACCGATAGATTATAGAAAGCTAGAAGAGAGAGCTAAAATAACCGTTAGATCAGCAATCGACTACACAGTAGATGTGTGCCTGTTGTTGTTGGCTTTTTGGCTTTATATTTTTAAAGACGAAAGATTAGCTATACCAATTTTAATACTCATTATATATCGGCAATTACAAGAGACTATTTTTCGAAATATTCCAGAATGGATCAAACACTATACTCCTCAGTGGCTTAAAAAGAAAACTTCACATGGAAGTAGATGTATAAGATTTATTTCACAATTCAAAATTCCGAGATCCGACTGTGTACTTCCTAAAAACTGGCGTAAGGTTAACGATCCCAAACTTTGCGGCATTCGAGATAAGGGTTTAGTTCTTGGAGTGTATTATAATGAAAACGTAGTAGGAGAAGGTACTATACTAACGGCGAGTGCTCAAAAATTTGATAAAAAATGTAAAGGTAAGCTTTGGAATCAACTAAAACTTACACCGACACCCCGCCTCGGAGAATATAGAGTATTTTATGATTTAGACCCTATTTATGGATATGTAGCTGTAGCCGGATTAGGCTCTGAATGCTTAGCATTTAATGAAGTAGAACAGTTGCACGAAAGTAAAGAAGCTATAAGAATAGCCGCAGGCATTGGCACACAGGCCTTGTATAAATTTAAACCGTCTGCAATTCACGTTGAATCATTCGGAGACGCTGAAGCATCCGCAGAAGGAGCTTCTTTGGCAACTTGGAGGTTTCAAGAATATAGGAGTCCAAAAAGTGAGATGGTTATCCAAAAACCTAAGCTAGAACTTTTTGATGACTGTGACTTTGATGGTTGGAAAATAGGTGAACTTAAAGCCGAGTCTCAAAACTTAGCCCGCTTTCTGCAAGAAATGCCCCCAAATATTTTAAACCCAACAAAATTTGCAAAACTGGCAGTTGACTTGCTTTGTGAACTGGATATTAACGTTGAAATTAAGACTCAGGGTTGGGCAACCAGCCACGAAATGGGTGGATTCGTGGCCATCGGTAAATCGTCACTACAACCTCCACTTTATGTTGAAATAAGCTATTACGGAGCTAATGAACGTACAAGACCTATAGTTTTAATTGGGAAAGGAGTCACCTTTGACAGCGGTAGTGTGGACCTAAAATCATCAAATGCACTTCGTCATATGAGAGGCGATATGGCTGGAGCGGCTT
GTGTACTCGCTATTACCCGAGCAGCAGCATTACTTAAATTACCTGTCAATATACGCGGTATTTTGCCGTTATGTGAATTAATGCCTAGTGGTAAAAGTCCTAAATTTGGAGATATCGTATCCAGCGCAAGTGGAAAATCAATACATATCAGACTTCCCTCTCGTGAAGGTCGCCTCTTGTTCGCAGATAGCTTAGTGTACGCTAGAAATTATTGGCCTAAAATGATTTTAGATATAGGTACAATGTCGAAGGAATTAATTTATACGCTTGGTGGAGCTGCTTGCGGCTGCTACACAAACTCTGAGGAATTATTTTGTTACGCAGAATCAGCCAGCAGTCAAACTGGTGATAGGATTTGGCGAATGCCATTATGGAAATTCTACGAAGAACGTTTAAAGGATTGTCACGTTGCCGATCTTGCTAACACAGCCACTAATGATTACGGAGACTCTCCTAATTGTGCAGCTTTTCTTAAACAATTTGTTTGTGATAGTCAATGGATGCACTTTGACACTTATAACGTTTCCTATACAGAAGGTCACGATTTTACTTATCTACAGAAAGGAATGACCGGCCGTCCCACAAGAACTATCATTGAACTGATGTATCAATTGCTAGGTGATGCTAAGTTGTGA

Protein sequence:

>DPOGS206797-PA
MGNHSSSHVAHKSRKNVHWKSAGRPGVPGKPEIIPFLSDEEPNAITLKWNPPSHDGGAPLQGYQVECNRLGSPDWVRTAPPVVWRPELLLSGLEPPHRYQFRVTAINAVGRSDYSELSDILTVNANRTSQEPPLFLQHVENVTALENDKTEFRVSFTGTPVPTIAWFKDDYEIFSSRRTAISTTDSTSILIFHQTIASDEGEIKCTATNRAGHAVSKGRLSLEAAPKLRYPRQYEDGLLYEINETVFLKTAIVGKPTPVVEWRHDGQPISINERIQITTTPKFSMLKILSARRSDRGEYQVHAKNNIGEDTASFLVTITAPPDPPRNVSVARQVDKSVTLNWEPPEDDGGCRIGNYVVEYYRSGWNVWLKAVTSRKTSITLFDLIEGSEYRFRVKAESPYGMSAPSVESTPVKIPGRAVDMEFLAVESKIINEAMLKEGGEAMPISPTPRRKRLSAPADSTLLEEPSPVPLRKKPAPSKSEATSNEFMLVLYPDAKSDDKTEKRKSFQLDLEDALSPPPISLSAPELSSRSTLPFKTLRNAVSSTELLHERAMARFYKAVAMKEEQSKQMQKNIPSNAFENNNAINGHTSKLKHDIVKNAPSLNMSIKSDSVEEKELENSKFQIRQDSINSEKWQQMSFDEDYTASTVSTDGDYSEDEDVSLTEEIQREKQLLEEEATYNPRNKNIHNTSQSKISLEEEDHNDELIPLSPLPDPNFVPKPILKKRENTEPINFTSIETKSDLKESNEILTKTQKENKRTLFQKLTKQKPFHFPKILNKKDTNISEEAMQTNKEKPINKVEPIDDKLGDEGKTVIDYYGNIVKEYGSAKKSNTPLYLNTEDLKYVAEQQYKNDLQSSKVLDHKDTVIKPAQDKKVTTTKTKTKIRAKPLNKSSKKENLDKQTEITENKVHLSKAQNCNQKSEIKNVVLKTTERATIVIPIDYRKLEERAKITVRSAIDYTVDVCLLLLAFWLYIFKDERLAIPILILIIYRQLQETIFRNIPEWIKHYTPQWLKKKTSHGSRCIRFISQFKIPRSDCVLPKNWRKVNDPKLCGIRDKGLVLGVYYNENVVGEGTILTASAQKFDKKCKGKLWNQLKLTPTPRLGEYRVFYDLDPIYGYVAVAGLGSECLAFNEVEQLHESKEAIRIAAGIGTQALYKFKPSAIHVESFGDAEASAEGASLATWRFQEYRSPKSEMVIQKPKLELFDDCDFDGWKIGELKAESQNLARFLQEMPPNILNPTKFAKLAVDLLCELDINVEIKTQGWATSHEMGGFVAIGKSSLQPPLYVEISYYGANERTRPIVLIGKGVTFDSGSVDLKSSNALRHMRGDMAGAACVLAITRAAALLKLPVNIRGILPLCELMPSGKSPKFGDIVSSASGKSIHIRLPSREGRLLFADSLVYARNYWPKMILDIGTMSKELIYTLGGAACGCYTNSEELFCYAESASSQTGDRIWRMPLWKFYEERLKDCHVADLANTATNDYGDSPNCAAFLKQFVCDSQWMHFDTYNVSYTEGHDFTYLQKGMTGRPTRTIIELMYQLLGDAKL-
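
The transcript and protein lengths listed in this record (4635 bp and 1544 aa) can be cross-checked with simple arithmetic. The sketch below assumes the transcript is coding sequence only (no UTRs) and ends in a stop codon, which is consistent with the nucleotide sequence starting with ATG and ending with TGA. The function name `coding_length_consistent` is hypothetical and is not part of any database tool.

```python
def coding_length_consistent(transcript_bp: int, protein_aa: int,
                             has_stop: bool = True) -> bool:
    """Check whether a transcript length matches the protein length.

    Assumption: the transcript contains only the coding sequence,
    so its length should be 3 nt per residue, plus 3 nt for the
    stop codon when has_stop is True.
    """
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


# Lengths reported for DPOGS206797-TA and DPOGS206797-PA.
print(coding_length_consistent(4635, 1544))
```

Here 3 x (1544 + 1) = 4635, so the two reported lengths agree under that assumption.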