Monarch geneset OGS2.0

DPOGS202945
TranscriptDPOGS202945-TA2073 bp
ProteinDPOGS202945-PA690 aa
Genomic positionDPSCF300195 - 434788-448448
RNAseq coverage37x (Rank: top 73%)
Annotation
HeliconiusHMEL0108981e-9894.15% 
BombyxBGIBMGA005824-TA0.082.07% 
DrosophilaCorin-PB3e-12045.79% 
EBI UniRef50UniRef50_F4X8171e-13652.31%Atrial natriuretic peptide-converting enzyme n=6 Tax=Acromyrmex echinatior RepID=F4X817_ACREC
NCBI RefSeqXP_001814556.11e-15153.68%PREDICTED: similar to transmembrane protease, serine [Tribolium castaneum]
NCBI nr blastpgi|1892339122e-15053.68%PREDICTED: similar to transmembrane protease, serine [Tribolium castaneum]
NCBI nr blastxgi|1892339122e-16748.08%PREDICTED: similar to transmembrane protease, serine [Tribolium castaneum]
Group
Gene OntologyGO:00055151.9e-34protein binding
GO:00160206.8e-07membrane
GO:00050446.8e-07scavenger receptor activity
KEGG pathway 
InterPro domain[366-488] IPR0200671.9e-34Frizzled domain
[499-535] IPR0021727.1e-14Low-density lipoprotein (LDL) receptor class A repeat
[568-684] IPR0174486.8e-07Speract/scavenger receptor-related
Orthology groupMCL12467 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202945-TA
ATGAAAGCAAATGCGAGTAATCAATGGGGCTATGGCCGGGCCGGCAGCCTCGCGGTGACTCCGGGAGAGAAACTGATGGAACTACTTCGTCCCAACGGTGGGACACCACGACGACATTCGACAGCTGCATGCGCGCCTCCACAACCAAAACCCTCAGGTTTTGTGTACTGCCCTTCAGACGCCCTACCGTACTGTCCACCTTCTAGGTTTGCACCATCTCATCAAGTCAACAAGCTACCACCACCACAAGTTCAACAGCCGCCTCAACCACCACCAGTACCTCAGAGGACCGCGCCGGTGGTAACACCCTTGCCGGTTCCACCACCAGCACCACCGCGACGATCATCACCACAACCCCCACGCAGAGTACCACCTCCGACTCCCCCCCGGCCAAATGCAACGACAACAGATCCTAACGGCAACAAGCGACCACCGCCCGCACCACAGAAGGAACAAGCACCACCTAAACTCGACTGTAACAGAAACCCTCAACCACCAGCGATAAGAAGGTCTCCACAAAAAATGCCGGACACGCCATACACACCGGCACCTGTACCGCAAAATAATCAAATGAAACAATCACCGAGTTCCTCATCTAATATTAGCGTTAGACAAGATTCAAATGTATCCTCTGACTCATTTAGTCAGACTTCCTCTCCCTCTTATACTACAAAAACTATGGAAGCCCCGTTGCTACCGCACCAGCATGTCAATAAGAGCCTGAACGCCAAGATAGCTCGGGGACTATTACTCAAGGAGCAACAGGAAAAAGAAGCTGGGAACTCGTCCATTACAAAGAGCATGTCCACTCCGGCTTCACTGCAAACCATAGTGAGATTTCAAAACGGAAGCAACATGTCTCTACATCACAGAATGCTTCGTGATATGCGCGGCACCAATACCGACACCTCCGCTCACAAGTTTCGTCTGATGCAGCTCGCTCTCAACGCGGTCATGCTGCTAGCTATCACCGGAGCTTTGTTCGCATACTTTAAAGCGAACCCCGCTGTTCAGTATGTGTCTCAAGCTGTGAATTTGTCGGCGGCGGTGACGTGGCCGACGCCAACGGAGCCCCCGGGTGCAAGGAACCCAGCGCCAGGAGTCTGTCTGCCTGTCATCGTTACCTTCTGTCACCAGCACCGCATCTCCTACAATTTCACTGTTTTTCCAAATTACATTGGACACTTCGGACAGAGGGATGCACAACAGGACTTGGAAATTTACGATGCAGTTGTGGACGTTCGCTGCTATGAACTGACGGCGCTTTTTCTGTGTTCTTTATTTGTCCCCAAGTGCGGTCCACTGGGTCACATGGTCCGACCCTGTCGGAGTCTGTGTCAAGAAACAATGCGTCGTTGTGGTTTCTTTCTGGAAGTATTCGGTCTTTCGATGCCGGACTACCTTCAATGTGAAATTTTTCCGGAGTCTACTGACAGAGACGTGTGCCTTGGGAATAGAGAAGTAAAGGAGGCGCGATTTAGAGCTGCAAAACCAGTGTGTCCAACTGGTTTCCAATGCGACATGAACCGCTGCATCCCCCACGACTGGCGCTGTGATGGACACGTGGACTGCGCTGATCGCTCCGACGAACTGAACTGTCGCGTCTGTAAAAGAAGCGGAGACGTCCACTGCGGAAACCAGAGATGTATCTCACAGGCACATCTGTGTGACGGGAAGATAGACTGCCCCTGGGGACAGGATGAAAGGAACTGCTTACGTCTAAGTAAGGCAAACGGTGACGTTGGTCGTGGGGAGCTCCAAGTATATCGCGCCGCCAACCAGTCCTGGTTCCCAGCTTGCATCACCACCTTGGACGATCCAACTGCTTCCAAACTGTGCTCAATGCTCGGATACTCTTGGGTTAACAAGAGTACGGTGGTGGGTGGTGCGGGCGCACGAGCCGGAAGCGGGGTGCAGGCTCATGGCGTGGCGCAGTCCTACCGAGCCTTCCAACGGAGCGAGGGAGGTCTATTGCGGGAGCTCAAAGACTGTCGCCACGACTCAGCCAGGGTGCACCTCGTCTGTGATCATTACGGTCCATCATATCCTCGGAAGAGGTTCTTCGGAGATTAA

Protein sequence:

>DPOGS202945-PA
MKANASNQWGYGRAGSLAVTPGEKLMELLRPNGGTPRRHSTAACAPPQPKPSGFVYCPSDALPYCPPSRFAPSHQVNKLPPPQVQQPPQPPPVPQRTAPVVTPLPVPPPAPPRRSSPQPPRRVPPPTPPRPNATTTDPNGNKRPPPAPQKEQAPPKLDCNRNPQPPAIRRSPQKMPDTPYTPAPVPQNNQMKQSPSSSSNISVRQDSNVSSDSFSQTSSPSYTTKTMEAPLLPHQHVNKSLNAKIARGLLLKEQQEKEAGNSSITKSMSTPASLQTIVRFQNGSNMSLHHRMLRDMRGTNTDTSAHKFRLMQLALNAVMLLAITGALFAYFKANPAVQYVSQAVNLSAAVTWPTPTEPPGARNPAPGVCLPVIVTFCHQHRISYNFTVFPNYIGHFGQRDAQQDLEIYDAVVDVRCYELTALFLCSLFVPKCGPLGHMVRPCRSLCQETMRRCGFFLEVFGLSMPDYLQCEIFPESTDRDVCLGNREVKEARFRAAKPVCPTGFQCDMNRCIPHDWRCDGHVDCADRSDELNCRVCKRSGDVHCGNQRCISQAHLCDGKIDCPWGQDERNCLRLSKANGDVGRGELQVYRAANQSWFPACITTLDDPTASKLCSMLGYSWVNKSTVVGGAGARAGSGVQAHGVAQSYRAFQRSEGGLLRELKDCRHDSARVHLVCDHYGPSYPRKRFFGD-