Monarch geneset OGS2.0

DPOGS206226
Transcript         DPOGS206226-TA   4203 bp
Protein            DPOGS206226-PA   1400 aa
Genomic position   DPSCF300334 - 367-11032
RNAseq coverage    13x (Rank: top 83%)
Annotation
Heliconius        HMEL016062         0.0      68.10%
Bombyx            BGIBMGA003286-TA   0.0      55.13%
Drosophila        fra-PA             2e-106   34.87%
EBI UniRef50      UniRef50_F4W5X3    0.0      34.78%   Neogenin n=9 Tax=Formicidae RepID=F4W5X3_ACREC
NCBI RefSeq       XP_974501.2        0.0      34.34%   PREDICTED: similar to AGAP006083-PB [Tribolium castaneum]
NCBI nr blastp    gi|307195636       0.0      35.82%   Neogenin [Harpegnathos saltator]
NCBI nr blastx    gi|383854858       0.0      35.09%   PREDICTED: LOW QUALITY PROTEIN: neogenin-like [Megachile rotundata]
Group
Gene Ontology     GO:0005515      1.5e-19   protein binding
                  GO:0016021      5.3e-11   integral to membrane
KEGG pathway      oaa:100076606   2e-136
                  K06766 (NEO1) maps-> Cell adhesion molecules (CAMs)
InterPro domain   [680-787]     IPR008957   3.9e-29   Fibronectin type III domain
                  [681-787]     IPR013783   7.5e-25   Immunoglobulin-like fold
                  [693-777]     IPR003961   1.5e-19   Fibronectin, type III
                  [230-337]     IPR013098   5.4e-14   Immunoglobulin I-set
                  [354-418]     IPR003598   3.3e-11   Immunoglobulin subtype 2
                  [1182-1208]   IPR010560   5.3e-11   Neogenin, C-terminal
                  [236-338]     IPR003599   8.2e-10   Immunoglobulin subtype
Orthology group   MCL10653   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206226-TA
ATGTTCAATCTTACGACGGAGCCGGCGGATGTGGTGGTGGTGGAAGGGGAGAGCGCCATGCTGTCCTGCGGAGCCGCGGCCCCCGCCAGGCTCAGTTGGAGGTACAGCGCCTCCGCCCCTCCGACCAGAGACCACAGCCTCCCACGAGCTGATAGCTTTAGGAAACAGTTGACGAACGGCTCCCTGCTCATAGAGAGAATGTCACCACCACTGGCCGGTCAGTACCAATGTGTGGCGACGGTAGATGGTATAGGTACCGTCGTGTCGCGAGTGGCCACTGTCTTCCTCGCTGAAGTGCCAGAGTTCCTGGAGGGTCCTCGTACTATGTCTGGTGTGCTCGGCTCCCCAGCACTCCTCCCGTGTTCCCTTAAGCTACCACTGCGTGTGGCTGTGAGGGTGATTGCTGCAGTCACTGAGAGGAGAGTGTATGGACCAAACAAGATACATGCACCGCCTCCTGTGTTGAAACTCAATGTGACATGGCTAAAGAACGGTTCGCCGGTCCAAGTGGAGGCCGCTCGTCTATACAGCACCGTCAGCGGAGCCCTGGAGATAGACCCTCTGAGGACGCACGACGCCGCCACATACAGATGTGCCGTCAGCCTCGCGCACTCCAACAAACCACCAGTTCTGGGTCCGGAGATCGATCTCCGCGTGAACAGCGAGCTGGCGGGGATGGAGTCCGCGCCCAGGATACTCACCACGCCGCAGCCCGTCACGGTCATAGAAGGCGCGTCCGTGACCTTCGACTGTGCGGCCACCGGCAACCCCAAGCCGGAGATCGTGTGGCTCAACAACGGCGTGGCCATAGACTTGAAGTACGTTCATAACCACCTCTTTACTCAGATCTGTTCCATGAGGTCTGTGTGTAGTGATCTAGACTCCCGTTTCTATCTGGTGGGGGGCGGCTCTCTCCGCGTGGTGTCGTCCCGGGCTCCGGACGCGGGCGCCTACACCTGCCGCGCCACCAACAGGATCGACGCCGCCGACCACTCCGCTCACCTCCACGTCCTGTCTCCCCCCCGCGTGTCGGTCCGCGACGGGTCGGTGGTGAGGGCGGTGACCCGCGGTGATGTCACTCTGAGATGCGACGCCCGCGGGCGACCGCCGCCAGTGGTGACGTGGCTGAAAGACGGGGAACCTCTCACACCGAACAACCACGACATCATGGTGGACGGGACCTCGCTGAGGATCAGGGGAGTGCTGGACGTGGACGAGGGAGTGTTCCAGTGTGTCGCGGCCTCGGCTGCCGGCAGCGCTGCCGCGGCGCTCAGGCTGATTGTGGCGCCGCACGCCGACCCCCTCCCCACGAACCTGACCCCTACCTTCCTGACCCCCGACCTCTACCCGGAAGATGTGGATTTCATCGGCGAGACGTCATCAGCGTTCACCCCCGAGCCTCTGTACGACGACTTAGATAACGTAGATTATTCCGAAGATCTGGACTCCTACGACGCGGGCAAGGGGAACGCGAGCGTGGTCTCCGCGCCCGGGGACTTCCGCGCCGTCATCGTCAAGCACCGCTTCGTGACGCTCAGCTGGACCGAGCCGAAACACGCGCTGGAAGAAGTCACCGGATACATCATACTGTATAAAGTGAAAGGAAGCGACCGGGAGCGTCTGTGGTCGGGCGAGGCTCGGCGGCGCGAGGCCGTGTTGGCGTCCCTGGCTCCTCGCACCACGTACACGGCGCGGGCCCTCGCCCTCACTCGCAGTGCAGCCTCGCCGCCGACAGAGACTATAGAGGTGACCACTCCTGACGAGGAGCTGTCCTACGGCCCTCCTCAGAACGTGTCGGTGGAGGCGGTGGGCGCTCACTCTCTGCGGGTGTGGTGGGCCCCGCCCGCGCCGCTCGGGCCTCACGTGCCGCCTGAGGTGCCGCCCGCCGCTCCTGGCCGATACGTCATATACTATACAGAGACGGAGAGTGGTCGCGAGCAGAGCCAGTACACCAACTCCACCAGCATCACCCTGAGCGGTCTGCGGGCGGCCACCGCCTA
CCGGGTGCGGGTGTCGGCGGGGGGAGGGGGGACCAGTGACGTCACCACCGCCACGCGAGCCGACGCCCCCTCCGCACCACCCACTGACGTCACCGTCATCCCCGCCACGGATACGTCGCTACTAGTCCGCTGGTCGGCCCCGGCCGGGCGCTCGCACCGCGGAGCCCTCACAGGATACAAGCTCCGGTACAGGACCCCCGGGGCGCGCCGCGCGGACTCGCTCACCACTCCCGCAGACACCACGCGGGCGGACCTCACAGGACTGGAACCCTCCACCACCTACCAGGTCCGCGTGTGTGCTCTGAACGCTAACGGCTCCGGACCGTTCAGCGAGTGGGTCTCGGCCACCACGCAGCCAAGACGGAGGCCTGAGAGCAGCGTGCCGGCGGCGCCGCCTCCGCTCACCACTCGCGCCGGTCGCGACTGGATCTCCGTCTGGTGGGGCAGCGAGGAGGGCTCCAACACACCGGGGGGAGGGGGCCCGGGGGCAGTCGGGGCGGGCGGCGGAGCACCCGTGCGGGGGTACTGGCTGGGCTGGGGACTCGGCGTACCTGACTCACACTCCAGGGAACTGCCTGCGCATGCGCATTCACATGTTATAAGAGATTTGGAATCCAACTCGGAGTACGTGATATCTCTCCGCGCCAGCAACACGCTGGGTCTGGGCCCGGCGGTGTACGCGACCGTCCGCACCAAGCCCGACGACGGAGAAGACGAACCAGACGAGCCCGACCAGCCGGAGGACGACGCGCCGCCTCTCATCCCGCCCGTGGGGCTCAAGGTTATCATGCTGAGCGGCACCACCGCCGTCGTGTACTGGACCGACCCCACTCTACCCAAAGGACAGACGGCGGCGGACGGTCGACGGTACGCGGTGCGGTGGTCGGGCGGCGGACGGTCGCGCGTCTACAACGCTTCGGATCTCAATCTCATGTTGGACGATCTCAAACCTTACACGCACTACGAGTTCGCTGTCAAACTCATAAAAGGTGGTCGCGAATCTCCCTGGTCGATGCTGGCCAGCAACACGTCCCTGGAGGCAGCCCCGGGCTCCGCCCCCCCGGGAGCTGCGCGTGTCCCCCGCGGCTCCAGCCTCCGCGCCGCCGACTTAACGTGGAGTCCACCCGCCAAGCCCAACGGAGTCATCACAGGCATGTTGACACGGTATGTGATAATGTACGGCGTGTCCCGCGGGTCGGGCGCCGCCGAGGAGTGGTCGGCTCTGGCGGCTCCCGGGGAACGAGGCCGGGCCCGGGTGGACCGGCTCCGGGCGCGGACCACCTACAGCTTCAAGATACAGGCACGGAACAGTAGGGGACTGGGGCCCTTCAGCCCCGCCGTCACTTACACTACTGGGATTGAGAGCGGTGAAGGCGCGGGTCTGGCGAGCGCCACGTCCGCGTGGTTGTGGGCCAGTGCGGGCGGCGCCTGTGCCGTGCTGGCGCTCGCAGCAGCCCTCGCTCTGTCGCTGTGCTGCAGAAGGAACACGCCTCCCATGTCCCCAGACACCAGCACTTACCAGAAAGCGTCCGCGTCAGCTGGCATCAAGCCTCCAGACCTCTGGATCCACCACGACCAGATGGAGCTGAAGCACATGGACAAGAGCTTACACAGCTCAGCCATATCAGCGGGTAGCGTCGAAGGCAGCGCGTTGGTGTCGTCGACGTTGACCCTATCCCGCACCCCCACCCCACTCGCACCCTCACCCCACCCCCGCTGCCCGCGCTCCCCGCCCCCCACCTGGCGGAGTACGAGCCGGCGCGCCATCCGCCGCCCATCACCAGCCTCGACCGGCGATACGTCCCAACATACGTCGGGTGTTTGGAGTAAGATGAACGAGATTGTTGGTCTTACAAACATTCGTTACGCCAATCGCCCTGTATATGTATACTTTGTGTTGCTTCCTCCCCTCCCTCCCGTGTATCCTGTCTGTATGTCTGTCTGTGTGCACGTAGACCGTCGCTCGTCGTCGGGCAGCGCGGACACCGCCCCGCTCCGAG
CCTCGCCCCTCGACTACCGCTGCGACCTGCTAGCTATGACTGGTGTCGGAGTGGGCGTGGGCGTGGGCTGCACCGGCACCTGCGAGCGACGGAGACATCTGGCGGATCAGAGCACGCCGCTGTTGACGGGTGTAGCGCCGCTGGGGTCGCCGCAGTCCTCTCTGACGTCACATCCGCCCGCGCCATGTAAGTATCCACACTGA

Protein sequence:

>DPOGS206226-PA
MFNLTTEPADVVVVEGESAMLSCGAAAPARLSWRYSASAPPTRDHSLPRADSFRKQLTNGSLLIERMSPPLAGQYQCVATVDGIGTVVSRVATVFLAEVPEFLEGPRTMSGVLGSPALLPCSLKLPLRVAVRVIAAVTERRVYGPNKIHAPPPVLKLNVTWLKNGSPVQVEAARLYSTVSGALEIDPLRTHDAATYRCAVSLAHSNKPPVLGPEIDLRVNSELAGMESAPRILTTPQPVTVIEGASVTFDCAATGNPKPEIVWLNNGVAIDLKYVHNHLFTQICSMRSVCSDLDSRFYLVGGGSLRVVSSRAPDAGAYTCRATNRIDAADHSAHLHVLSPPRVSVRDGSVVRAVTRGDVTLRCDARGRPPPVVTWLKDGEPLTPNNHDIMVDGTSLRIRGVLDVDEGVFQCVAASAAGSAAAALRLIVAPHADPLPTNLTPTFLTPDLYPEDVDFIGETSSAFTPEPLYDDLDNVDYSEDLDSYDAGKGNASVVSAPGDFRAVIVKHRFVTLSWTEPKHALEEVTGYIILYKVKGSDRERLWSGEARRREAVLASLAPRTTYTARALALTRSAASPPTETIEVTTPDEELSYGPPQNVSVEAVGAHSLRVWWAPPAPLGPHVPPEVPPAAPGRYVIYYTETESGREQSQYTNSTSITLSGLRAATAYRVRVSAGGGGTSDVTTATRADAPSAPPTDVTVIPATDTSLLVRWSAPAGRSHRGALTGYKLRYRTPGARRADSLTTPADTTRADLTGLEPSTTYQVRVCALNANGSGPFSEWVSATTQPRRRPESSVPAAPPPLTTRAGRDWISVWWGSEEGSNTPGGGGPGAVGAGGGAPVRGYWLGWGLGVPDSHSRELPAHAHSHVIRDLESNSEYVISLRASNTLGLGPAVYATVRTKPDDGEDEPDEPDQPEDDAPPLIPPVGLKVIMLSGTTAVVYWTDPTLPKGQTAADGRRYAVRWSGGGRSRVYNASDLNLMLDDLKPYTHYEFAVKLIKGGRESPWSMLASNTSLEAAPGSAPPGAARVPRGSSLRAADLTWSPPAKPNGVITGMLTRYVIMYGVSRGSGAAEEWSALAAPGERGRARVDRLRARTTYSFKIQARNSRGLGPFSPAVTYTTGIESGEGAGLASATSAWLWASAGGACAVLALAAALALSLCCRRNTPPMSPDTSTYQKASASAGIKPPDLWIHHDQMELKHMDKSLHSSAISAGSVEGSALVSSTLTLSRTPTPLAPSPHPRCPRSPPPTWRSTSRRAIRRPSPASTGDTSQHTSGVWSKMNEIVGLTNIRYANRPVYVYFVLLPPLPPVYPVCMSVCVHVDRRSSSGSADTAPLRASPLDYRCDLLAMTGVGVGVGVGCTGTCERRRHLADQSTPLLTGVAPLGSPQSSLTSHPPAPCKYPH-
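
The transcript and protein records are related by simple translation: 4203 bp corresponds to 1400 codons plus one stop codon (3 x 1401 = 4203). The Python sketch below shows one way to check this for a FASTA record whose sequence wraps over several lines. It is illustrative only: the helper names read_fasta and translate are hypothetical, the standard genetic code is assumed, and the stop codon is rendered as "-" to match the protein record above.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def read_fasta(text):
    """Return {header: sequence}, joining sequence lines that wrap."""
    records, name = {}, None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def translate(cds, stop="-"):
    """Translate a coding sequence codon by codon; unknown codons become X."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    protein = "".join(CODON_TABLE.get(cds[i:i + 3], "X")
                      for i in range(0, usable, 3))
    return protein.replace("*", stop)


# First ten codons and last five codons of DPOGS206226-TA, copied from above.
print(translate("ATGTTCAATCTTACGACGGAGCCGGCGGAT"))  # MFNLTTEPAD
print(translate("AAGTATCCACACTGA"))                 # KYPH-
```

Applied to the full record, translate(read_fasta(text)["DPOGS206226-TA"]) would be expected to reproduce the DPOGS206226-PA sequence, including the trailing "-".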