Monarch geneset OGS2.0

DPOGS209125
TranscriptDPOGS209125-TA4704 bp
ProteinDPOGS209125-PA1567 aa
Genomic positionDPSCF300501 - 43891-56187
RNAseq coverage18x (Rank: top 80%)
Annotation
HeliconiusHMEL0160620.060.03% 
BombyxBGIBMGA003286-TA0.047.04% 
Drosophilafra-PA2e-8632.61% 
EBI UniRef50UniRef50_F4W5X35e-15731.60%Neogenin n=9 Tax=Formicidae RepID=F4W5X3_ACREC
NCBI RefSeqXP_001122444.12e-15130.69%PREDICTED: similar to Neogenin precursor [Apis mellifera]
NCBI nr blastpgi|3071956362e-16232.26%Neogenin [Harpegnathos saltator]
NCBI nr blastxgi|3838548581e-16332.36%PREDICTED: LOW QUALITY PROTEIN: neogenin-like [Megachile rotundata]
Group
Gene OntologyGO:00055151.3e-13protein binding
GO:00160212.7e-11integral to membrane
KEGG pathwayoaa:1000766068e-99 
 K06766 (NEO1)maps-> Cell adhesion molecules (CAMs)
InterPro domain[425-491] IPR0137831e-26Immunoglobulin-like fold
[558-651] IPR0089576.5e-19Fibronectin type III domain
[293-400] IPR0130986.2e-14Immunoglobulin I-set
[756-826] IPR0039611.3e-13Fibronectin, type III
[1207-1271] IPR0105602.7e-11Neogenin, C-terminal
[417-481] IPR0035983.3e-11Immunoglobulin subtype 2
[299-401] IPR0035998.2e-10Immunoglobulin subtype
Orthology groupMCL10653 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209125-TA
ATGCTGTCCTGCGGAGCCGCGGCCCCCGCCAGGCTCAGTTGGAGGTACAGCGCCTCCGCCCCTCCGACCAGAGACCACAGCCTCCCACGAGCTGATAGCTTTAGGAAACAGTTGACGAACGGCTCCCTGCTTATAGAGAGAATGTCACCACCACTGGCCGGTCAGTACCAATGTGTGGCGACGGTAGATGGTATAGGTACCGTCGTGTCGCGAGTGGCCACTGTCTTCCTCGCTGAAGTGCCAGAGTTCCTGGAGGGTCCTCGTACTATGTCTGGTGTGCTCGGCTCCCCAGCACTCCTCCCGTGTTCCCTTAAGCTACCACTGCGTGTGGCTGTGAGGGTGATTGCTGCAGTCACTGAGAGGAGAGTGTATGGACCAAACAAGATACATGCACCGCCTCCTGTGTTGAAACTCAATGTGACATGGCTAAAGAACGGTTCGCCGGTCCAAGTGGAGGCCGCTCGTTTATACAGCACTGTCAGCGGAGCCCTGGAGATAGACCCTCTGAGGACGCACGACGCCGCCACATACAGATGTGCCGTCAGCCTCGCGCACACCAACAAACCACCAGTTCTGGGTCCGGAGATCGATCTCCGCGTGAACAGCGAGCTGGCGGGGATGGAGTCCGCGCCCAGGATACTCACCACGCCGCAGCCTGTCACGGTGACATGGCTAAAGAACGGTTCGCCGGTCCAAGTGGAGGCCGCTCGTTTATACAGCACTGTCAGCGGAGCCCTGGAGATAGACCCTCTGAGGACGCACGACGCCGCCACATACAGATGTGCCGTCAGCCTCGCGCACACCAACAAACCACCAGTTCTGGGTCCGGAGATCGATCTCCGCGTGAACAGCGAGCTGGCGGGGATGGAGTCCGCGCCCAGGATACTCACCACGCCGCAGCCTGTCACGGTCATAGAAGGCGCGTCCGTGACCTTCGACTGTGCGGCCACCGGCAACCCCAAGCCGGAGATCGTGTGGCTCAACAACGGCGTGGCCATAGACTTGAAGTACGTTCATAACCACCTCTTTACTCAGATCTGTTCCATGAGGTCTGTGTGTAGTGATCTAGACTCCCGTTTCTATCTGGTGGGGGGTGGCTCTCTCCGCGTGGTGTCGTCCCGGGCTCCGGACGCGGGCGCCTACACCTGCCGCGCCACCAACAGGATCGACGCCGCCGACCACTCCGCTCACCTCCACGTCCTGTCTCCCCCCCGCGTGTCGGTCCGCGACGGGTCGGTGGTGAGGGCGGTGACCCGCGGTGATGTCACTCTGAGATGCGACGCCCGCGGGCGACCGCCGCCAGTGGTGACGTGGCTGAAAGACGGGGAACCTCTCACACCGAACAACCACGACATCATGGTGGACGGGACCTCGCTGAGGATCAGGGGAGTGCTGGACGTGGACGAGGGAGTGTTCCAGTGTGTCGCGGCCTCGGCTGCCGGCAGCGCTGCCGCGGCGCTCAGGCTGATTGTGGCGCCGCACGCCGACCCCCTCCCCACGAACCTGACCCCTACCTTCCTGACCCCCGACCTCTACCCGGAAGATGTGGATTTCATCGGCGAGACGTCATCAGCGTTCACCCCCGAGCCTCTGTACGACGACTTAGATAACGTAGATTATTCCGAAGATCTGGACTCCTACGACGCGGGCAAGGGGAACGCGAGCGTGGTCTCCGCGCCCGGGGACTTCCGCGCCGTCATCGTCAAGCACCGCTTCGTGACGCTCAGCTGGACCGAGCCGAAACACGCGCTGGAAGAAGTCACCGGATACATCATACTGTATAAAGTGAAAGGAAGCGACCGGGAGCGTCTGTGGTCGGGCGAGGCTCGGCGGCGCGAGGCCGTGTTGGCGTCCCTGGCTCCTCGCACCACGTACACGGCGCGGGCCCTCGCCCTCACTCGCAGTGCAGCCTCGCCGCCGACAGAGACTATAGAGGTGACCACTCCTGACGAGGAGCTGTCCTACGGCCCTCCTCAGAACGTGTCGGTGGAGGCGGTGGGCGCTCACTCTCTGCGGGTGTGGTGGGCCCCGCCCGCGCCGCTCGGGCCTCACGTGCCGCCTGAGGTGCCGCCCGCCGCTCCTGGCCGATACGTCATATACTATACAGAGACGGAGAGTGGTCGCGAGCAGAGCCAGTACACCAACTCCACCAGCATCACCCTGAGCGGTCTGCGGGCGGCCACCGCCTACCGGGTGCGGGTGTCGGCGGGGGGAGGGGGGACCAGTGACGTCACCACCGCCACGCGAGCCGACGCCCCCTCCGCACCACCCACTGACGTCACCGTCATCCCCGCCACGGATACGTCGCTACTAGTCCGCTGGTCGGCCCCGGCCGGGCGCTCGCACCGCGGAGCCCTCACAGGATACAAGCTCCGGTACAGGACCCCCGGGGCGCGCCGCGCGGACTCGCTCACCACTCCCGCAGACACCACGCGGGCGGACCTCACAGGACTGGAACCCTCCACCACCTACCAGCTCGCGCCGGTCGCGACTGGATCTCCGTCTGGTGGGGCAGCGAGGAGGGCTCCAATACACCGGGGGGAGGGGGCCCGGGGGCAGGCGGGGCGGGCGGCGGAGCACCCGTGCGGGGGTACTGGCTGGGCTGGGGACTCGGCGTACCTGACTCACACTCCAGGGAACTGCCTGCGCATGCGCATTCACATCCTGCATCATGTCCGTGACCCAGAATCCAACTCGGAGTACGTGATATCTCTCCGCGCCAGCAACACGCTGGGTCTGGGCCCGGCGGTGTACGCGACCGTCCGCACCAAGCCCGACGACGGAGAAGACGAACCAGACGAGCCCGACCAGCCGGAGGACGACGCGCCGCCTCTCATCCCGCCCGTGGGGCTCAAGGTTATCATGCTGAGCGGCACCACCGCCGTCGTGTACTGGACCGACCCCACTCTACCCAAAGGACAGACGGCGGCGGACGGTCGACGGTACGCGGTGCGGTGGTCGGGCGGCGGACGGTCGCGCGTCTACAACGCTTCGGATCTTAATCTCATGTTGGACGATCTCAAACCTTACACGCACTACGAGTTCGCTGTCAAACTCATAAAAGGTGGTCGCGAATCTCCCTGGTCGATGCTGGCCAGCAACACGTCCCTGGAGGCAGCCCCGGGCTCCGCCCCCCGGGAGCTGCGCGTGTCCCCCGCGGCTCCAGCCTCCCGCGCCGCCGACTTAACGTGGAGTCCACCCGCCAAGCCCAACGGAGTCATCACAGGCATGTTGACACGGTATGTGATAATGTACGGCGTGTCCCGCGGGTCGGGCGCCGCCGAGGAGTGGTCGGCTCTGGCGGCTCCCGGGGAACGAGGCCGGGCCCGGGTGGACCGGCTCCGGGCGCGGACCACCTACAGCTTCAAGATACAGGCACGGAACAGTAGGGGACTGGGGCCCTTCAGCCCCGCCGTCACTTACACTACTGGGATTGAGAGCGGTGAAGGCGCGGGTCTGGCGAGCGCCACGTCCGCGTGGTTGTGGGCCAGTGCGGGAGGCGCCTGTGCCGTGCTGGCGCTCGCAGCAGCCCTCGCTCTGTCGCTGTGCTGCAGAAGGAACACGCCTCCCATGTCCCCGGACACCAGCACTTACCAGAAAGCGTCCGCGTCAGCTGGCATCAAGCCTCCAGACCTCTGGATCCACCACGACCAGATGGAGCTGAAGCACATGGACAAGAGCTTACACAGCTCAGCCAGTAAGATATCAGCGGGTAGCGTCGAAGGCAGCGCGTTGGTGTCGTCGACGTTGACCCTATCCCGCACCCCCCACCCCCACTCTCACCCTCACCCCTCCCAACATACGTCGGGTGTTTGGAGTAAGATGAACGAGATTGTTGGTCTTACAAACATTCGTTACGCCAATCGCCCTGTATATGTATACTTTGTGTTGCTTCCTCCCCTCCCTCCCGTGTATCCTGTCTGTATGTCTGTCTGTGTGCACGTAGACCGTCGCTCGTCGTCGGGCAGCGCGGACACCGCCCCGCTCCGAGCCTCGCCCCTCGACTACCGCTGCGACCTGCTAGCAATGACAGGGGTCGGCGTGGGCGTGGGCGTGGGCTGCACCGGCACCTGCGAGCGACGGAGACATCTAGCGGATCAGAGCACGCCGCTGTTGACGGGTGTAGCGCCGCTGGGGTCGCCGCAGTCCTCTCTGACGTCACATCCGCCCGCACCATGCGTGTCGGGTCAGTGTCCTCTGGGTACATGCTCGGCGCCAGGGTCCGAGGTGTACGCGAGCGCGTCCACAGCGCGGGAGCGAGGACACTACGTCGCCTACGAACCCCTGGGACATTACACGCACCGTGACTCCGTGAGTACGGACGCCGCAGCACCCGCGAGCACGGGCGCCGGAGGCTCCCTACAGAGGAGGGGCGCGTGCTCCGCCCTACACAGCTTCACGCTACCCGACAACGCGTCTGATCACAGCACGCCCTCACACTCCAAGGGAAGTGCGCGCGCGTCCTCCCCGTACAAGGCGAGTGCGTCGTCGTCCCCCGCACACACACACACGCACTCGCTCGCGCACACACACGCGCATGCTCTCAACAGACTGCAGCTCGGTGGTGGAGTGTCTCACAGCTCTGATGAGCTGGAGCCTCTCACTCCGTCCAGGTCCAGCGAGCGTCTCCACCGCGAGATGCAGAACCTGGAAGGGCTCATGAAGGACCTGTCGGCGATCACGCAGAACCAGTTCCACTGCTAG

Protein sequence:

>DPOGS209125-PA
MLSCGAAAPARLSWRYSASAPPTRDHSLPRADSFRKQLTNGSLLIERMSPPLAGQYQCVATVDGIGTVVSRVATVFLAEVPEFLEGPRTMSGVLGSPALLPCSLKLPLRVAVRVIAAVTERRVYGPNKIHAPPPVLKLNVTWLKNGSPVQVEAARLYSTVSGALEIDPLRTHDAATYRCAVSLAHTNKPPVLGPEIDLRVNSELAGMESAPRILTTPQPVTVTWLKNGSPVQVEAARLYSTVSGALEIDPLRTHDAATYRCAVSLAHTNKPPVLGPEIDLRVNSELAGMESAPRILTTPQPVTVIEGASVTFDCAATGNPKPEIVWLNNGVAIDLKYVHNHLFTQICSMRSVCSDLDSRFYLVGGGSLRVVSSRAPDAGAYTCRATNRIDAADHSAHLHVLSPPRVSVRDGSVVRAVTRGDVTLRCDARGRPPPVVTWLKDGEPLTPNNHDIMVDGTSLRIRGVLDVDEGVFQCVAASAAGSAAAALRLIVAPHADPLPTNLTPTFLTPDLYPEDVDFIGETSSAFTPEPLYDDLDNVDYSEDLDSYDAGKGNASVVSAPGDFRAVIVKHRFVTLSWTEPKHALEEVTGYIILYKVKGSDRERLWSGEARRREAVLASLAPRTTYTARALALTRSAASPPTETIEVTTPDEELSYGPPQNVSVEAVGAHSLRVWWAPPAPLGPHVPPEVPPAAPGRYVIYYTETESGREQSQYTNSTSITLSGLRAATAYRVRVSAGGGGTSDVTTATRADAPSAPPTDVTVIPATDTSLLVRWSAPAGRSHRGALTGYKLRYRTPGARRADSLTTPADTTRADLTGLEPSTTYQLAPVATGSPSGGAARRAPIHRGEGARGQAGRAAEHPCGGTGWAGDSAYLTHTPGNCLRMRIHILHHVRDPESNSEYVISLRASNTLGLGPAVYATVRTKPDDGEDEPDEPDQPEDDAPPLIPPVGLKVIMLSGTTAVVYWTDPTLPKGQTAADGRRYAVRWSGGGRSRVYNASDLNLMLDDLKPYTHYEFAVKLIKGGRESPWSMLASNTSLEAAPGSAPRELRVSPAAPASRAADLTWSPPAKPNGVITGMLTRYVIMYGVSRGSGAAEEWSALAAPGERGRARVDRLRARTTYSFKIQARNSRGLGPFSPAVTYTTGIESGEGAGLASATSAWLWASAGGACAVLALAAALALSLCCRRNTPPMSPDTSTYQKASASAGIKPPDLWIHHDQMELKHMDKSLHSSASKISAGSVEGSALVSSTLTLSRTPHPHSHPHPSQHTSGVWSKMNEIVGLTNIRYANRPVYVYFVLLPPLPPVYPVCMSVCVHVDRRSSSGSADTAPLRASPLDYRCDLLAMTGVGVGVGVGCTGTCERRRHLADQSTPLLTGVAPLGSPQSSLTSHPPAPCVSGQCPLGTCSAPGSEVYASASTARERGHYVAYEPLGHYTHRDSVSTDAAAPASTGAGGSLQRRGACSALHSFTLPDNASDHSTPSHSKGSARASSPYKASASSSPAHTHTHSLAHTHAHALNRLQLGGGVSHSSDELEPLTPSRSSERLHREMQNLEGLMKDLSAITQNQFHC-