Monarch geneset OGS2.0

DPOGS202603
TranscriptDPOGS202603-TA2880 bp
ProteinDPOGS202603-PA959 aa
Genomic positionDPSCF300140 - 343958-353971
RNAseq coverage102x (Rank: top 61%)
Annotation
HeliconiusHMEL0098680.084.65% 
BombyxBGIBMGA006348-TA0.076.24% 
Drosophilarobo-PB1e-8342.49% 
EBI UniRef50UniRef50_Q9W2132e-8142.49%Roundabout, isoform B n=19 Tax=Endopterygota RepID=Q9W213_DROME
NCBI RefSeqNP_726224.24e-8242.49%roundabout, isoform B [Drosophila melanogaster]
NCBI nr blastpgi|3454969338e-8442.53%PREDICTED: roundabout homolog 2-like [Nasonia vitripennis]
NCBI nr blastxgi|3454969331e-8141.98%PREDICTED: roundabout homolog 2-like [Nasonia vitripennis]
Group
Gene OntologyGO:00055153.1e-07protein binding
KEGG pathwaymmu:198763e-81 
 K06753 (ROBO1)maps-> Axon guidance
InterPro domain[39-135] IPR0137831.7e-20Immunoglobulin-like fold
[557-657] IPR0089571.2e-15Fibronectin type III domain
[239-316] IPR0035991.1e-13Immunoglobulin subtype
[42-134] IPR0130983.6e-13Immunoglobulin I-set
[54-123] IPR0035982.9e-10Immunoglobulin subtype 2
[567-652] IPR0039613.1e-07Fibronectin, type III
Orthology groupMCL19165 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202603-TA
ATGGTGGATGTTGGTGGTAATGAATGTGATAACACCGTAACTGGGAGGAACCAATTTGTTTATTTAGTTTTTGTGCATTTATTACTTATTGGACTGAATGGAACAAATGCACAAAATCGGGCGCCACGAATCAAAGAACATCCATCGAACACGGTATCTGGTAGAAGTGAACCCGCCACATTGAGATGTGTGGTGGAAGGTCGACCCAAACCAACAGTTCAATGGTTTAAGGATGGGTTTCCGTTACCACCAGCGGAAGATGGACATAGAGTTCTTCTGGAAGACGGATTACTCTTTTTAAGGGTAAACCGAGGGAAAAAAGAAAGTGATGAGGGGGTTTACTGGTGTGTGGCTCGTAATATAGCAGGGGAAGCTGCAAGTCAGAATGCTTCATTAAATGTTGCAGTTCTACGCGATGACTTTAGAGTTGAACCAAGAGACGTTCAGGTAGCTGCAGGAGAACCAGCTTTATTAGAATGTATTCCACCGCGTGGCGTTCCAGAACCTTCAGTTCACTGGCTGAAAGACGGGCAGTTATATGACATCGAGGTTAATGGAAGAGTAAAAATAACAGAAACGGGAAGTTTAAAAATTCTTGAAACGTTACCAAACGACAGTGGTCTATTTAGATGTGTGGCTTCCAATATAGCAGGGGAGAGACAATCGAGGGCAGCGGCACTTATTGTTCTGAGACGACCGCATTTTGTGGTGAAACCCAGTAACGTTACTGCTTTAGTTGGACAAAACGTTGAATTCAATTGTCAGGCTTATGATAATAGCGTAAAAGTAACCTGGGTTAAGGAAGATGGTGTATTGCCATCGCACGCAACTATAATCCGGGGGCTGCTCCGCCTGGAGCAGGTATCAGCGAGCGATGCGGGTGTTTACTCCTGTCGGGCAGAGAGTCACACAGGGTCGAGTGTCACTACTGCTTCTTTAACTGTTTACTCCTTACCGCACTTCACACAGATACCGTCCAATTTGACCGCATGGGAGGGAGATGTCGTGTCCATTCCATGTGAAGCAAAAGGATTGCCAACACCTACAACATTCTGGATTTTGGAAGGAAATGAGGATACTCTTTTGTTTCCGGAATGTACTAAAAGTAATTCTAGTTTATTGTATCTGGACGGAGCGAAAGTAGAACACAGTGGAAGGGTTATCTGTGTTGCTTTAAACGCGGCTGGCAGTGTTATGCAAGAAGCTTATCTAAATGTGTTAAAGAAAACCAATAGTCCATCGAAAGTACATTCTACTGATAAAAAGTTTGGTCATAATCGTGATCACGACGTCCGTATGACGGAGTACGAATTTATTCAAGCAAGGAAATATTTACAACAAAATGTTTTAGTCTTGAAGAGAGTAGAAACTTTGTCTTCGACGTCTGTTAAAGTTGTCTGGGACGTTGTGACTGATTACAATGAGTATTTAGAAGGTGTTAAAATCTGGTATAATGGTACTTCATTGAATTGGCGCAATAGAACCGAAGAACCTGTCTTATTGAGCCCTCATCATAGGGATAGTTGTTCTAATGGTTCTCTGATAGACGTAGATAATAGTGGTTCGTCAAGTTATATTCTATCAGGCTTATTGCCATACACACAATACGATATTTTTCTAATGCCATATTACAAAATGTTGCTTGGAAAGCCATCAAATTCAATGACTGGTTATACTGATGAGGATGTACCGTCAGCCCCACCCCTAAGTGTAACAGCGGGTGTAATTAACGCAACTTCTGCTTGGATTCGATGGGAACCACCGCCTGTGCACACTTGGAATGGAGAACTGACGGGTTATTTAATTGAGATCCGGTCTGGTGGTACGGGTGGTCGTGTCGTGGGTCAGATGTCTTTAGGGCCTCGTACACGCGCGGCCGCCGCAAGTTCGTTGAGAGCTGGACAGTACAGCGCCCGTGCTGCTGCCACTACACGTAAAGGACACGGAGCTTACAGTGCGGCTGTCATAATAGACATGATGTACATGCATTCCCAGAGACATTATGTCCAAACTGAGCCGCCGAATGACGCTACTATTCCACATTTGCTCCAAGAAACTTGGTTGCTTGCATTAGCATTAACATTATTCTCCGTCATTGTAATTGGCATAGTTAGTATTTATTACATAAAACGTCGGAATAATATTCAAAGAAAGAAATCAAATGGTCAATCTATAGTGACAGCACAACAATGTCTGCTGAATAAAGACACGATATGGCTGAGAGATAGACCGATATTTGCACCTCCAGATTCTACTTTAGATGTAGGTAGTTGTCATCAAAGTTTATTGCAAGGTGGTCAATCCCCTAATATTTTAAATATTGAGCCAGAATATTGCCTACCACAGAATTCAATTCAAGGTTTAGATGCTTCGAGTGTTGATAGAATAAAACGCGGGCCACCTGAGCCGTATGCATCCAGCGCTATTTACACTGAACTCAATTTCAAGGACAACACAGACATCGGAGATGCCAGAAGTTGTACCGAAAGACGTTCTCATAACAACAGTATTGTGCGATCTTGCAATGGCAGCATCCAATATTCTAATGGAGAATGTTCGACTTGTTGCCATAGCACTTCTAGTAGATCAACCAAGGATTACAGGGAGTTAAGACACGAGAATCCAAACGAGCTAGTAGTTGAAGACGATAATGCGTCATATCATACTAATAGATCGGGTTCTAAGAGTCAGGACAAAGCCAAAGTTTGTCCAACAACTGATTTAGAATACGATTACCCACAGTGGCACTGGTTAGGAAGAGAGAACAGTTTCAAAATTCCCATACAACAGCTCCGTTCGGAGCGCGCCTGTCAAATAAATCTGAATGATATACTGCCGCCGCCGTACGAGAGTCCAGAAAACAAAAGCCAGATAAAATAG

Protein sequence:

>DPOGS202603-PA
MVDVGGNECDNTVTGRNQFVYLVFVHLLLIGLNGTNAQNRAPRIKEHPSNTVSGRSEPATLRCVVEGRPKPTVQWFKDGFPLPPAEDGHRVLLEDGLLFLRVNRGKKESDEGVYWCVARNIAGEAASQNASLNVAVLRDDFRVEPRDVQVAAGEPALLECIPPRGVPEPSVHWLKDGQLYDIEVNGRVKITETGSLKILETLPNDSGLFRCVASNIAGERQSRAAALIVLRRPHFVVKPSNVTALVGQNVEFNCQAYDNSVKVTWVKEDGVLPSHATIIRGLLRLEQVSASDAGVYSCRAESHTGSSVTTASLTVYSLPHFTQIPSNLTAWEGDVVSIPCEAKGLPTPTTFWILEGNEDTLLFPECTKSNSSLLYLDGAKVEHSGRVICVALNAAGSVMQEAYLNVLKKTNSPSKVHSTDKKFGHNRDHDVRMTEYEFIQARKYLQQNVLVLKRVETLSSTSVKVVWDVVTDYNEYLEGVKIWYNGTSLNWRNRTEEPVLLSPHHRDSCSNGSLIDVDNSGSSSYILSGLLPYTQYDIFLMPYYKMLLGKPSNSMTGYTDEDVPSAPPLSVTAGVINATSAWIRWEPPPVHTWNGELTGYLIEIRSGGTGGRVVGQMSLGPRTRAAAASSLRAGQYSARAAATTRKGHGAYSAAVIIDMMYMHSQRHYVQTEPPNDATIPHLLQETWLLALALTLFSVIVIGIVSIYYIKRRNNIQRKKSNGQSIVTAQQCLLNKDTIWLRDRPIFAPPDSTLDVGSCHQSLLQGGQSPNILNIEPEYCLPQNSIQGLDASSVDRIKRGPPEPYASSAIYTELNFKDNTDIGDARSCTERRSHNNSIVRSCNGSIQYSNGECSTCCHSTSSRSTKDYRELRHENPNELVVEDDNASYHTNRSGSKSQDKAKVCPTTDLEYDYPQWHWLGRENSFKIPIQQLRSERACQINLNDILPPPYESPENKSQIK-