Monarch geneset OGS2.0

DPOGS211939
Transcript        DPOGS211939-TA  3609 bp
Protein           DPOGS211939-PA  1202 aa
Genomic position  DPSCF300011 + 557156-563447
RNAseq coverage   55x (Rank: top 69%)
Annotation
Heliconius      HMEL012812        0.0  88.69%
Bombyx          BGIBMGA000888-TA  0.0  83.06%
Drosophila      robo3-PA          0.0  46.60%
EBI UniRef50    UniRef50_Q174G3   0.0  44.99%  Roundabout n=9 Tax=Pancrustacea RepID=Q174G3_AEDAE
NCBI RefSeq     XP_970268.1       0.0  45.78%  PREDICTED: similar to roundabout [Tribolium castaneum]
NCBI nr blastp  gi|91089505       0.0  45.78%  PREDICTED: similar to roundabout [Tribolium castaneum]
NCBI nr blastx  gi|91089505       0.0  45.82%  PREDICTED: similar to roundabout [Tribolium castaneum]
Group
Gene Ontology  GO:0005515  1.5e-11  protein binding
KEGG pathway  gga:395291  5e-146
  K06754 (ROBO2) maps-> Axon guidance
InterPro domain
  [124-192]  IPR013783  5.7e-25  Immunoglobulin-like fold
  [489-604]  IPR008957  1.6e-21  Fibronectin type III domain
  [115-182]  IPR003598  5.3e-16  Immunoglobulin subtype 2
  [201-284]  IPR013098  2.2e-14  Immunoglobulin I-set
  [203-285]  IPR003599  1.5e-11  Immunoglobulin subtype
  [499-591]  IPR003961  1.5e-11  Fibronectin, type III
  [302-363]  IPR013151  6.7e-07  Immunoglobulin
Orthology group  MCL10143  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211939-TA
ATGAGCAGTGGGGAAGCGCCGTATATTATAGAACAACCAATAGATGTCACTGTGGCTCGCCATCAACCTGCAACATTAAACTGCCGAGCTGGCGGTTCACCCACTCCTACCATTCGGTGGTATAAAAGCGGTGTCCTTGTTGTATCGGACACCCATAGAAGCCTTTTACCTGCCGGCGGTTTATTCTTTCTGAGAGCAACCCATGGAAGGGCACATTCAGATGCGGGAGTGTACTGGTGTGAAGCCACCAATCCATATGGCACAGCTAGAAGTCGCAATGCTACACTTCACGTCGCTGTTCTTCGCGAAGAATTTAGACTAGAACCAGTAACTTCACAAACAGCTCAAGGAGAAACAGTAATACTAGAATGTTCACCACCGCGTGGCTCCCCTGAGCCCACAGTCTATTGGAAGAAAGATGGACAAATTTTACATTTCGACGGAGATTCAAGAATGCATTTAGTAGATGGAGGTAGTCTTGTAATACAAGATGCTACGCAAACTGACGCTGGTAGATATCAGTGCATTGCGAGGAATGCAGCTGGTACAAGAGAGTCTTCTATTGCTACGCTCAGAATACACATTAAGCCATACCTAATCGCCGGGCCTGAAGACATCGTTGCACAAACTGGAGGTAGCGTGACCTTCAAATGCAGGGTTGGAGGCGATCCCTTACCTGATGTTTTGTGGCGAAGAACAGCCGGAGGTGGTAACATGCCACTCGGGCGTGTCAAAGTGCTAGACGATCGTAGCCTCAGGCTTGACAACGTGATTTTGACAGACGAAGGGGAGTACACATGTGAAGCTGACAATGCTGTAGGAGCTGTGAGTGCCACAGGATATTTAACTGTATATGATCCCCCAACAGTTACATTAAAACCAAGTTCAGTTATAGTAGAAAGTGGAACAAGTGTTACATTTACATGTATGTCGACTGGAAAACCCCAAGCTACCATGTTTTGGAGTCTAGAAGGCAACCGAACAATAATATTACCAGGAACATCGAAAGGTAAATACCACGCTTCACCAGTATTGGATAATGTGACAAGATTAACAATCAATAATACTAGTAAAAATGAAAGCGGGAACACAGTAGTTTGTTCTGCAGTCAACTTTGCCGGAAGTTCTTTTATAAGAGGAAAAATAAGCGTGACAACAGACGATGACCGACCACCACCAATCATAACAAACGGTCCCTCTAACCAAACGTTACCTATAAAATCGATGGCAGTTTTTCCCTGTGCGGCCATCGGCACACCTGAACCAATTATTGCCTGGTATTTCGAAGGGGAAGCTTTAATACAAAATCAGAGAAGAAATATATCAAGTGATGGAACATTGACATTAAGAGATTTAGACAAAGAAGATAGTGGTACTTACACATGTGTTGCATCATCACAACATGGAAAGTATGTATGGAGTGGGGTTTTATTAGTTGACAGTCCAACAAATCCCAACATCCACTTCTTCAGAGCAGCTGACACATCAGCCTTACCAGGTAGACCAACGAAACCAATAACCTATAATATAACAGATACGAGTGTGACTTTAACATGGAATCAAAATAACAAAATTGGTTCATCTTCTATTATCGGATATCAAATAGAAATGTTTTCGAGGGAGACATTGTCAGGAAAAAATACCCCCAGAAACTCCAGAGGGTGGGTGGTTGTTTCAAAGAGAGTATATCAAACACATTTTGTAGTAAAATCTTTGATACCCGGTATAACGTATATGTTTATGGTGAGAGCAGAAAATTCTCATGGATTATCAGCACCGAGTCTAGTTTCGGATGCCATAACCGTAGGCGATGATACAAACGGTTTGTGGGAAGGTGGATTATTTGCAAATAACACCGAATATAGAAAAAATATTATGACTGATAATATTGTAGACTTAATTGAGGCGACGCCTATCGATTCCAAGACTATAAAGCTCATGTGGGAGATATTAAACTTCTTTTATCTAGAGGGCCTATTCATATATTATAGACCCTTAGA
TAATAAGACAATCGATTACGAAATGAAAACTATTTTGCATTCCAATGACGTATCTGGCTATGAAATAACGACCCTACGAAAATATATGAAATACGAGTTCTTCCTAGTACCCTTCTATAAGAAATTTGAAGGAAAGCCTTCGAATTCCAGAATAGCTCAGACGTTAGATGACGTTCCCGATGGTCCTCCAACAAATATCGAAATGTACATACTAAACGTCACAACAGTCCATTTGAAATGGCATCCCCCGGAGCCAGATTTGCAGAATGGTGTAGTTATAGGATATAACGTCGTCTTAAATTGGTTGGACATACCGGCTAACAAGTCTATGATAGCTATTAATACCACCGTCTATCAAGCCACAAGTCTTATAATGACTAATCTGACATCTGGCGTCAGTTATTCAGTGCAAATTGCTGCAGAAACTATTGTTGGTTTGGGGCCATTTAGCCAGAAGGTCTATTTGAATATAGATTCGCGCTCCGTAGGATTAGATCCACTGTCAAGATATCCCAATAATGGAGAAGTTTCCATTGTTGCCGGAGATTTTGTAATGGAAACATGGTTTTATTTCCTCATTGGGGCTATTGTGCTGTTTAAAGTTATTATGATTGCTGGCATTATTTATGTTCGAAGACATAACATATTTGCTAAAAAGTCAGCACTACCAAATATTTACAATTCCAATGGCACCAGTCTAGTTACACAAATGAATATTAAGGCCGCTGTATCGCTATCTCATCCGCTAAGTAGTTGTTACAATAAGAACTCCGTTACAAAAACGGAATCACTACTATGGATGGAAAACCAACCAGGCCTATGTATGTCCGGGAATCAAACGAGTCAGAGTAAAGAAAAAACCAATTCTGAATACGATAAAGTCAGTCATCAGCTTCCTGAATATGCAGAAGTCACTGCTTCGAGGGTGACTGGGAATGAATGGAATACGAGCAAAACTGCCACCTCGCCCGCCGCCTACGCCTCAGTGACACTAGTAGCAAACACGAGGCAGTGTGTTAGTTCGCTGGGTTGGTTTCCTCCGGGAAGTAAAAGTATTGATAACTATGAAAACCGCTATCCTGATGAAGAATTATACCCAGCTAGCAACGGAGGATATTACAACAGAAATGTGTATAGCGAGAAATATTTTCAAGGGCATCCTAACGTCTTGAAATTTTACGACCTTCCATCGGTAGACAAAACCCAAAAAGCGCAAATAAGATACAACCAAAGTTTAGATGAGAGGAAAGTTGATAAAAACTTTAAGGCTGAAGTAACCCAATCTCTCATAGGACGAAGTTACGGCGTGAGTTCTAGAAAAAATTCGGAATCGGAAACTACATACAGAGCATTCGGCGAGGATGATTCTGATGATGAACATTACAGAGATGATGGTGGCTATGATGAACTTCAAGCGATGCAGCCTCAGAGACAGAGATCGAAACATCAGTACGAAAGACCAAATTTCGATAATGACCCGACTAGAACACTAGCCCGTCTTACATCTTTTAAACAAGGTCAGAACATTCACAGCGGTAACCCAGGATCCCTAGCGCCAACCCAACCTCCACCGGCTCCGAATCCAAGGATTGATTCAGTGACACGGTAG

Protein sequence:

>DPOGS211939-PA
MSSGEAPYIIEQPIDVTVARHQPATLNCRAGGSPTPTIRWYKSGVLVVSDTHRSLLPAGGLFFLRATHGRAHSDAGVYWCEATNPYGTARSRNATLHVAVLREEFRLEPVTSQTAQGETVILECSPPRGSPEPTVYWKKDGQILHFDGDSRMHLVDGGSLVIQDATQTDAGRYQCIARNAAGTRESSIATLRIHIKPYLIAGPEDIVAQTGGSVTFKCRVGGDPLPDVLWRRTAGGGNMPLGRVKVLDDRSLRLDNVILTDEGEYTCEADNAVGAVSATGYLTVYDPPTVTLKPSSVIVESGTSVTFTCMSTGKPQATMFWSLEGNRTIILPGTSKGKYHASPVLDNVTRLTINNTSKNESGNTVVCSAVNFAGSSFIRGKISVTTDDDRPPPIITNGPSNQTLPIKSMAVFPCAAIGTPEPIIAWYFEGEALIQNQRRNISSDGTLTLRDLDKEDSGTYTCVASSQHGKYVWSGVLLVDSPTNPNIHFFRAADTSALPGRPTKPITYNITDTSVTLTWNQNNKIGSSSIIGYQIEMFSRETLSGKNTPRNSRGWVVVSKRVYQTHFVVKSLIPGITYMFMVRAENSHGLSAPSLVSDAITVGDDTNGLWEGGLFANNTEYRKNIMTDNIVDLIEATPIDSKTIKLMWEILNFFYLEGLFIYYRPLDNKTIDYEMKTILHSNDVSGYEITTLRKYMKYEFFLVPFYKKFEGKPSNSRIAQTLDDVPDGPPTNIEMYILNVTTVHLKWHPPEPDLQNGVVIGYNVVLNWLDIPANKSMIAINTTVYQATSLIMTNLTSGVSYSVQIAAETIVGLGPFSQKVYLNIDSRSVGLDPLSRYPNNGEVSIVAGDFVMETWFYFLIGAIVLFKVIMIAGIIYVRRHNIFAKKSALPNIYNSNGTSLVTQMNIKAAVSLSHPLSSCYNKNSVTKTESLLWMENQPGLCMSGNQTSQSKEKTNSEYDKVSHQLPEYAEVTASRVTGNEWNTSKTATSPAAYASVTLVANTRQCVSSLGWFPPGSKSIDNYENRYPDEELYPASNGGYYNRNVYSEKYFQGHPNVLKFYDLPSVDKTQKAQIRYNQSLDERKVDKNFKAEVTQSLIGRSYGVSSRKNSESETTYRAFGEDDSDDEHYRDDGGYDELQAMQPQRQRSKHQYERPNFDNDPTRTLARLTSFKQGQNIHSGNPGSLAPTQPPPAPNPRIDSVTR-
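
Length check (illustrative sketch, not part of the OGS2.0 record): the transcript and protein lengths listed above can be cross-checked against the two FASTA records. The Python below is a minimal example under two assumptions: the nucleotide record is a complete coding sequence that ends in a stop codon, and the trailing "-" in the protein record marks that stop. The function names read_fasta and lengths_consistent are hypothetical helpers introduced only for this example.

```python
def read_fasta(text):
    """Parse FASTA text into {record name: sequence}, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            name = line[1:].split()[0]  # name is the first token after '>'
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def lengths_consistent(cds, protein):
    """Return True if len(cds) == 3 * (number of residues + 1 stop codon).

    Assumes a trailing '-' or '*' in the protein marks the stop codon.
    """
    residues = protein.rstrip("-*")
    return len(cds) == 3 * (len(residues) + 1)


# Toy input only; paste the real records in place of this string to check them.
toy = ">toy-TA\nATGAGC\nTAG\n>toy-PA\nMS-\n"
seqs = read_fasta(toy)
print(lengths_consistent(seqs["toy-TA"], seqs["toy-PA"]))
```

The lengths stated in the record satisfy the same relation: 3609 bp = 3 x (1202 aa + 1 stop codon).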