Monarch geneset OGS2.0

DPOGS213998
Transcript: DPOGS213998-TA (3918 bp)
Protein: DPOGS213998-PA (1305 aa)
Genomic position: DPSCF300389 + 32322-61388
RNAseq coverage: 12x (Rank: top 83%)
Annotation:
  Heliconius      HMEL017157        0.0     61.58%
  Bombyx          BGIBMGA013795-TA  0.0     48.17%
  Drosophila      Dscam3-PB         8e-157  30.46%
  EBI UniRef50    UniRef50_Q7KSE9   1e-154  30.46%  Dscam3, isoform B n=9 Tax=cellular organisms RepID=Q7KSE9_DROME
  NCBI RefSeq     XP_002069676.1    5e-165  31.15%  GK11449 [Drosophila willistoni]
  NCBI nr blastp  gi|350398908      7e-155  30.77%  PREDICTED: Down syndrome cell adhesion molecule-like protein CG42256-like [Bombus impatiens]
  NCBI nr blastx  gi|350398908      7e-162  30.33%  PREDICTED: Down syndrome cell adhesion molecule-like protein CG42256-like [Bombus impatiens]
Group:
Gene Ontology: GO:0005515  3.1e-12  protein binding
KEGG pathway:
InterPro domain:
  [853-919]  IPR013783  4.2e-21  Immunoglobulin-like fold
  [410-529]  IPR008957  7.8e-20  Fibronectin type III domain
  [328-420]  IPR013098  3e-12    Immunoglobulin I-set
  [729-820]  IPR003961  3.1e-12  Fibronectin, type III
  [339-411]  IPR003598  5.5e-12  Immunoglobulin subtype 2
  [129-238]  IPR003599  5.4e-07  Immunoglobulin subtype
  [137-221]  IPR013151  8.1e-07  Immunoglobulin
Orthology group: MCL10022 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213998-TA
ATGGCAAGGAAGAAAGCAGAAGGCATGTACACTGGATATCTTGGCATTTACAGAAGAAAAAGAACCCTACCAATGATGATATCTCATCACTTTAGCGAAACCACAGTAACACCTGGTGTTGATGTAAATCTTCAGTGTGTTGTTAACTCTCCTCATCCAGCAAGATTTGTTTGGGAACGAGACGGCGTTGTCATTTTATCTAATACTGATTCAAGGTATTCCATAACACAAACAATGACAACGGATGGTGTGTCTACACAACTTAATATTTCCCACGTTAGAGTTGACGATGGAGGTCGGTTTGCTTGTGTGGCTCATCTTGGTGAATCCACAGTGTTCCATGAAGATAGGGTCAATGTTTACGGTCCACCTTACATCCGGACGTTACCACCCTTCAAAGTTCAAAGCGGTCAAAGTGTCACACTCAGATGCCCTTACTATGGATATCCAATACGAGAAATAACTTGGGAGTACAAGGGAAAAGAAATTATTCCAGAAACAACTCAAACTAGATACAAACGCTTCATTAATGATACCGCTAATATAGAAATTTTCGGCCGAAAACCAAAATTAAGAAGTAAAAGAGACGCAACAGTGACGAATGGCGTTCTTAGTGTTAATAGAGTTTCTAAAAGCGACAATGGACTTTTTGCGTGCATAGTAAGAAGTCCTTCAGGTGAAATGGCAAAACGCTCTTTTGATCTGCAAGTAGTGGAAGCACCTCAACTGGAAGAAATACTACTGGCCCCTGATTTACAGGAAGGACAAATTGTGCAAATACATTGCAACCTCAAAATAGTGTCAGAGACTTGGCGACCAGTTTTGGAAGTGGCTGGTGGAGGGGTTTTAAGTTTATCGAATGGGTCACTCATATTTGATTCTGTTGCTTTATCTGATGCTGGAATTTATACGTGTCATGTAGAAAATGGGGTTGGGGAAGCATTGAGCAAAACTATTTGGATATCAGTAAATAAACCGGTAACGTTTGATATAGTATCCAGAAATTTAACAGCAAAATTAGGTCAACATGTTACCATCGAATGTCAAGCAAAAGGAGACGATCCAATTCGGATCATGTGGACGAGGAATGGGAAACCAATTAATCCTCTTACACAAAGATTAAAAATATCGGAAGCTAAATCAGACGATGGTATGACGAGTTTGCTTGAGTTGATACAATCAGAGACAGGCGACGCCGCTTTATATCAATGTAAAGCTGGAAACCCCTTTGGAGCTGACGTTTATAGTGTTTATTTAAGTATACTAGAACCTCCGTCACCACCAACGGATCTTACGGTGGACTCTGTAACAAGCCGATCTGTTAAACTCTCATGGAGGGATATGACCCGTTCCTTAACGCAATACTATAGTGTTCAGGTCACGAATAGTGATAGACTAATGTGGAGTACCGCAAGGACCATCAATGTAACCAGGTTAGGCGATAATCAGCATAGTGTGGATATCACTGGTCTGCAGCCTGCCACACGGTACGCAGCTCGGACGGCGGCGGGGCGATCCTCTGACATTAGTGCTTATTGTGCTCCCGTGAGATTCACAACTTCAGAGGAAGCGCCTTCATCTCCTCCACTAAACATACAAGTATCGCAAACAACGTCTCCGGGAGAATTACGCGTCAAATGGTTACCCCCTCCAGCTGATACCCTGCATGGAGTAATCTTAGGGTACAGGGTAAAAGCTGTACCGCAGGAAGATACCGGTATTCAAGAAACGAGTGACAAAATTATCAAGACTACAGCACTATATTCAAAACAAGAAACCGTCATATCTGGCCTCTTGAAAGGAGTCAGGTATTCGGTATCGATTGCGGCGTTTAACAGCGCCGGGAATGGACCCTTCTCCATACCGCTATTTCAAGATACAAGAGAGGGAGCCCCCGAAGAAGGTCCAACCTCAGTCGAATGTGGAGGTGTGACGTCATCAGCACTTCGAGTTAGCTGGCAGCCCATACCTGTTCACAGACAAGCCGGCTCTCTCGTTGG
CTATTCCGTGTTATTCGCGGCTCAAGGTCGTCCATGGCAAAACGCAACATCTATAGTCACGGAAATGCGTTTACAAGGACTCCATAAGTTTACTAACTACACCGTCAAAGTTGCGGGATACTCTAACTATGGAATAGGACCCTTTTCCTTCCCCATTGTGTGTTCTACGCTACAGGATGTTCCGGATGCTCCGTCTGAGATCAAGCTTCTCGTGAGTTCAGCTAATTCCCTGCTGGTGAGCTGGAAACCGCCGCGGCCGAATGGAAGACTTCTGCATTACACCGTGTACTCTAAATTGACCGCCAGCAATGATGGCCCGCAAATCCATCGTGTGGATATAGAGTCAAACATAGATGCTTACGAACAAACTCAAAGTTTGGAACTCAAAGGTTTGGTAGAGGGCAGACAGTATGACGTGTGGGTGAGTGCTAGTACAGCTGTCGGCGAAGGTCCGGAAAGCAGACGGGTCAGTAACGCACCATCACAAAGAGTGGTAGCTGGTGTTTGGTCTCTTGGGGGAAGAGTGTCAGTTCGTGTACACTCAGCGTTACCACTGGCATGTCGGAGCGTAGGCTCTCCACCACCACGTACTGTGTGGTACCACAATCATAATATCATTACACACCATCCGCGATTCACGCGAAATAAAGATGACAGCTTACTTATTAAGAGCATAGATCAATCGCTCAGTGGCAACTACACTTGTCTTGCAAAGAACCTATATGGATCAGACTCAGTCGTATACTCAGTGAGAGTCCTACCACTACCTGATCCACCATCTTTGAGAGCGACACCTTATAAGGACTCAATAGTAGTTGAATGGGATGAGATAAAAATTTCTAACGAATCTGGTTTTGGCGTTAGCTACAATCTTACCTGGCGTGAAGAAGATGGTCCTTGGCAAGAAGCTTGGCCCACAACACGGTTGCCAAATTCCCAACAGCAGCTTCCAGGTGTCCAACAGCATGCCCTGACTGGTTTGAAGTGCGGCACTAAGTACTCCATTCGAGTTACCGCTACAGATAGCGTTGGCACATCCGCACCAGCTCATGTTGATGTTACTACTTTGGGTGGAGCACCAGTCTCACCATTATCGACTGACTGGCTGTGGAGCAACGCTACTCACATTTACATACAGCTGAGTGGTTGGGACGACGGTGGTTGTGACGTCACAAAGTGGGACGTTGACTATCGAGCTCTCGGTACAAGTTTCTGGCACCGAGCTGATAATTTAGCTGTCCACACTAACTCTCCACACCCACAGACTTTAGATCCCAACCTTGGTTGGGGCTACAATTACGCGCGATTACCTACATCTTACGCGCTCGGCTCCCTTACCCCTGGGACGTGGTACCAAGTCCGCGTGACTGCATACAATGATGCTGGAACAGCCGCTACAGTATACACGTACGCTACCAAGACGGAAGATGGTGAAGAAGTTGGTCCACCATCAGATTACTTCGATTTGAATATGTTGGTAATAATATGCAGTTCAGTGCTGTTGGTGATATGTCTTCTTGCATTTATCTGTATACTGTTGAAAAGGCATAGACAGAATTACTCATCATATCGCCACTCAATGGCTGAAGAAGTTAAATCTCGGGACGAAAGTGTCGCATCACATTCTGAGCACAAGGAACGCTATGCACATACTCCTAGGATTTATACCTCCCCTGTACATCCCAGGAAGACTAGTAAGAATGAAATGTTTGAGATAAGTCCTTATGCCGAATTCGCTCTTGGATTCAGGACTTTTGACCACGTTGAAAACCAAGATTTGCCTAGCAGACTTCCATCTAGACCGAGGTTTGATACTGTAGTCCTTAATTTACCCGAAGGATTCTGTTCCTTCTCCGACGATTATCTATATCGCCGTACCGTACCGTACCGAGTAGATGGACAGTGCATTTTAGAGTAG

Protein sequence:

>DPOGS213998-PA
MARKKAEGMYTGYLGIYRRKRTLPMMISHHFSETTVTPGVDVNLQCVVNSPHPARFVWERDGVVILSNTDSRYSITQTMTTDGVSTQLNISHVRVDDGGRFACVAHLGESTVFHEDRVNVYGPPYIRTLPPFKVQSGQSVTLRCPYYGYPIREITWEYKGKEIIPETTQTRYKRFINDTANIEIFGRKPKLRSKRDATVTNGVLSVNRVSKSDNGLFACIVRSPSGEMAKRSFDLQVVEAPQLEEILLAPDLQEGQIVQIHCNLKIVSETWRPVLEVAGGGVLSLSNGSLIFDSVALSDAGIYTCHVENGVGEALSKTIWISVNKPVTFDIVSRNLTAKLGQHVTIECQAKGDDPIRIMWTRNGKPINPLTQRLKISEAKSDDGMTSLLELIQSETGDAALYQCKAGNPFGADVYSVYLSILEPPSPPTDLTVDSVTSRSVKLSWRDMTRSLTQYYSVQVTNSDRLMWSTARTINVTRLGDNQHSVDITGLQPATRYAARTAAGRSSDISAYCAPVRFTTSEEAPSSPPLNIQVSQTTSPGELRVKWLPPPADTLHGVILGYRVKAVPQEDTGIQETSDKIIKTTALYSKQETVISGLLKGVRYSVSIAAFNSAGNGPFSIPLFQDTREGAPEEGPTSVECGGVTSSALRVSWQPIPVHRQAGSLVGYSVLFAAQGRPWQNATSIVTEMRLQGLHKFTNYTVKVAGYSNYGIGPFSFPIVCSTLQDVPDAPSEIKLLVSSANSLLVSWKPPRPNGRLLHYTVYSKLTASNDGPQIHRVDIESNIDAYEQTQSLELKGLVEGRQYDVWVSASTAVGEGPESRRVSNAPSQRVVAGVWSLGGRVSVRVHSALPLACRSVGSPPPRTVWYHNHNIITHHPRFTRNKDDSLLIKSIDQSLSGNYTCLAKNLYGSDSVVYSVRVLPLPDPPSLRATPYKDSIVVEWDEIKISNESGFGVSYNLTWREEDGPWQEAWPTTRLPNSQQQLPGVQQHALTGLKCGTKYSIRVTATDSVGTSAPAHVDVTTLGGAPVSPLSTDWLWSNATHIYIQLSGWDDGGCDVTKWDVDYRALGTSFWHRADNLAVHTNSPHPQTLDPNLGWGYNYARLPTSYALGSLTPGTWYQVRVTAYNDAGTAATVYTYATKTEDGEEVGPPSDYFDLNMLVIICSSVLLVICLLAFICILLKRHRQNYSSYRHSMAEEVKSRDESVASHSEHKERYAHTPRIYTSPVHPRKTSKNEMFEISPYAEFALGFRTFDHVENQDLPSRLPSRPRFDTVVLNLPEGFCSFSDDYLYRRTVPYRVDGQCILE-
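
The figures in this record can be cross-checked against one another. The sketch below is illustrative only and is not part of the geneset pipeline. It assumes that the transcript listing is coding sequence only (no UTRs), that the standard genetic code applies, that the trailing "-" in the protein sequence marks the stop codon, and that the genomic coordinates are 1-based and inclusive. The function names `translate`, `cds_length_consistent` and `span_length` are hypothetical helpers introduced for this example.

```python
# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


def cds_length_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True if transcript_bp equals 3 nt per residue plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (an assumption here)."""
    return end - start + 1


# First eight codons and last five codons of DPOGS213998-TA, copied from the record.
print(translate("ATGGCAAGGAAGAAAGCAGAAGGC"))  # MARKKAEG
print(translate("TGCATTTTAGAGTAG"))           # CILE-

# 3918 bp = 3 * (1305 aa + 1 stop codon)
print(cds_length_consistent(3918, 1305))      # True

# Genomic span on DPSCF300389 under the 1-based inclusive assumption.
print(span_length(32322, 61388))              # 29067
```

Under these assumptions the translated ends agree with the start (MARKKAEG) and end (CILE-) of DPOGS213998-PA. The genomic span of 29067 bp is much longer than the 3918 bp transcript, which is what would be expected if the gene contains introns.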