Monarch geneset OGS2.0

DPOGS206558
Transcript: DPOGS206558-TA (4377 bp)
Protein: DPOGS206558-PA (1458 aa)
Genomic position: DPSCF300108 - 708244-724159
RNAseq coverage: 9x (Rank: top 85%)
Annotation
Heliconius: HMEL004375 | 0.0 | 78.61%
Bombyx: BGIBMGA013795-TA | 0.0 | 75.22%
Drosophila: Dscam3-PB | 0.0 | 32.64%
EBI UniRef50: UniRef50_A8JR35 | 0.0 | 33.36% | Dscam3, isoform C n=9 Tax=Drosophila RepID=A8JR35_DROME
NCBI RefSeq: XP_968319.2 | 0.0 | 33.54% | PREDICTED: similar to CG31190 CG31190-PC [Tribolium castaneum]
NCBI nr blastp: gi|189242122 | 0.0 | 33.54% | PREDICTED: similar to CG31190 CG31190-PC [Tribolium castaneum]
NCBI nr blastx: gi|340712210 | 0.0 | 33.03% | PREDICTED: Down syndrome cell adhesion molecule-like protein CG42256-like [Bombus terrestris]
Group
Gene Ontology: GO:0005515 | 8.4e-15 | protein binding
KEGG pathway:
InterPro domain:
[890-995] | IPR008957 | 2.5e-25 | Fibronectin type III domain
[898-993] | IPR013783 | 7e-22 | Immunoglobulin-like fold
[901-985] | IPR003961 | 8.4e-15 | Fibronectin, type III
[499-592] | IPR013098 | 1.5e-12 | Immunoglobulin I-set
[411-483] | IPR003598 | 5.5e-11 | Immunoglobulin subtype 2
[201-302] | IPR003599 | 5.1e-09 | Immunoglobulin subtype
Orthology group: MCL10022 | Multiple-copy universal gene

Nucleotide sequence:

>DPOGS206558-TA
ATGCAATGTGCTGCCACTGGAGATAGACCTCCACAGTTTGTTTGGGAGCGAGATGGAGTTGCTGTATCAAGCAACACAGATCCTAGGTATGCGTTGGGACAAATTATGACGGCAGATAACTCTGTAATAGCCCAACTCAATATAACCAGAGTCCGAGTGGAAGATGGCGGATTGTACGCTTGTATAGCGAAGGAAGGTGAACATTCAGCCAGCAGTGAAAACAGACTGGACGTTTATGATTTAGATAAAAGAAGACGATTACGTCGAGCTCTCACAGATAATAGGCTGGTTATAACCCAACATTTTACTGAGCGAACCGTAACTCCAGGTGGTGATATAAGTATGCAATGTGCCGCCACCGGAGATAGACCTCCACAGTTTGTTTGGGAGCGAGATGGAGTTGCTGTATCAAGCAACACAGATCCTAGGTATGCGTTGGGACAAATTATGACGGCAGATAACTCTGTAATAGCCCAACTCAATATAACCAGAGTCCGAGTGGAAGATGGCGGATTGTACGCTTGTATAGCGAAGGAAGGTGAACATTCAGCCAGCAGTGAAAACAGACTGGACGTTTATGGTCCACCTTACATTCGGTCACTACCACCAATTAAGGCCCAAAGCGGTGAATCAATAAATCTTAGATGTCCATTCTATGGATATCCAATAAGCAAAATCGATTGGGAACACAAAGGCAAAACGGTTAACCTAAACAGTCTTTTTCAATCACGTTATAAAAGAACACAAAATCACCAAAAAAGAAAATCAAGAAGAAGCATTGTCAATATACACGGAGTATTGACAATTCCTGAAGTAAATAAAGAAGATAATGGAGCTGTTTATACTTGTATAGTGACATCGCCGTCTGGTGAAATGGCTAGACGATCGTTTGAAATACAAGTTATAGAGGCACCCATTTTGGAGGATTTACTTCTTGGTAATAACCTACAGGAAGGACAAATAGTAAATATTTATTGCAACGTACGAAGCGGTGATTTACCAATACATTTCGAATGGTTAAAAGACGGAAAAAGAATTTCAAGCAATTTGAAGGTAATCGAAAGAAGTTCGGAATTATTCAGTGCTCTGGTTATCAAGAAAGTTGCGTTGGAACATTGTGGTACATACACTTGCGTAGCGTCGAACCACGTAGCCAAGGTCAATAAGTCCACGGAATTATATATCAAAGTTGCACCGAAATGGCTAGAAGAACCTTCAAATTCGTCTCTTTTGCTCGGAAGAAAAGGCATTGTATCTTGCAGCGCGAGCGGCTATCCACAGCCACAAGTACATTGGATGAAAAAAGATGCTCTTCTTGGTACTTGGCAACCTGTCCTTGAACTAGCTGGAGGAGGGATCCTAAGTCTTCCCAACGGAAGTCTTGTCATTGAAGAAGTTTCCTTGACAGACGAGGGCTTGTATTCATGTAATGTGGAAAATGGTGTTGGAACACCACTGAGCAAAACTGTCTGGATGACAGTGAATAAACCAGTGCACTTTGACACGTCATCAGCTAATGTAACATCACGACTGGGCCACGCCGTCTCTTTGGAATGTCGCGCGCTTGGTGATGACCCTATCAGAATCACATGGAACCATAATGGAAACAATATAGACTTTCAGAGTCACAGAGCAAAGCACTCAGAGACAAAAACCTCGCTTGGTGTTACCAGCACCATAAACATACAGTATTCAGAAACAGCTGACGCTGGCTTCTATCAATGTCGAGCGAGCAATCCGTATGGATCGGCTAGCTTCAACACATTCCTTACCATATTAGAACCCCCCACCTCTCCCTCCGAACTCAGGGTAGAAACAGTTCGATCTCGTACAGCATCAGTGTCATGGCGGGACGGGGCGCACGCTGAGTATTACACATTACAACACACGCCTGCTCATTACGCTGATGACTGGGAATACGCCGTTAGTCTCAATATTACAAGGAAAGGCAACGACATTCGACAAACGATAGAGCTACAAAACATCAGACCAGCGACGGC
TTACGCGGTGCGAGTAGCGACTGGTAATGAAGTGGGTGTCAGTCGTTTCACAACGCCTGTACATTTTACAACACACGAGGAAGCTCCATCTTCAACACCAATTAATATACAAATTGAACAAACTGAAAATCCTGGCGAACTGTTTGTCTCCTGGCTGCCTCCACCCAAAGACACTCACAACGGCATCATAACAGGATACCACGTGAAGGCTGTTCCACAACAAGGTTCCCCTATTTTAGCAGATACTGATACAAGGACTTTGAAAGTTTCAAAGTTATCCGGAAAACAAGAGACTGTCATCAGTGGTCTACTCAAAAATACCAGATATGCTGTCTCCATATCAGCTTTCAACAGTGCCGGATCAGGTCCATTTTCGATTCCAGTATTTCAAGATACTATGGAAGGAGCTCCCGAAATCGCTCCATCATCAGTTGAGTGTACGGCCGTATCATCATCGTCACTGCGCGTCGGCTGGCAGCCCATAGCAATACACGAACAGGGAAGTTCTTTGATCGGTTACTCCATTCTGTATGGAACTGAAGAAAGCTCTTGGCAGAACCAAACTTCTCTCCACACTGAAATTTATCTTCAAGGTCTTTCTAAATACAGCAACTACACGATAAAAGTTGCAGGTTTTTCCAACTACGGTGCCGGGCCGTTCTCGTTCCCAATAGTTTGCACTACGTTACAGGATGTTCCTGATCCACCTTCAGATATCAAAGTACTTATCCTATCATCCACGTCACTGTTGGTCAGTTGGAAGAAGCCGGAACATGCTAACGGGGAACTTTTATATTATACGGTTTATGTGAAACCAACTTCAGGCACTGGTCCACCTCTAACGAGTCGTGTGGATCCTTCTCGGACGACAGCTGATATCAAGGAGTTGACGAGCGGAAGGTCGTATGAAGTGTGGGTGACTGGAAGTACCGCGGCTGGTGAGGGCGCACCTAGTAGGAGAGCCTCACATACACCCGCTAATAGAGTTGTGGCAGGTGTCGCATCATTAGGTGGCACCATATCAGTGGGAGTGGGTTCATCCTTACTACTCGTGTGTCAGTGTATCGGAGTACCACCACCTCGCACTGTGTGGTACCACAAACATAATATAATAACTCACCATCCAAGATTCACGAGAAATCATGACGACAGCTTGCTGATTAACAATATAGACCAATCCCTGAGTGGGAACTACACGTGTCTTGCAAAGAATCTTTATGGTTCTGATTCAGTGGAGTACACCGTTGTAGTTCTTCCGCTGCCCGAAGCACCAACACTACGAGCCACGCCATATAAAGACTCCATATTAGTTGAATGGGAACAACCTTATTATAGCATCAGTAACCGTAGCTCGAACCAGAAGATTACTTATAGCCTAACATGGAAAGAAGCAAGCGGTCCGTGGCAAGAAGTATGGTCACCAAATAAAGTACCTAATAAATTATCCAACCTCTCTGGGGTTCAAAAGCATGCGCTAACTGGCCTTAAATGTGGTACAAAATATTCATTGAGGATCACAGCTACTAACAAAGTTGGGACGTCACAACCAGCTTATTTGGATGTTAGCACGCTGGGCGGACCTCCCATTGCGCCGACCTCAACGGAGTGGTTCTGGAGCAATTGTTCCCACGTGTTTATCCAGGCTGCTGGTTGGGATGACGCGGGATGTGAGCTCAGAACCTTGGAGTTGGAACATCGAGCTCTTGGAGCCAGGAGTTGGATGAGACCTATCAATATATTGTCGTACACGGGCTATCCCTACCAGTATCGAGGATCCTTCGCTCTGTCAGGATTGTCCCCCGGGACATGGTACGTCCTCCGAATCACCGCCACCAACGAGGCTGGTAGCGTTACCACCGTTTACAACTACGCCACTAAAAACGAGGATGGCAGTGAAGTTGGTCCACCGTCGGAAATCTTTGACATCAACATGCTCGTGATAGTCCTAAGCTCAATTCTCCTAGCCGTCTGTCTCATCTGCTGCGTTTATATTCTGGTTA
AGAGACAACGCAATGGCAACTTAACGGAATACCGTGACTCAATAACAGTCGACAAATCTGAGAGTGGCAACATAACAGCAAACACGTCACACAGTAATCTAGCGAATGTCAAAGAGAATGCTATGAACGCTCAGAACAGAATATACAGCGCTCCAATACATGTTAGGAATAATAGCAAACATGAATTATATGAGATAAGTCCTTACGCCCAATTCGCTGTCGGCTTTCGAACGTTCGGTCACGCTGAAAACAATGATGTTCCAAGACATATCAAACACAGATACGATACAGAAACGAGCTTTCAAGTTTGCTCTGAGTCAGAGGACAGTGACAGCATATCAAAATCAACCCTGAAGAGCGTACCGAGAAGTAAGTAG

Protein sequence:

>DPOGS206558-PA
MQCAATGDRPPQFVWERDGVAVSSNTDPRYALGQIMTADNSVIAQLNITRVRVEDGGLYACIAKEGEHSASSENRLDVYDLDKRRRLRRALTDNRLVITQHFTERTVTPGGDISMQCAATGDRPPQFVWERDGVAVSSNTDPRYALGQIMTADNSVIAQLNITRVRVEDGGLYACIAKEGEHSASSENRLDVYGPPYIRSLPPIKAQSGESINLRCPFYGYPISKIDWEHKGKTVNLNSLFQSRYKRTQNHQKRKSRRSIVNIHGVLTIPEVNKEDNGAVYTCIVTSPSGEMARRSFEIQVIEAPILEDLLLGNNLQEGQIVNIYCNVRSGDLPIHFEWLKDGKRISSNLKVIERSSELFSALVIKKVALEHCGTYTCVASNHVAKVNKSTELYIKVAPKWLEEPSNSSLLLGRKGIVSCSASGYPQPQVHWMKKDALLGTWQPVLELAGGGILSLPNGSLVIEEVSLTDEGLYSCNVENGVGTPLSKTVWMTVNKPVHFDTSSANVTSRLGHAVSLECRALGDDPIRITWNHNGNNIDFQSHRAKHSETKTSLGVTSTINIQYSETADAGFYQCRASNPYGSASFNTFLTILEPPTSPSELRVETVRSRTASVSWRDGAHAEYYTLQHTPAHYADDWEYAVSLNITRKGNDIRQTIELQNIRPATAYAVRVATGNEVGVSRFTTPVHFTTHEEAPSSTPINIQIEQTENPGELFVSWLPPPKDTHNGIITGYHVKAVPQQGSPILADTDTRTLKVSKLSGKQETVISGLLKNTRYAVSISAFNSAGSGPFSIPVFQDTMEGAPEIAPSSVECTAVSSSSLRVGWQPIAIHEQGSSLIGYSILYGTEESSWQNQTSLHTEIYLQGLSKYSNYTIKVAGFSNYGAGPFSFPIVCTTLQDVPDPPSDIKVLILSSTSLLVSWKKPEHANGELLYYTVYVKPTSGTGPPLTSRVDPSRTTADIKELTSGRSYEVWVTGSTAAGEGAPSRRASHTPANRVVAGVASLGGTISVGVGSSLLLVCQCIGVPPPRTVWYHKHNIITHHPRFTRNHDDSLLINNIDQSLSGNYTCLAKNLYGSDSVEYTVVVLPLPEAPTLRATPYKDSILVEWEQPYYSISNRSSNQKITYSLTWKEASGPWQEVWSPNKVPNKLSNLSGVQKHALTGLKCGTKYSLRITATNKVGTSQPAYLDVSTLGGPPIAPTSTEWFWSNCSHVFIQAAGWDDAGCELRTLELEHRALGARSWMRPINILSYTGYPYQYRGSFALSGLSPGTWYVLRITATNEAGSVTTVYNYATKNEDGSEVGPPSEIFDINMLVIVLSSILLAVCLICCVYILVKRQRNGNLTEYRDSITVDKSESGNITANTSHSNLANVKENAMNAQNRIYSAPIHVRNNSKHELYEISPYAQFAVGFRTFGHAENNDVPRHIKHRYDTETSFQVCSESEDSDSISKSTLKSVPRSK-
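
The transcript and protein records above can be checked against each other: 4377 bp is 1459 codons, which is 1458 amino acids plus one stop codon, and the protein sequence ends in "-" where the transcript ends in TAG. The following is a minimal Python sketch of that check, not part of the original record. It assumes the standard genetic code and assumes that "-" marks the stop codon in the protein listing. The helper names read_fasta and translate are hypothetical, chosen for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def read_fasta(text):
    """Parse FASTA text into {record_id: sequence}, joining wrapped sequence lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {rec: "".join(parts) for rec, parts in records.items()}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol (assumed here to be "-", matching
    the protein listing above); codons with ambiguous bases become "X".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 39 nt of DPOGS206558-TA, wrapped over two lines to exercise the parser.
    example = ">DPOGS206558-TA\nATGCAATGTGCTGCCACTGG\nAGATAGACCTCCACAGTTT\n"
    cds = read_fasta(example)["DPOGS206558-TA"]
    print(translate(cds))  # MQCAATGDRPPQF, the first 13 residues of DPOGS206558-PA
    # Length consistency of the full record: 1458 residues plus one stop codon.
    print(4377 == 3 * (1458 + 1))  # True
```

To check the whole record, paste both FASTA entries into one string, pass it to read_fasta, and compare translate(records["DPOGS206558-TA"]) with records["DPOGS206558-PA"].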