Monarch geneset OGS2.0

DPOGS213440
TranscriptDPOGS213440-TA5229 bp
ProteinDPOGS213440-PA1742 aa
Genomic positionDPSCF300356 + 56223-78441
RNAseq coverage28x (Rank: top 76%)
Annotation
HeliconiusHMEL0130600.085.29% 
BombyxBGIBMGA005747-TA0.031.87% 
DrosophilaCG42330-PD5e-17528.04% 
EBI UniRef50UniRef50_A8JR350.030.19%Dscam3, isoform C n=9 Tax=Drosophila RepID=A8JR35_DROME
NCBI RefSeqXP_001955116.10.030.18%GF18608 [Drosophila ananassae]
NCBI nr blastpgi|3407122100.031.13%PREDICTED: Down syndrome cell adhesion molecule-like protein CG42256-like [Bombus terrestris]
NCBI nr blastxgi|3407122100.031.14%PREDICTED: Down syndrome cell adhesion molecule-like protein CG42256-like [Bombus terrestris]
Group
Gene OntologyGO:00055151.3e-14protein binding
KEGG pathway 
InterPro domain[1106-1207] IPR0137831e-24Immunoglobulin-like fold
[1109-1209] IPR0089574.3e-23Fibronectin type III domain
[1008-1097] IPR0039611.3e-14Fibronectin, type III
[792-874] IPR0130983.1e-11Immunoglobulin I-set
[1328-1391] IPR0035983.7e-10Immunoglobulin subtype 2
[787-876] IPR0035991.2e-09Immunoglobulin subtype
Orthology groupMCL10022 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213440-TA
ATGATAGCGATAACGCTGGTGACGTCAGTCGCGTCAACAGACAAACGGCCACATTTCACCGTGGAACCTCCACCACGCGTGTTATGGCCGGCCACAAGGGGTGCCCACGCGCTTTGTCGAGCCTCAGGACACCCCACACCAGAAATCCACTGGGTGACTGCCGAAGGTCAGCTAATCACCACCATCCCTGGCTTAAGGCACGTGTTAGCGGATGGTAGGCTAGTGGTGGGTGCTCGAAGGTCACTTGTAGATGCAGGTGGTGGTGGTAGCAGTTCCTTATTACGATGCAGGGCTTCCAACCGAGCTGGTGTCACACTGTCCAGACCCATGTTACTACAAGCAGTGGAAGACAGCGTCTTATCACATCTTGTGAGACAGCTAATGCCAGCGAAGGTCGGAGGTGCTGTTGTGCTCCGTTGTGAATTGTCACCACAGTTGGGATTGGTAGACGTTACGTGGCTTCAAGATGGCTTGCCATTATCATCAGGATATATGGGAGCTGATACACGGTGGGTGGCTGGGGCTGGCTTGTTACTTGGCTCGGATTTAACGCAGGCTGATATACAAGCGGAATATTCATGTTTGGCGCTTGATGTCCCCAGCCCGTCTTTGAAACTTACAACAGATGATAGTAGTTGGCTGGAAAATATTGAAGTTAATTCTATTGAAGCTCGTATTGGAGATACAGTCTTAGTGCCATGTATTATAAGACATATCAATAAACACGCTATTATTTGGCAGCGACAGGAAGCAAACGGTGTGTGGATAACTGCTACTGAGGGTGTTGTTCGTGGAGGTACACTTATTTTGCCCAATGTGCGCGCTCACCATGCGGTGCGATACGCTTGCACAGCGGGCACTGAAACCTCACCTCGTGTTATTGTACGTCTAGTTTTATACGAGCCTTTAACAGTTTCAGTCACTCCTAATCCACTGATGCTCGGTACTGGTGGTAGCGGTAGTTTTAATTGCTCAGTTCATGGCGGCAGAGGTGGAGGGGTTACTGTAAGCTGGCAACATGAAGGTCGTCCATTGCACTCTCCTCAACCGCGACTAGCTTTAGGACCAGTTCGTTTACACCATGTCGGAATGTATCAGTGCACTGCGTTTGATCATTCGGATTCCGCTGTGGCTGGAGCTGAATTAGTTATAGCTGACAGACCACCCAGATTGGTATCTACATTTAGTGAAGCAATAACACGCATTGGTCAAACTATTGCGTTTAGATGTGCAGCAACAGGAATTCCACCTCCTAGAATTTTGTGGACTTTAGACAATAGACCGTTGCCAGCAGCACCCGAAAAATATAATGTCAACACTAGAGCTGAAACAGTACCTGGAGGGGAAGAAGGAACGATAGTTTCTACTTTGAGTATAACTATTAGAAGTGCTGATGAAGGCGGTAGATATGGATGTAATGCCTCTAATTCTGGAGGATTTCAAGCACATCACGCACGGCTCAATGTTTTTGGCCCTCCAAATATACGATATTTACCTCCAATGCACATCAAAACCGATAGCGATGTAACATTGTACTGTCCTTATTCGGGTTATCCGTTAAAAGAGGTAGTCTGGCGTCGTGAAGACGGTTCATTATTGGATGGTCCTATTGAAAGCAACGATGAGAAAGGCATGTTGAGATTGACTTCAGTTAGAAATGCGGTATCAGGCAGTTATACTTGCGAAGTAAGAGCTGAAAGCGGAGAACTTGCAAGAAGAACTGTTAGAATTGAAGTACATAAACCTCCTAAAATAGCGCCGTTCCAGTTCCCAGAAGAATTAGAGGTAGGCGGTAGTACACAGGCTACTTGCAGTTTAGTGTCTGGGGACAAACCGATTCAGTTTTCTTGGCATAAAGATAATTTACCTATTCCGTCAACATTGAAAGTTGAACAAAAAAATATGGACTTTTTTACAATATTAGTCATCCAAGACCTAAATTCAATGCATAGTGGAGAATATTCATGCAAAGCCACAAACGACTTTGGTTCCGTAAGCCATAGCGCATCTTTGATCGTTAAAGAGCCTCCGACTTGGGTGTGGCGACCTCAGAATACTTCGGTGTCCGCCGGTGCAGCCGTTCTCCTCCCTTGTTCAGCTAAAGGACAACCAATGCCTAGAATATCTTGGGAATTCACAACAGAAGGAGATAACTGGCGTCATATTCTTTGGAGCCAAACTGAATCGGCAAGTGCCAGCGAGGGTTCTATTTTATCAGATGGTACTGTGTGGTTAAGAGCAGCGACACCAGAACATGAGGGCTGGTATCGTTGCACAGCTCGTGTCCAGCATTCATTTCTTCATCATTCCTTTTTTCTTGACGTTAGGGAACCCCCAAAACTGACAAGAGGGGGTGCGGGAGAGACAAGATGGGTCGTTCGTGGTTCAACCGCAAGACTCACTTGCACTGTACGTGGAGAGCCAGAAATACATGTCCATTGGACTCATGATTCCAGACCACTTACGTTACACGGTTCAGGAGGGGTTGAAATGAGAGATCTGGGAAATGGTGCCATGACTTCGGAAATAACCATAACTCATGCGGGTCCTGAGCATGCAGGTGATTATCGATGCATGGCGAGGAATCTTTATGGAACTGACGAGCTGCTATTCAGGCTTTTTGTAAAAGAGAGACCAAATACGCCAGAAGAGGTGCGAATATCAGAAGTATGGTCGCGTAAAGCTCGGGTTACATGGAGAGTCGCTCGGGGGGCGTTTGTGTCCCACTACTCACTACAATACCGACCGCTCTCGATGGATCTTACGAACGCTCCGCTTAATGCTCCACTCCCCACTTTGATAGACACCTGGGATTCACCAGAAGTTCTCAACATGACACTCGCTAATTCAGACTTACTTCATATTGCAACTGAAGGTCCAAATAGAGCAGGTGCATCACTAGGGGGTCTTTTTCCCGATACGCGTTATGTCTTAAGACTGGCTGCGTATAACGATGTTGGAGCTTCCTCATACACACAGCCCCTTCATTTTACTACACGTGAAGAGGCGCCCGGAGGAGTTCCCCGTGACATCACGGTACGTGCGCTTCGAGCGAGAGAGCTTCAAGTCACGTGGCAGCCACCTCCAAGAGAAACCTGGCATGGTACACTCTTAGGATATACGATTCGGTGGTGGGAAGCTTCTGAAGAAGGCAAGGGCGCCAGTGATACAGGTTCTGAGAGTGGAGCTAGCGCAGACGCCACAAGCACTACCCTCAGGGGTCTTAAATCGGCTACTAGATATGGGATCACCATACGGACGTACAACGCAGCCGGAACTGGTCCATTATCTCCGCCTCATTACCGACATACTCTGGAATCACCTCCAACCGGCGGTCCAGAAGATGTTTCCTGCATATCCAGTAGTTCAACATCACTCCGCGTTTCATGGCGACCCCCAAATATTGATGTTAGAGGGGGCCAAATTACACATTATACATTACAGTACACAAGAAGAGACTCTGATCCATTATTACCAACTCAGGACGCTCATTTGAGAGTACAAGGCGAGGACACAACAATATCTCGTTTAACATCATACGCCGAGTATGAAATACGAGTTCGAGCTCACAACGCTGCTGGAGATGGCCCTCTCTCCGCACCGAAAGTTTGTAAGACCGATGAAGATGTGCCAAGTGCGCCTGGGGGCATGAAATTGATTGCTCTCAGTGAACGGTCACTCAGAGCTTCCTGGCTGCCACCACTGGAGCCAAACGGAAAGCTCACCCATTATTCGCTTTACGTCAAAGATCTATCAGGGACTCAAGAAGCCAACGTGACACGTGTAAACGCATGCACAGAGGGGGACAATATGGTAGAATGCATGAAATCCCTACGCGGCTTACGCGCCGGAAGAACTTATGAGGCCTGGGTCAGAGCTGCTACAAGTGTCGGTGAAGGGCCCCCATCTTCTGTTATTGCTTGTCAAACTAGTGCACTAGCTCCGGCTCGGATATCGTCATTTGGGGGTATTGCAGTTGGAGCAGCTGGTACGTCATTATCTCTTCGTTGTGTTGTTGGAGGAGTTCCTCCACCCGCTAAACGATGGTTAAGAGGAGGCAATCCATTACATCCTCGAAACCCCTTTCATTTGGATGGCGACGCTCTATTAATACGAACATTGGAACTATCACATGCAGACAACTACACATGTGTGGCAGAAAACCCACATGGTTCTGATTCTGTCACTTGGAGACTATATGTTTTACGACCACCCACTCCTCCAAGTTTAGCCTTGCTAGAAGCTAAATCAAGGGAATTGGAATTAGATCTTAGACCAGCTCCTAAAGTACAAGGTCATGCCTTATCAAAAGTATACACATTACATTGGCGACTGTCTGCACCAGAGGGTGAGTGGCGTATGCAATCTGCAAGTCCCGGACCCGTTATACTAACAGGACTCAGATGTGGCTCAGCGTATAGAGCATATGGGTCTGCTGGTGGAGCACCGGGTGCTGAAATATCAATACGAACATCTGGGGGTCCCCCCACCGCACCACCAGATACGAGATGGATTAGAACTAATGCGACTCATGCTAGATTTGATACAAGCACTTGGTCTGATGGTGGCTGCGGACCAGTTACTTTGAAATTAGAATGGGCTGGTTCGGGAACTACTGGTGCTAGAGACGTACCGGTTGGAGGTGAAGTAATTTTAGGAGGTTTGACTCCGGGATCAAGATATCGGGTCGCTGCTCGGGCATCAAATGAAGCTGGATGGACCAGAGAAGTGTATGAATTCACAACTACACCGGCTGAGGGTAGTTACACCGAGGAAGTTGATGCTCCTTTTCCTGCAACTGCCGGCGAGCTAGCTTTGTTGTTAGCTCTTTGCGCAGCGTTATCTCTAGCAGCGCTTACTATATTAGCAATTCTTTTAAGGAGAAAAAGAGGTGAAGGTGCAGTTGCTCGTGTAATCACCAGTGACAAACCAGGTTGTGGTACATACAGACCGCCCCCTCATCCAACACCCAAACTACCTCCTGACCCACCAGATGTGTATGAAATCAGTCCATACGCGACGTTTGCGGGTGGTGACGGTCTCGGTGTGGGTAGCGGTTCGCGGGCGTATACACTACAACTTCGTGCTTTAGCGCGACACGAAGACGATGCTGCTCCACCAGCCCATCCATCGTGTTGTGATGAGGAGACCTGTTTAGATGCCTGTGAGGCCCAACGACGACGCAGAAGACGACGTCACTACTGCCAAGATCATGGTACCATGACTTAA

Protein sequence:

>DPOGS213440-PA
MIAITLVTSVASTDKRPHFTVEPPPRVLWPATRGAHALCRASGHPTPEIHWVTAEGQLITTIPGLRHVLADGRLVVGARRSLVDAGGGGSSSLLRCRASNRAGVTLSRPMLLQAVEDSVLSHLVRQLMPAKVGGAVVLRCELSPQLGLVDVTWLQDGLPLSSGYMGADTRWVAGAGLLLGSDLTQADIQAEYSCLALDVPSPSLKLTTDDSSWLENIEVNSIEARIGDTVLVPCIIRHINKHAIIWQRQEANGVWITATEGVVRGGTLILPNVRAHHAVRYACTAGTETSPRVIVRLVLYEPLTVSVTPNPLMLGTGGSGSFNCSVHGGRGGGVTVSWQHEGRPLHSPQPRLALGPVRLHHVGMYQCTAFDHSDSAVAGAELVIADRPPRLVSTFSEAITRIGQTIAFRCAATGIPPPRILWTLDNRPLPAAPEKYNVNTRAETVPGGEEGTIVSTLSITIRSADEGGRYGCNASNSGGFQAHHARLNVFGPPNIRYLPPMHIKTDSDVTLYCPYSGYPLKEVVWRREDGSLLDGPIESNDEKGMLRLTSVRNAVSGSYTCEVRAESGELARRTVRIEVHKPPKIAPFQFPEELEVGGSTQATCSLVSGDKPIQFSWHKDNLPIPSTLKVEQKNMDFFTILVIQDLNSMHSGEYSCKATNDFGSVSHSASLIVKEPPTWVWRPQNTSVSAGAAVLLPCSAKGQPMPRISWEFTTEGDNWRHILWSQTESASASEGSILSDGTVWLRAATPEHEGWYRCTARVQHSFLHHSFFLDVREPPKLTRGGAGETRWVVRGSTARLTCTVRGEPEIHVHWTHDSRPLTLHGSGGVEMRDLGNGAMTSEITITHAGPEHAGDYRCMARNLYGTDELLFRLFVKERPNTPEEVRISEVWSRKARVTWRVARGAFVSHYSLQYRPLSMDLTNAPLNAPLPTLIDTWDSPEVLNMTLANSDLLHIATEGPNRAGASLGGLFPDTRYVLRLAAYNDVGASSYTQPLHFTTREEAPGGVPRDITVRALRARELQVTWQPPPRETWHGTLLGYTIRWWEASEEGKGASDTGSESGASADATSTTLRGLKSATRYGITIRTYNAAGTGPLSPPHYRHTLESPPTGGPEDVSCISSSSTSLRVSWRPPNIDVRGGQITHYTLQYTRRDSDPLLPTQDAHLRVQGEDTTISRLTSYAEYEIRVRAHNAAGDGPLSAPKVCKTDEDVPSAPGGMKLIALSERSLRASWLPPLEPNGKLTHYSLYVKDLSGTQEANVTRVNACTEGDNMVECMKSLRGLRAGRTYEAWVRAATSVGEGPPSSVIACQTSALAPARISSFGGIAVGAAGTSLSLRCVVGGVPPPAKRWLRGGNPLHPRNPFHLDGDALLIRTLELSHADNYTCVAENPHGSDSVTWRLYVLRPPTPPSLALLEAKSRELELDLRPAPKVQGHALSKVYTLHWRLSAPEGEWRMQSASPGPVILTGLRCGSAYRAYGSAGGAPGAEISIRTSGGPPTAPPDTRWIRTNATHARFDTSTWSDGGCGPVTLKLEWAGSGTTGARDVPVGGEVILGGLTPGSRYRVAARASNEAGWTREVYEFTTTPAEGSYTEEVDAPFPATAGELALLLALCAALSLAALTILAILLRRKRGEGAVARVITSDKPGCGTYRPPPHPTPKLPPDPPDVYEISPYATFAGGDGLGVGSGSRAYTLQLRALARHEDDAAPPAHPSCCDEETCLDACEAQRRRRRRRHYCQDHGTMT-