Monarch geneset OGS2.0

DPOGS202953
TranscriptDPOGS202953-TA4980 bp
ProteinDPOGS202953-PA1659 aa
Genomic positionDPSCF300195 + 226943-259688
RNAseq coverage13x (Rank: top 82%)
Annotation
HeliconiusHMEL0038490.068.07% 
BombyxBGIBMGA005747-TA0.038.24% 
DrosophilaDscam3-PB0.036.87% 
EBI UniRef50UniRef50_A8JR350.036.70%Dscam3, isoform C n=9 Tax=Drosophila RepID=A8JR35_DROME
NCBI RefSeqXP_396307.30.038.57%PREDICTED: similar to CG31190-PB, isoform B, partial [Apis mellifera]
NCBI nr blastpgi|3407122100.039.19%PREDICTED: Down syndrome cell adhesion molecule-like protein CG42256-like [Bombus terrestris]
NCBI nr blastxgi|1892421220.039.42%PREDICTED: similar to CG31190 CG31190-PC [Tribolium castaneum]
Group
Gene OntologyGO:00055152e-13protein binding
KEGG pathway 
InterPro domain[838-947] IPR0089573.7e-22Fibronectin type III domain
[841-940] IPR0137832.2e-20Immunoglobulin-like fold
[110-199] IPR0130981.3e-13Immunoglobulin I-set
[1042-1126] IPR0039612e-13Fibronectin, type III
[298-372] IPR0035982.2e-12Immunoglobulin subtype 2
[116-200] IPR0035996.4e-11Immunoglobulin subtype
Orthology groupMCL10022 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202953-TA
ATGGACGTCCCATGGGAGGTGCAAGTTCAACCTACTCACGGGGTGTCAGGAGGCGCTGCCTTGATGACGTGCGCACCCTCGGCTTCCGTAAGAGCACATGTCACCGTGACGCGATGGATCAAAGATGGAGCTGTTCTTCCACCGGTACCTGAAGTGGGTGGTGGTTATATTATCGGTGGCAGTCGCGGCGATATTCTTGTTGTCCGAGAAGCCAGATCTGATGATGCTAGCTCCTACGCCTGCGAAGCTCAACACGTTTTGACGGGAGAGAAGCGTCGTAGTCCACCTGCTATGATTGCAGTTTCACATCCGACTGGGAGTATGGCACCACGAATTTTAACGATATCTGAAGATGAAACTGTTCCGCAAGGTGGGGACATCAGGCTGGTGTGTTGCGCAATCGGCAGCCCTCCACCCACTTACAGTTGGTTTCGCCACGCAAACGGCCGCCTCAGCCCCGTAGGAAATAGCATCAGAATATCAGTGTCCGAACAAGTACTAATTATAAGAAGAGCTCAGTTATCTGATGGTGGAATTTGGACTTGTCGAGCCCATAACCAATACGGTGAACAGAGACGTGACGCAAGACTTAAAGTGCGGTCAAGACTTGTTGTCAGTGTTCATCCTCAATTACATATCGCTAATTCCGGCAGTTCAGTAACCTTTAACTGTTCGGTTGACGGCGGCGAGGCTCGCGTGCGTTGGTTGCACGACGGTGTCCCTGTTGGCGGCGGTGAGCGGGTATTGCGCATCCACGGTGTGGTCCGAGCTCATAGAGGAATGTACCAGTGTTTTGCTGAACGAGACCTTGATAGTGCTCAAGCCGCTGCTGAGCTCAGGCTTGGAGATACAGCGCCAGAACTGCACTATACATTCATCGAGCAAGCATTACATCCGGGGCCGGCGCTTGCGCTCCACTGCTCTGCCTCCGGCTCTCCCTCGCCACGCTTCACTTGGCTGCTCGACATGCAGGTTGTAGAGGAACATAATACACAACAGAGGAGCATTTCCCAGTTTATGAGTCCCAACGGAGACGTAGTGAGCTACCTTAATATTACGTCAGTCCGTCCTGAAGACGGTGGGAGGTACACATGTCGCGCTCATAATACAAGAGGAGCTGTTGAACATTCTACAAGACTTAACGTTTATGGTCCGCCCTCCATCAGAAACATTGGTCCTGTCCGCGTAGTTGCAGGTGTCAACACAACGATCTACTGTCCTTACTCCGGATTTCCAATCAGTGATATAAGATGGCAGCGTGGTGGTATAGATGTGGGTTCAACTGGAAGAGCGTTGTCAGGAAGCGGTGGCGAACTTTTGTTATGGCCTGCTGAACCAGCTGATGCCGGAGTGTACTCTTGCAGAGTAAATGCACCTTCAGGACAATACGCACAAAAGGATGTCCAGATTTTTGTTCGAAATCCTCCAAAAATAGCTGGATTTCAATTTCCTGCGGATTTGGTGGAAGGGAGTTCAATACAAGTTTTATGTGGAATAACTTCTGGAGATAAACCAGTTTATTTCTCTTGGCTTAAAGATGGACAACACATTCCGTCCAATTTACAGGTACAAGAGAAATCCTTAGATGAATTCAGCTTTTTGATATTTTCCCACGTCACATCGAAACATAGCGGGGATTACACATGTGTCGCCAGTAATACTGCAGCTGAAGTTAACCACACTGCGAGACTTGCCGTTAAGGGTGTAGGTAGCAGTGCAGATTTTCGCCCACTTAACCGCGTCGAGCTGCCCTTATCCGTTTTGACCAACGGCTCATTAAGTGGGGTTGCGACCCGCGGACACGAGGGACGCTATATGTGTCGTGCAGACAACGCCGTAGGACGCGGCCTGTCTAAAATTGTTACCGTATCTGTTAACGAACCAGCGCATTTTGAGTTCTCGTCTCGCAACATGTCGGTGCGACGTACGTCCAGCGCGTCGCTAACATGTTCCGCAAGAGGAGATGCTCCTATACAACTGCACTGGACACACAATATGCAGAGATTAGATCTTTCTACATATAGAGTGTCAGTATCGGAGAAGCGATCCGAGAACGGGGCAAGTTCAACGCTGAGCATAACCCACGCTCAACGCCGTGACTCTGGCGTGTACCGATGTCGAGCTGAGAACGCGTATGGTAGAGATGAACTGCTCATATACTTGGCTGTACAGGAGTTTCCGGAATCTCCTCGCGGTCTTACTATATTGAAACGTGAAGGTCGTGCGACACGTCTTTCATGGCATCGCGGGTACGACGGTAACGCACGACTCCGTGCATACCGCCTTCAGTATCGCGTTGTGGGTGACGCACGACGCGCTGCTGACTGGACAGATGCCCCTACTAGAGAACTGTCTCCCGATTTACTTATACAAAGACAAGAGAGCTCGGACCCAAACTCGGAAGTAATTCTAGAGTATCGAGTGGATGGTCTTCGTCCTGCAACAGCTTATGCGCTAAGGCTGGCTGCAGTAAATGATATAGGAGATAGCGAGTATTCCGAATCAGTCATAGTCCAAACATTAGAGGAAGCCCCATCAGAAGCCCCTAGACATGTCGAAGTGCAAGCAGATGCTCCTGGGGAACTGCTTATCAAGTGGCAGCCTCCACCCCAAGAAACATGGAACGGCGAGTTGCTTGGCTACGCGGTATGGTGGCGGACGGACAACGCATCACTAGCACTAACAGTCCCTGGATGGGCTGCAAACAACGCACGAATTGCTGCATTGAGAGCACACTCGCGATATGAACTACGGGTGCGTGCGTTTAACGCTGTAGGCGCGGGACCAGCGTGTGCACCACTTGCTACCACAACACTTGAAGATGTTCCAGAAGCGCCACCTCAGCATATCCGCTGTGAACCGTTGTCGGCACAATCGTTTCGTATTTGGTGGGAACCGCCACCACCACAGACCAGAGGGGGCTTGCTTCTTGGATACGAAATTATGTACCAGGAGGTAGGATCTTTGGACAGTGTATGGGAGCGTCGTCGCGCTGGGGGTAGCGAGGCTCAAGCCGGGGGTCTCCGAGCAGGAAACTATTCTGTGCGAGTGAGCGCGCGCACTGCCGCTGGCACCGGACCTCCCGCACTGGCGTACTGTCATACACTCGATGACGTGCCAGGACCTCCAGCAGATATAAAAGCTGTACCAAATTCAGAAGATTCAGTCATTATAAGCTGGCTTGCTCCAGCGCAGAAGAACGGCAAAATTAGACACTATACAGTATATAATCGACCACAGAGAACAGGCCAGCACTCCCACGTAATGGTGCCGCAAACGGATGAGGAACAAATGTTAGAAATACGAGGGCTGCATGAACACCAGCTATATGACTTTTGGGTGACCGCTGCTACATCTGCCGGAGAGGGAGAAATGAGCGCTATAGTTGCCAAGAAACCCAGCACTAGAGCGGGTGCTAAAATATGGTCATTTCCAAGAAGAATAGTACAAAAAGAAGGCTCTAAAGTGGTCATTCCTTGCGGAACTGCCGGTAGTCCCACCCCCGTGCGAGTATGGAGCCGGCGTCGACCACCGACTATTTTAGCCGACCCTCGAATACGAATCGAAGCACATCGACTTGTTATACCTAAAATGGAGCCTTCAATGAGTGACAACTATACGTGCGTGGTGCGTAACCCGTGGGGCGAGGAACGTGGACTGTGGGAAGTGCGCGTATTATCGCCACCCTCTGCACCCCGTCTGAGACTCACTGGTACTGCACCTGCCGCAATTATAATCGCCTGGGACTCGGGGGATGCAGATAGCTTCGGTCTAGAATACGCATTAGGCGATGGTCCGTGGCAGTCAGTGTTTGTGTGGGGTAGCGTTCGGTCATACGCCCTCCGGCGGCTGGTGTGTGGTGGAAAGTACAGACTGAGACTCACAGCAAGAAACGCTGCTGGTGTTTCGCACCCTAGTGAAGTGCTTGCCGCCGTTGCAAGCGGAGGAGTATCAAAACCAGCGTTAGAAAAACATCTAATCACGACCAACAGTAGCTGCGTTCGACTGAATTTCCTGACCTGGGATAGTAATAGTTGTCCGCTCATACACTTTATGGTCTCCATTCGATCTTTTGAGGAATCTTTATGGAGATCGGAGACCATAGGACTGGATCCTCAACCAATATCGCTATGTAGTCTACAATCAGCAAAGTGGTACCATCTAAAGGTTATCGCTCTCAGCACTGCGGGCTCTACAACTGCAACATATTATTTTTCAACACTAACCGAGGGTGGAGAGCGTATCCCAGCACCAGCTCAATTCCCACCTGGTGGTGAGGAGGACGTGTCATCGGGCAGTGGTGTAGTGCTGCTAGCCATCGCAGCTGCTCTGTTTATTGCCTTATTATTGCTTGGTGTGTTCGCCTATAAAAGAAGTGCAACAAAACCCTGCTTTAGAAAAGGCTATGAGCAAAGTGGTGTATCAGAAGAAGACAAATCGGTAGAAAAGCGCGACAATAGAAAGAATTGTCAGCAAGTATATACATCTTCCCCAATTAAAAGTGCCAATAAAAAAGAGCAACAAGAAATGTACGAGATCAGTCCATATGCGACATTCAGTATGTCGGGTGGCAGTACGGGTGAAGCTGCTGCCGGAACTCTACGGACGTTCGGTCGCGCTGAACCTGCACCACTTAGCGCCGCACCACCCCGCAACCACACGCATCGCTCACCAGCTCATTCTGATGAGTACACATTATCTCGTGCAATGACATTGATGGTGCGACGTACAGAATCGGACTCCGATTCGAGTGGGTCGCCGTGTGCCGAGTGCACCTCTAGCGTTTCGTATAGAATGCCACTTGCACCCAACAAAGCAACGGAAGAAGTGTTCCGACCTGTCACTGACTCTAGCGCGGAATCTTGTGCCGGTTCTGGGCCGAGGGACAAGGGTCGGCGGCGACCGAGGCGACATGCGCCTGCTAATAGATACCAACAGCGGCAGGAACAGGAGAGACGAGATTTTACTATACATGTTTAA

Protein sequence:

>DPOGS202953-PA
MDVPWEVQVQPTHGVSGGAALMTCAPSASVRAHVTVTRWIKDGAVLPPVPEVGGGYIIGGSRGDILVVREARSDDASSYACEAQHVLTGEKRRSPPAMIAVSHPTGSMAPRILTISEDETVPQGGDIRLVCCAIGSPPPTYSWFRHANGRLSPVGNSIRISVSEQVLIIRRAQLSDGGIWTCRAHNQYGEQRRDARLKVRSRLVVSVHPQLHIANSGSSVTFNCSVDGGEARVRWLHDGVPVGGGERVLRIHGVVRAHRGMYQCFAERDLDSAQAAAELRLGDTAPELHYTFIEQALHPGPALALHCSASGSPSPRFTWLLDMQVVEEHNTQQRSISQFMSPNGDVVSYLNITSVRPEDGGRYTCRAHNTRGAVEHSTRLNVYGPPSIRNIGPVRVVAGVNTTIYCPYSGFPISDIRWQRGGIDVGSTGRALSGSGGELLLWPAEPADAGVYSCRVNAPSGQYAQKDVQIFVRNPPKIAGFQFPADLVEGSSIQVLCGITSGDKPVYFSWLKDGQHIPSNLQVQEKSLDEFSFLIFSHVTSKHSGDYTCVASNTAAEVNHTARLAVKGVGSSADFRPLNRVELPLSVLTNGSLSGVATRGHEGRYMCRADNAVGRGLSKIVTVSVNEPAHFEFSSRNMSVRRTSSASLTCSARGDAPIQLHWTHNMQRLDLSTYRVSVSEKRSENGASSTLSITHAQRRDSGVYRCRAENAYGRDELLIYLAVQEFPESPRGLTILKREGRATRLSWHRGYDGNARLRAYRLQYRVVGDARRAADWTDAPTRELSPDLLIQRQESSDPNSEVILEYRVDGLRPATAYALRLAAVNDIGDSEYSESVIVQTLEEAPSEAPRHVEVQADAPGELLIKWQPPPQETWNGELLGYAVWWRTDNASLALTVPGWAANNARIAALRAHSRYELRVRAFNAVGAGPACAPLATTTLEDVPEAPPQHIRCEPLSAQSFRIWWEPPPPQTRGGLLLGYEIMYQEVGSLDSVWERRRAGGSEAQAGGLRAGNYSVRVSARTAAGTGPPALAYCHTLDDVPGPPADIKAVPNSEDSVIISWLAPAQKNGKIRHYTVYNRPQRTGQHSHVMVPQTDEEQMLEIRGLHEHQLYDFWVTAATSAGEGEMSAIVAKKPSTRAGAKIWSFPRRIVQKEGSKVVIPCGTAGSPTPVRVWSRRRPPTILADPRIRIEAHRLVIPKMEPSMSDNYTCVVRNPWGEERGLWEVRVLSPPSAPRLRLTGTAPAAIIIAWDSGDADSFGLEYALGDGPWQSVFVWGSVRSYALRRLVCGGKYRLRLTARNAAGVSHPSEVLAAVASGGVSKPALEKHLITTNSSCVRLNFLTWDSNSCPLIHFMVSIRSFEESLWRSETIGLDPQPISLCSLQSAKWYHLKVIALSTAGSTTATYYFSTLTEGGERIPAPAQFPPGGEEDVSSGSGVVLLAIAAALFIALLLLGVFAYKRSATKPCFRKGYEQSGVSEEDKSVEKRDNRKNCQQVYTSSPIKSANKKEQQEMYEISPYATFSMSGGSTGEAAAGTLRTFGRAEPAPLSAAPPRNHTHRSPAHSDEYTLSRAMTLMVRRTESDSDSSGSPCAECTSSVSYRMPLAPNKATEEVFRPVTDSSAESCAGSGPRDKGRRRPRRHAPANRYQQRQEQERRDFTIHV-