Monarch geneset OGS2.0

DPOGS205973
TranscriptDPOGS205973-TA4434 bp
ProteinDPOGS205973-PA1477 aa
Genomic positionDPSCF300164 - 186729-196822
RNAseq coverage387x (Rank: top 31%)
Annotation
HeliconiusHMEL0075460.086.49% 
BombyxBGIBMGA009402-TA0.083.32% 
DrosophilaCG31004-PD0.053.87% 
EBI UniRef50UniRef50_E0VDP60.060.76%Sushi domain containing protein, putative n=4 Tax=Pancrustacea RepID=E0VDP6_PEDHC
NCBI RefSeqXP_001650886.10.062.53%hypothetical protein AaeL_AAEL005432 [Aedes aegypti]
NCBI nr blastpgi|1571099350.062.53%hypothetical protein AaeL_AAEL005432 [Aedes aegypti]
NCBI nr blastxgi|1571099350.059.41%hypothetical protein AaeL_AAEL005432 [Aedes aegypti]
Group
Gene OntologyGO:00071605.6e-39cell-matrix adhesion
KEGG pathway 
InterPro domain[640-791] IPR0055333.4e-62AMOP
[244-402] IPR0038865.6e-39Nidogen, extracellular domain
[808-983] IPR0018464.5e-16von Willebrand factor, type D domain
[1101-1161] IPR0160601.3e-11Complement control module
[423-496] IPR0147562e-11Immunoglobulin E-set
[1110-1154] IPR0004369.5e-06Sushi/SCR/CCP
Orthology groupMCL13879 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205973-TA
ATGGGTGTTAAAGTTAAATTGGTAGTATTTGTAACGTTAGCTTTAAGTGTATACGGACAAGATGTTGTAAGTGATGAGAGGACAGTTAGTGATAACGTTGATAAGGAAATTTCGGATAATATCACGGATTCGATACAAAATGCAAAAGTAGAGGATCCTGTAGAAATTATAACTGGTAGTGATATCGAAATAAAGAGTGGTAAATATCAATTTGCGAACGATGGACTTACAGGCGACGAGCCTGTTGATTTGGAAGCGGTAGACTTAAGTCCAGGCGAATCGCAGAACGAGCGACAGGTTTTGGGACCGGCGACCACAACATCCATAACTAATACGGAATATGCACACATCGATGGCAGAGTCCTTCCGTCAACTAGTTATCAAAACAATGGCCAACCATATGTCATCACACAGCAAAGACTGGCTCAAATCAGATCGAATTTTATGTACTGGTTCTACGACCAAGGTGGTAACGAGAATATTGGAGACTATCAGAGAGATATCCACACGTCAACACCTCAAATTCACAAAAATTTCAACTTCCAACTCCCATTCTTTGGATTCAGGTTTAATTACACAAGACTATCGATGAATGGCTACATATACTTCAGCGATCCTCCAGACCACTACACCTATCCTCTGTCTTTCCCCATCAGAGATTGGCCAGCTATGAATGACCCTTCATTTATAGGTATCTTCTTCAGTAAATGTCGTATAGGAAGCCAGAGGCCCGAAGACCCTGATCAGAGAAGACCTGGAGTTTACTTTAGAATGGACAGAGACCTGCAGACTCGAACAGATCAGTTAGGTGTGGAGATGAGGGAGAGGATAACTTGGGATATAAGGCAGGGCGTCATTGGTTCAGAAAGTTTCTTCCCAAAACACGCTATAACTATAACATGGAAAAACATGTCCTTCGCTGGAGGAATAGACAACTCACTATTTGTTACGAACACCTTTCAAATGGTATTAGCTACTGACGAGGTATTCACCTACGCAATATTCAATTATCTCGAAATCAACTGGAGTTCTCACACTGAAGCTGGAGGTGATACGACTACTGGTGAAGGTGGTGTACCAGCTTATATCGGATTCAATGCGGGGAATGGCACTAGAAGTTACGAGTATAAACCCTACTCACAAGCCTCCGTACTTAGAGACTTGACTGGAAGAGGCTGGGCTAATGGATTTCCAGGAAGACATATCTTTAGAATTGATGAGAATATCCTCATGGGTACTTGTAATAAGGATATAGATGGTGCAAATCTACCACTAATGTTCGCCCCTGAAAGTGGTAACATGCTTGGTGGTACCGTCGTCAATATTACTGGTCCCTGCTTCCAACCAACCGATCGGATCTCTTGTCGCTTCGACACTGAGTCAGTCGTTGGAGCGGTTGTTGACTCTAACAGAGCTATTTGTGTTCAACCCCGATTTTATCACAACGGTTACGCCAGATTCGAAATTGCAATAAATAACGAACCCTACAAATGGAAAGGAAAATTCTTTGTTGAAACCCCTGGAACTGCTACCGAAAAGATATTCTTCCCTGATAACTCTATTCACGAAAGATATCCACCAGAAATACGAATTAGATGGGACCGGTTCAATTTAACGACCAACCTGAACGTCCAACTACAAATTAGTTTATGGGGCTACAAGGAAGTCACCATCAGACCTCAAATGGAATTCATTGACATGATTGAGACTGGTGTAGCGAATACAGGGGAATACGTCATCAATCCCCAAAACTTCCGCAACAGGGACAACATCATGCACAACGACATGCAGTTTGGATTCCTTCAAATTAATTTAACAACTCCAGAAATATACAAGGATGTCGCTATATCACCTGTATTATGGAGTCGTCCGATTCCCCTGGGCTGGTATTTTGCGCCGCAATGGGAGCGAATGTACGGACAGCGCTGGTCTCAATCCATGTGCAACAACTGGCTGAGAACTGACAGATTCCTCAAAAATTATGCAGCACAAGTATGGGTGTGCCCATGTACACTAGAACATGCCCTCCTAGATAAGGGAAGGTTCATGCCTGACCTGGACTGCGACAGGGACATCAACCCTACATGTAGATATCATTGGGGAGCCATTCACTGTGTTAGAAGCGGTGGACCCAGTTCGGAAGGATCGGGTCAACAATGCTGTTACGACAAAAACGGTTTCCTTATGCTTTCCTACGATCAGATGTGGGGATCCAGGCCTCGTCGGTCCCATGACTTCGGGTTTACTCCTTACAATGAAGCTAATAAAGTACCATCTTTGTCTCATTGGTTCCACGATATGATACCCTTCTATCAATGCTGTATGTGGCAAGAGGAACAGGCAGTCGGCTGCGAAACGTTTAGGTTCGAGCGTCGACCATCTCAAGACTGTGTTGCGTACCAATCACCAGGGGTAGCTGGTATATTTGGAGACCCACACATTGTGACCTTCGACGGTCTTCAGTATACATTTAACGGCAAAGGTGAGTATGTACTGGTGAGAGTTGACCGTCCACAACTGAAGCTGGATGTTCAAGGTCGCTTCGAACAAGTGCCCAGAAATATTTATGGACCAGTGAACGCAACGATGCTTACCTCCATTGTGGCCGCTTCTAACAACTCCGTACCTATAGAGGTAAGACTAAGACCGCAGCATGCTCAATGGAGATACCGTCTAGATGTGTTCGCTGACAACAAAAGGATTTATTTCGACAGAAGCGCTCTAAGAGTTCAATATTTCCCAGGTGTGACCGTATATCAGCCCATGTATATTCTTAACCAGTCGGAAATCGTCATCATGTTTTCGTCGGGTGCTGGAATAGAAGTCATTGAAAACAAAGGCTTCATGAGCGCTAGGGTCTACTTGCCTTGGACCTATATGAATCAAACACGAGGTCTTTTTGGAAACTGGTCTTTGGACATCAACGACGACTTCTTACGGCCAGATAGCACTATGGCTGCTGTCGACCTGAACAACTTCCAATCCGCTCATAGAGACTTCGCTCAACACTGGCAACTAACAGATAGGGAACAACCAAATATAGGGGTGGCGTTGTTTGTCAGAGAATACGGAAGAACAGCGGCGTATTACAACGATAATCAATTTATACCGAACTTTATACGAGAACCGGTGAATTTCTTGCCATCGAACCGATCTCAGGACGTGATTAGAGCGACCGAGATCTGTCAAGATTCCTATCAATGTCGTTACGACTACGGCATGACCTTGAATAGAGATATGGCTGAATTCACAAAGAACTATTTATCTTCTATTACAAATATTAAAGAGCAGAATGCCCGTCGCGTGATCAGTTGCGGGGTTTTGGAGACACCACGCTTTGGACGTAAAAGCAACTTTTTCTTCACTCCCGGCACGAGAGTTAACTTCGAATGTAATCAAGACTTTATTCTAATTGGTGACAAGAGACGTGTGTGCGAGGACAACGGGAGATGGAATTTACCAGACTACGGCTATACTGAATGCTTACGTAATCAGGAATACTCACAAAGAGCGTTGTTCTTGACTTGGGGCATCATAGTGGCGATTATTGTACCATTAGCTCTCCTCATTTGTCTGCTTTGGTTCTGGTGTTATTTCAAACCAAAGTCCGAAGGCAAAGATACATTCCGATTCGAAGACATACCGCGGTCTAAATCGGCTTCGCGTCTCAACCTTAGGTCAACATCGATGGGAAATATCACTGATACTATGAGATCCTCAACCATGCATAGCCAGGATACTGATAAACCAAAACTCCCTGACACTCCCACAGAAATAACCCCCATGACAGCCAATGTAACTAGAACTGCCCCGCTCCCTCCTGAAACCTTAGACGGTGAAAGTTCAGGAATAGGTTATACGGATTCTAATAAAAGCGACAGATCTGACAAATCCAAAAAACGACGCGCTTATGACAAAACTTACAGAACCAATGAACCCTTGCCCAATGCCCCTGAAGTAGAATTTCCTGAAAAGCAATGGGACTTATCGGAGGAAGACCTGTTATCCCTAACAACTCCTTCGGAAACAGAATCTAATCGAGATTCTACATTAACAAGACCTGCTAAAGACATCAAATATTTGAGTCGCCCTCGCCAAACGGGTCGTCATGCTTTGCAAACTGACTCTGGTTACTCAACTAAAGACGGTTCAGAAGATCCATACTCTCCCAAATATGACGGTCAATATAGTCCAATACCTTCGGCATATTCTCCCACTTACTCTGAAATTTACTCTCCACCTATCAGTCCCACATCTGATAACAGTCCCAGGAATACTTACAATAATCCTGGCTTGCCGGAACCTCCAAAGAGCGCCCCCGCAACTGAAATAAAAACATTCACAATGCCTCCAAAGAGAGGAAAATCTATCGAGACACTCATTGACCCTCCTACGGCAGAAATGCCATCATTCAGTCGATCTACTATGGTATAA

Protein sequence:

>DPOGS205973-PA
MGVKVKLVVFVTLALSVYGQDVVSDERTVSDNVDKEISDNITDSIQNAKVEDPVEIITGSDIEIKSGKYQFANDGLTGDEPVDLEAVDLSPGESQNERQVLGPATTTSITNTEYAHIDGRVLPSTSYQNNGQPYVITQQRLAQIRSNFMYWFYDQGGNENIGDYQRDIHTSTPQIHKNFNFQLPFFGFRFNYTRLSMNGYIYFSDPPDHYTYPLSFPIRDWPAMNDPSFIGIFFSKCRIGSQRPEDPDQRRPGVYFRMDRDLQTRTDQLGVEMRERITWDIRQGVIGSESFFPKHAITITWKNMSFAGGIDNSLFVTNTFQMVLATDEVFTYAIFNYLEINWSSHTEAGGDTTTGEGGVPAYIGFNAGNGTRSYEYKPYSQASVLRDLTGRGWANGFPGRHIFRIDENILMGTCNKDIDGANLPLMFAPESGNMLGGTVVNITGPCFQPTDRISCRFDTESVVGAVVDSNRAICVQPRFYHNGYARFEIAINNEPYKWKGKFFVETPGTATEKIFFPDNSIHERYPPEIRIRWDRFNLTTNLNVQLQISLWGYKEVTIRPQMEFIDMIETGVANTGEYVINPQNFRNRDNIMHNDMQFGFLQINLTTPEIYKDVAISPVLWSRPIPLGWYFAPQWERMYGQRWSQSMCNNWLRTDRFLKNYAAQVWVCPCTLEHALLDKGRFMPDLDCDRDINPTCRYHWGAIHCVRSGGPSSEGSGQQCCYDKNGFLMLSYDQMWGSRPRRSHDFGFTPYNEANKVPSLSHWFHDMIPFYQCCMWQEEQAVGCETFRFERRPSQDCVAYQSPGVAGIFGDPHIVTFDGLQYTFNGKGEYVLVRVDRPQLKLDVQGRFEQVPRNIYGPVNATMLTSIVAASNNSVPIEVRLRPQHAQWRYRLDVFADNKRIYFDRSALRVQYFPGVTVYQPMYILNQSEIVIMFSSGAGIEVIENKGFMSARVYLPWTYMNQTRGLFGNWSLDINDDFLRPDSTMAAVDLNNFQSAHRDFAQHWQLTDREQPNIGVALFVREYGRTAAYYNDNQFIPNFIREPVNFLPSNRSQDVIRATEICQDSYQCRYDYGMTLNRDMAEFTKNYLSSITNIKEQNARRVISCGVLETPRFGRKSNFFFTPGTRVNFECNQDFILIGDKRRVCEDNGRWNLPDYGYTECLRNQEYSQRALFLTWGIIVAIIVPLALLICLLWFWCYFKPKSEGKDTFRFEDIPRSKSASRLNLRSTSMGNITDTMRSSTMHSQDTDKPKLPDTPTEITPMTANVTRTAPLPPETLDGESSGIGYTDSNKSDRSDKSKKRRAYDKTYRTNEPLPNAPEVEFPEKQWDLSEEDLLSLTTPSETESNRDSTLTRPAKDIKYLSRPRQTGRHALQTDSGYSTKDGSEDPYSPKYDGQYSPIPSAYSPTYSEIYSPPISPTSDNSPRNTYNNPGLPEPPKSAPATEIKTFTMPPKRGKSIETLIDPPTAEMPSFSRSTMV-