Monarch geneset OGS2.0

DPOGS212139
Transcript: DPOGS212139-TA (5202 bp)
Protein: DPOGS212139-PA (1733 aa)
Genomic position: DPSCF300038 + 146538-159177
RNAseq coverage: 1308x (Rank: top 10%)
Annotation

Source          Hit               E-value  Identity  Description
Heliconius      HMEL007733        0.0      65.16%
Bombyx          BGIBMGA006589-TA  0.0      64.32%
Drosophila      CLIP-190-PA       3e-94    27.27%
EBI UniRef50    UniRef50_G6DL51   0.0      98.23%    Putative restin n=2 Tax=Coelomata RepID=G6DL51_DANPL
NCBI RefSeq     XP_967018.2       7e-147   33.95%    PREDICTED: similar to restin (Reed-Steinberg cell-expressed intermediate filament-associated protein) [Tribolium castaneum]
NCBI nr blastp  gi|345494045      4e-150   31.44%    PREDICTED: CAP-Gly domain-containing linker protein 1-like [Nasonia vitripennis]
NCBI nr blastx  gi|345494045      0.0      31.12%    PREDICTED: CAP-Gly domain-containing linker protein 1-like [Nasonia vitripennis]
Group:
KEGG pathway:
InterPro domain: [46-145] IPR000938, 1.8e-32, Cytoskeleton-associated protein, Gly-rich domain
Orthology group: MCL10394 (Multiple-copy universal gene)

Nucleotide sequence:

>DPOGS212139-TA
ATGCTTGTGCACGTCGGATTTGGAACAGATTTAATCGAAGCTGAAGAAGACGAGGTTAGCAGCAGCCTTCCGGAGAGACGACGGTCCTATCGCAGAGTATCCACTAGCAGCAGATCGAGTGTTACTTCGATGGATACGCTTTGGGAGAAATATCCGCGTCGTCTCAGCGAAGCAGGCCTTAGACGGTCCTCAGATCACAGTGTCGTATTGACTGAAGACACTGACAGTTTTATAATCGGGGAACGTGTTTGGGTCGGCGGAACTAAACCGGGGTTGATAGCTTATATCGGTGAGACACAGTTCGCTCCCGGCGAATGGGCGGGCATTGTTCTCGACGATCCTATAGGTAAAAACGACGGATCCGTCGCTGGAGTCCGTTACTTCCAGTGTCCTGAAAAACGTGGAGTTTTCTCTCGACTCACAAGATTGACCAGGGAACCCTTGGCATCCCACGGACCGCACGATGCCTCTCCCATATCTGATGCTGGGAGCGTATTTGAACGACCGCCCTCTGGATCTGCTAGACCAAGACGTGCACATTCTCCAAACGGCAGTGTACGAAGCATGGTCAGCAGTAAGATGAACGCTTCTATTTCAACAACTACCAATGGAGACCTCCGTCTTGGCGATCGGGTCATAGTATCTAGTAGCCGGGGAAGCAAAGCCGGTACCTTACGTTTTGTAGGACCTACCGAATTTGCTTCCGGTGTATGGGGAGGCGTTGAACTCGATGATCCTTTAGGGAAGAACGATGGTTCCGTCGATGGAAAGAGGTACTTCGAGTGCTCGCCGCGTTTCGGTCTCTTCGCCCCGATATCGAAGGTGTCCCGTTCTCCATCAAACCGCAAGCCAGGTTCCTGTGCGATTCACAGCAACGGTCGCGCGACACCGCTCAGACGCTCCAACTCTCGCGAGTCGCTCACATCTCTCGGGACATCAATCGCGTCTTCCCGCGCGGGGGTGAGGCTCGGGGTGACGTCACTGGGCGCTCAGGAATTATTACGAGAGAAGCAAATGCATTTGGAAAAGTTAATGCGGGAAAGAGAGTTGGAAAGGGCAGAGGTCGCAAAAGTATCGATACAAGCCGATCGATCCGAGACTGCATTAGCACAGATTAAAAAGGAAGCGACGCAGATAAACAACGAGAATGCGAAACTCAAAGCTGAACTGGACAAACTTAACAAAATGTTGGAGGATGAACGGCAGAAGGTCGAAGATCTCATGTTTAGGAACGAAGAAGAAAATATCAACAAGGAAGACTATAATATTGAAAAATTGGAACGTGAAAAACTTATACGAGATCTCGAAGCGGAGGTGGCGCTTCAAGCGGCTCGAGCTGAAACTACGGCAGCGACTTTACGGGTATTAGAAGATCAAAGAAGCGCTGAAATGAGCGCTTTAGCTGAACAACATAAAGAAGAATTGGTAGCTGCACAAAAACTGCAAAAACTCCTGGATGAAGCATATGCATTGCTTCAAGAAAAAGAATCTGAAAAGGATTCTCTTGGTAAGAGCTTGTCGGAAGAGCTTTCTAAAGTCAAGATAGAATCAGAGAGAGCTTTGAACGAAGCTAGAACTAAACTTGCCATAGCTCAGACTGAGTTTGAAACACAAGTCTCAGTTATGAACGCCAAGCTGCAGTTGGCTGAATCAAAACTGGAGACGGAGAAACAAAACGTGGAGAGGTTAAATAAAGACAGTAGCGAGATTGTTATAGACTTGAATAATAAACTGATCGCTCTACAAGCTATGGTCGACGACAAAACATTGGAATTCAATAAAGTAACTGGCGCGAGCAAAGAACACGAAGTTAACTTAAACAAGGAAATAACGAAACTGAAAATGGAACTCAGCGCTAAAATTCTTGACTTGGAGCAGTTAGAAGACGCGAAGAAGAAACAGGAAACGCTTTATAAGGCACTCGAAGACGAAGCCAAACGTGTCCAAGACGAACTTTCAAGTAAGGTCAGCGAATATGAAAGCATTTTGAACGAAGCGTC
ACAGAAGGAAGAAAATAGTAAATCTGAGATCATGAAACTACAACAGGAAATTAACACAAAAACCAAAGACTATGAGAAACTGCGCAATGAAACGAGCCAATCAATAAACGCTAACGAGAAGCTTATAGACGAACACAAACAAACAATTCATGAGAGAGATAAAGAAATTATTAAATTAAAAGATGAGTACGAAAATATGACAGCAAGTTTTAACATCAAACACACTAAGATAGCCGAGGAACATAAGAAAGAAATTGAAAATCGCAATACCAAGATCGAAGAACTGACTAAGGAGATAGAAAATCATAAACAGGCTTTAAATCAAAGCAAAATTGAACTCGACACTTTGAATACGCAGTTCACGTTGAATAATGACGAACTAAACGCGCTCAAGGAAGAAAATATGAAATTAAAACAATCGCTCAACGAACTCACGAACGCGAACACAGAACTCAAATCACAGATTGCCAAAATGGAGCTCGAAATCGGTGAATATAAACGTCAACTCGCCAGCTCCATAGAACGCTGTGAGGAAATACAAAAATCCAAAGAGACGATCGAAACAGAGTACATGAATCTCACTGGACAGACGACAGATTCTAACGAGCAATTCAATAAGCTCTCACAGCATCTTAAGAACACTGAAAATGAACTTCAAGCTATGAAAGATAAATACAGAGATGCCTCCAACAGCTGCGGAAGAATTGAACAAGAATTTAAACAAAAAATATTTAAACTACAAGAAAACTTCGCAATGGAAAGAGGTCAGTTGGTCAATTCCATCGACGATTACGTTCAGAAACTGAATGCATCGGAAAATAAGATAAAAGAATTTGAAGTTTTAATCAATGAAGCGACTAACCGTTTACATGAATTCGAAAATCAAAATGATAAATTACTAGACGAAAACATGATATTGAAAACTGAAATAGAAAATCTTAGAATTAGAGAGCAAGAAATAAACAACGAACACGACGCTGTACGCAAGAATATAGAGATTGAACTTGAAAAACATAAGGAGGAAATAACCGCGTTGAAAGCTGATGGGGCGACGTCTGAAGTAAAGTTAATGGAAAAAGTAGACCAATTGACTGAGGCACAGAATGATTTGAATAATAAACTGGAAGAAGCTCGCAAACATGAAGATTCCTTGCAAAAACTCTTAGAGGATATGACCACGCAGGTCGATAACCAAAAACATCAGTATGAGAAGGAGGTCAATCACCTGCAAGAGCAATTAGTATCAGTTAACGAAACACTAAACGCGCAGAAACAGAGTGAGTTACAATTGAAACAGGCTCTCGAAGACAAACAGAACGCTATTAAAGAATTGTTCTTAAAAGTTGAGATGTTAGAAGTGGATGTTAGATCGAATTGTGAAATTATAAACGAAAAAAATAATCAACTTTCTCAAGCTAATGAGGAGCTGAATAAAATGACAGATTTGAAGAATAGAATCGAAGAGCAGCTAAATTCATCTCTACTGGAGGCGTCTAAATTTAAACAACAATACGAAACGCTGTTACATAATTCAACGGCTGGAGAGTCACTGCTGAAGGAACAGCTCGACCAGTTAGAGTCGGTTAAAACTCAGTTAGGGTCCATATCTAACGAGAAGACTTTACTAGAAGAGAAATATAACGAAACCTGTAAAGAAGTTGCAAATTTGAGAAACGAACTCGAAGATACTGGCAAGAATTTAAAAGATAACCTTGAAATAATCATAGAACTGAAAAGAGATCTTAGTGAAAAGGAACTCGTCATCAGGTCACAGAATGAGAAAAATGACACTTGTAATAACAAAAGCCAACTATTGACAGAAGATATATTAAAACTACAAGAGGAAGTTAACAATAAAAACAACCTGTTGGAACAAAAACAGAAAGAATTAACAGTCCTTAATGAAACTTTGCTGTTGGAAACTAAGAAAACACAAGAAACTATTCGACGTTTAGAAAACGAAGGAACTGAATTAAAATCGAAGTATGCGAGTGAAACTG
AATCATTAAACAATACTATCAAAACATTACAACATCAACTAGACGCACAGCAACAGCTGCTCGGAGAACTTCAAAGTTCAAAAGAAAAAGTTAGTGATTTGGAACAGTTGCTGTCCAAATCCGAAAACGACATAAAAAAACTTACAAACATCAACGAAGCCCAGAAAGTGAATTACGAAGATCTCAATAAACAATTACAGAAACAATATGACGATTACAAAAAAGATAGCAAAGCTATAAGAAACGACCTGAAGAATAAAATTAATGATTATGAGAAACAACTACAAGATTCAAAAGACAGAGTGGCGTCGGAACTAGACGAACAGAATAAACTCCGAGAGAAACTTGTTGAGGCCGACAACAAACTGTTGGATTTATCACAAAAGCTCGAACTGATATCCGTTCAGCAAAGTAATAATGAGAGCAAAGACGAGAGATTAGAAAAACTGACTTTGGAACTGCAGGAAACGAGAAGAAATGGAGCCGAGGCGTTAGCGAATAGTGAAAAAACGATTGCCAAATTAAGAGTAGACATTGAAAATAGCATTAGAGATATTAACGTTAAGGATAATCAAATAAAGCAATTGCTGGAAGATTTGAAGACTCAGAAGGCAAAGGTGGAAATAGCAGAAAGAGAGAAGGTCATTTTGCAAAAAGAAATGGTACAGAACAGTAAAGACGTAAGAGACAAAAACGATAATGCAGCGGGTCTCTTGGGACAGGGCGATACAGCTCAACAAAAGGCTGATGATAAAGAAGTTATAGACGGACAGGTGAGCTTCTTGAACTCTGTGATAGTAGACATGCAACGAAAGAACGAACAGTTGACGGCGCGCGTGCAAGCCCTAGAGGGAGCCACGGTCGCCCCTGAGCCGCCATTGTTCAACGGTCGCAAGACTCGTGCGCCGCGACTGTTCTGCGACATCTGTGACGTGTTCGACGCTCACGACACAGACGACTGTCCGCGACAGTCCGCCGAGCCTACCCTCCCCACGGAGCGCAAACCTCCGCCTCCGCCGAGACCCTTGACGCTATATACACTTCCATCAGTTGTAACCGCTAAAGATTCCCACTACGTTATGAGATCAAAGACTTTCGCGAGTGTAACCGAAACGTTGGGAGGATGTAGTTTTAAAAATGATAAAATAACTCGTAGTAATCCGAAAATATTAGTTTCTCTTGCACTACGTTTTTGTCACTAA

Protein sequence:

>DPOGS212139-PA
MLVHVGFGTDLIEAEEDEVSSSLPERRRSYRRVSTSSRSSVTSMDTLWEKYPRRLSEAGLRRSSDHSVVLTEDTDSFIIGERVWVGGTKPGLIAYIGETQFAPGEWAGIVLDDPIGKNDGSVAGVRYFQCPEKRGVFSRLTRLTREPLASHGPHDASPISDAGSVFERPPSGSARPRRAHSPNGSVRSMVSSKMNASISTTTNGDLRLGDRVIVSSSRGSKAGTLRFVGPTEFASGVWGGVELDDPLGKNDGSVDGKRYFECSPRFGLFAPISKVSRSPSNRKPGSCAIHSNGRATPLRRSNSRESLTSLGTSIASSRAGVRLGVTSLGAQELLREKQMHLEKLMRERELERAEVAKVSIQADRSETALAQIKKEATQINNENAKLKAELDKLNKMLEDERQKVEDLMFRNEEENINKEDYNIEKLEREKLIRDLEAEVALQAARAETTAATLRVLEDQRSAEMSALAEQHKEELVAAQKLQKLLDEAYALLQEKESEKDSLGKSLSEELSKVKIESERALNEARTKLAIAQTEFETQVSVMNAKLQLAESKLETEKQNVERLNKDSSEIVIDLNNKLIALQAMVDDKTLEFNKVTGASKEHEVNLNKEITKLKMELSAKILDLEQLEDAKKKQETLYKALEDEAKRVQDELSSKVSEYESILNEASQKEENSKSEIMKLQQEINTKTKDYEKLRNETSQSINANEKLIDEHKQTIHERDKEIIKLKDEYENMTASFNIKHTKIAEEHKKEIENRNTKIEELTKEIENHKQALNQSKIELDTLNTQFTLNNDELNALKEENMKLKQSLNELTNANTELKSQIAKMELEIGEYKRQLASSIERCEEIQKSKETIETEYMNLTGQTTDSNEQFNKLSQHLKNTENELQAMKDKYRDASNSCGRIEQEFKQKIFKLQENFAMERGQLVNSIDDYVQKLNASENKIKEFEVLINEATNRLHEFENQNDKLLDENMILKTEIENLRIREQEINNEHDAVRKNIEIELEKHKEEITALKADGATSEVKLMEKVDQLTEAQNDLNNKLEEARKHEDSLQKLLEDMTTQVDNQKHQYEKEVNHLQEQLVSVNETLNAQKQSELQLKQALEDKQNAIKELFLKVEMLEVDVRSNCEIINEKNNQLSQANEELNKMTDLKNRIEEQLNSSLLEASKFKQQYETLLHNSTAGESLLKEQLDQLESVKTQLGSISNEKTLLEEKYNETCKEVANLRNELEDTGKNLKDNLEIIIELKRDLSEKELVIRSQNEKNDTCNNKSQLLTEDILKLQEEVNNKNNLLEQKQKELTVLNETLLLETKKTQETIRRLENEGTELKSKYASETESLNNTIKTLQHQLDAQQQLLGELQSSKEKVSDLEQLLSKSENDIKKLTNINEAQKVNYEDLNKQLQKQYDDYKKDSKAIRNDLKNKINDYEKQLQDSKDRVASELDEQNKLREKLVEADNKLLDLSQKLELISVQQSNNESKDERLEKLTLELQETRRNGAEALANSEKTIAKLRVDIENSIRDINVKDNQIKQLLEDLKTQKAKVEIAEREKVILQKEMVQNSKDVRDKNDNAAGLLGQGDTAQQKADDKEVIDGQVSFLNSVIVDMQRKNEQLTARVQALEGATVAPEPPLFNGRKTRAPRLFCDICDVFDAHDTDDCPRQSAEPTLPTERKPPPPPRPLTLYTLPSVVTAKDSHYVMRSKTFASVTETLGGCSFKNDKITRSNPKILVSLALRFCH-