Monarch geneset OGS2.0

DPOGS201752
Transcript        DPOGS201752-TA   3729 bp
Protein           DPOGS201752-PA   1242 aa
Genomic position  DPSCF300279 - 52663-65758
RNAseq coverage   953x (Rank: top 13%)
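
The lengths and coordinates in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes that the genomic coordinates are 1-based and inclusive and that the transcript is coding sequence plus one stop codon; the variable names are hypothetical and not part of the record.

```python
# Values copied from the record.
transcript_bp = 3729
protein_aa = 1242
genomic_start, genomic_end = 52663, 65758

# Assumption: the transcript is pure coding sequence, so it should hold
# three bases per residue plus one three-base stop codon.
expected_cds_bp = 3 * (protein_aa + 1)

# Assumption: coordinates are 1-based and inclusive.
genomic_span_bp = genomic_end - genomic_start + 1

# Bases inside the locus that are absent from the transcript.
non_transcript_bp = genomic_span_bp - transcript_bp

print(expected_cds_bp == transcript_bp)  # True
print(genomic_span_bp)                   # 13096
print(non_transcript_bp)                 # 9367
```

Under these assumptions the transcript and protein lengths agree, and the locus spans about 13.1 kb on scaffold DPSCF300279.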
Annotation
Heliconius        HMEL006711         0.0      81.66%
Bombyx            BGIBMGA002649-TA   0.0      82.28%
Drosophila        CG5366-PA          0.0      56.76%
EBI UniRef50      UniRef50_Q9VKY2    0.0      56.76%   CG5366 n=15 Tax=Endopterygota RepID=Q9VKY2_DROME
NCBI RefSeq       XP_393409.2        0.0      64.65%   PREDICTED: similar to Cullin-associated NEDD8-dissociated protein 1 (Cullin-associated and neddylation-dissociated protein 1) (p120 CAND1) isoform 1 [Apis mellifera]
NCBI nr blastp    gi|383861077       0.0      64.59%   PREDICTED: cullin-associated NEDD8-dissociated protein 1-like [Megachile rotundata]
NCBI nr blastx    gi|383861077       0.0      64.70%   PREDICTED: cullin-associated NEDD8-dissociated protein 1-like [Megachile rotundata]
Group
Gene Ontology     GO:0005488         1e-249            binding
KEGG pathway
InterPro domain   [795-1218]    IPR011989   1e-249     Armadillo-like helical
                  [4-1221]      IPR016024   8.5e-193   Armadillo-type fold
                  [1050-1215]   IPR013932   4.5e-62    TATA-binding protein interacting (TIP20)
Orthology group   MCL11346      Multiple-copy universal gene
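
The bracketed InterPro ranges can be turned into domain lengths and the share of the 1242 aa protein that each one covers. This is a minimal sketch, assuming the ranges are 1-based, inclusive residue positions; `span_length`, `domain_lengths` and `coverage` are hypothetical names used only for illustration.

```python
protein_aa = 1242

# (start, end, InterPro accession) copied from the annotation table.
domains = [
    (795, 1218, "IPR011989"),   # Armadillo-like helical
    (4, 1221, "IPR016024"),     # Armadillo-type fold
    (1050, 1215, "IPR013932"),  # TATA-binding protein interacting (TIP20)
]


def span_length(start, end):
    """Length of a residue range, assuming 1-based inclusive coordinates."""
    return end - start + 1


domain_lengths = {acc: span_length(start, end) for start, end, acc in domains}

# Percentage of the full-length protein covered by each domain hit.
coverage = {acc: 100 * n / protein_aa for acc, n in domain_lengths.items()}

for acc, n in domain_lengths.items():
    print(f"{acc}: {n} aa, {coverage[acc]:.1f}% of the protein")
```

On this reading, the Armadillo-type fold hit (IPR016024) spans nearly the whole protein, while the TIP20 hit is confined to the C-terminal region.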
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201752-TA
ATGGCTAGTGTTTCGTATCAAATCGCTAATCTTCTTGAGAAGATGACATCAAATGACAAAGACTTCAGATTTATGGCGACAAATGATTTGATGACTGAATTGCAAAAAGATAGCATTAAGCTCGATGATGACTCTGAGAGGAAAGTCGTAAAAATGCTTCTTCGCCTGTTGGAAGATAAAAATGGAGAAGTTCAGAATTTAGCTGTTAAATGTCTAGGACCTTTGGTAAACAAGGTGAAGGAATGTCAAGTGGAGGGTATTGTTGATACCCTCTGTGCTAATATGTTGTCAGACACTGAACAGTTAAGAGATATTAGCAGTATTGGACTGAAAACTGTAATCTCGGAGCTTCCATTGGGCTCTAATATTCTTGCTGCAAATGTATGTAAGAAAATTACTGGGAGGTTAAGCAGTGCTATTGAAAAGCAAGAAGATGTGTCAGTCCAACTCGAGGCTTTAGATATCCTGGCTGATTTACTCAGTAGATTTGGCGGCTTGCTAATAACCTTTCATCCAATGTTATTAGACTCCTTACTTCCTCAACTGGCATCTCCCCGTCAGGCTGTACGTAAACGTACAATCGTTGGTTTGTCTCATCTTGTTATGTCCTGTAATACTTCTCTATATAACAAACTGATTGATCACTTATTGGAGGGTTTGAGTACTTCAACATCATCAAGTGTCAGTCGTACCCATATTCAATGTATTGCTTCAGTTGCACATGGTATACATCGTCAAGCCGGGCATAGATTTGGTGAGCAGTTGTGGAGAGTGGCTCCTCAGGTTTTGAAACATTCAACAGATCAAGATGAGGAGCTACGTGAACATTGTCTTCAAGCCTTGGAAGCATTTGTTTTGAAGTGTCCTAAAGAAGTGCAGCCCCATATTCCAACTATAATTGATCTATGTCTAAAAATGATAACCTATGATCCAAATTACAACTACGAAGATGATGAAGAAGGTGGCGGTGGTGGGGAGGATGAGGAGATGGAAGATGAATCTTTTGAACTGGCCGAGCCCGAGTCGGACTCGGAGGAATACTCTGACGATGACGACATGTCCTGGAAGGTGAGGCGCGCTGCTGCTAAATGTTTGGAATCCGTGATCTCAACGCGTCACGAGCTTTTGGCTGAAATGTATGTGACTGTATCACCTGCATTGATTGCAAGATTTAAGGAGCGTGAGGAAAACGTGAAGTGTGACATTCTATCGGCGTACACAGCGTTACTGCGGGCGACTCGCCCTCCTCCAGCTCTCCACACGCCCTTGATACCGGCGGCGGACAGCCCCCAGGCTCTACTGCTACAGAGAGCGCCCGCCTCCGTGCGTGCTCGAGCGTGCGCCCTCGCTCTGTTAAGGGAGCTGTTGGCGGCCGCCCCCGGCTGCCTCGCCGACCACGCCGCCAGGGTGTCGCTGTTTCCGTGCGCGTCGTCTATGAAGATCGAGACGCTGGTGTTCGTCGTGTGGCTGGTCCGCGGTCACGCCCCCGACTGTATGCGACCCCATGTGTCGGCCCTCCTGCCGGCCGTGCTGGCCTGTGTTCACGACCCCTTCTACAAGGTGACCGCGGAGGCCTTACACGTCCTGCAGACGCTCGTCAAGGTCATGCGCCCACTCGAGGACATCTCGAGGGGTGTGTCCGGTGTGTCGGGGGTGGGGGAGCGCGAGGTGGGGGACTGGGTGCGCGGTATGTACGACTGTACCCTCGTCAGGCTCCGCGCCACGGACATGGATCAAGAGGTCAAGGAGCGAGCCATCGCCACCGCCGGACAACTCATATGTCACTTTGGTGATTACTTAGAGAATGAGCTGCCAGTGTGTCTCCCGATATTCTTGGAGAGATTGAGAAACGAAATAACGCGGTTGACGACTGTGAAGGCTCTCACTAAAATTGCTTCCTCACCCCTACGTATTGACCTGCGGCCGATATTGAGCGACGCAGTACCGATTCTAGGCTCTTTCTTGAGGAAAAATCAGAGGGCCTTAAAACTGTCAACGTT
GGTTTTGTTGGATACTCTCGTACAGAACTACAGTAACGCTATTAGTATAGAGCTACTGAGTAAGGTGCTGATGGAGGTGCCAGCCCTAGTCTGCGAGGCGGATCTGCATTGTGCTCAGACAGCGCTAACTCTTGTCCGCGGCGCCTGTGAGAGATGTCCGGCCGCACTCACACCGGATGCACGACATGCACTCACACCGAACATACTGGCATTGGCAAGATCCCCGCTGTTGCAAGGTGGCGCGTTAAAGGCGATGGTGGGGGTCCTATCCGCGCTGGTCGCGGCGGACGTGGCGGGATGCGGCCTCGGGGTGCGGCCGCCGCGAGCTCCTGGCGCTGCTGGTGGCCCCCGTGCACGACAACCCCGACCATGCTGCCACTCCGCATCATATCAAACAAGTAATGCATATCACTCGCTGGCGAAGTGCGTGGCGGCGGTCGTGGTGTCCGGGGGCTCGGACGCGCTGGACATCACACGCGGCTTCCTGAAGGACGCGGCGCAACCACGCTCTGATACACATCACATGTTCGCACTACTGGCGCTGGCGGAGATAGGCCGACATCTCGATCTTAGTTCTATTCCGAATCTGAAAGAGGTGCTATTGAGTTCCTTCACGCCATCCTCTGAAGAGGTGAAGTCGGCTGCGAGCTACGCGCTAGGATCAGTGGCCGTAGGAAACTTACCAGAGTTCTTACCGTTCATACTCAACGAGATCGAAGCGCAGCCCAAAAGACAGTATCTGCTTCTGCATTCACTTAAAGAGATTATAGCTTGCGAGTCCTGCACACCAGAGAGCGTGGAGGCGTTGAGGCCTTTCATACCAGAGATTTGGGTACAGCTGTCTAAGCATTGCCAATGTGCAGAGGAGGGATCACGCAATGTTGTTGCCGAATGTTTGGGAAAGCTATGTTTATTAGAACCTCAGCAGCTTCTGCCACATTTGAAAGAGTTTTTGAAGTCGTCTGAACCCCTCACAAGGACTACCGCCGTCACCGCCGTCAAGTTCACTATATCGGACCAGCCTCAAGCCATCGACAGTATGTTGCGATCGTGTATGTCTGAGTTGCTGGTTCCCCTCCGCGACTGTGAGCTGGGTGTGCGGCGCGTGGCGCTGGTCGCCTTTAACTCGGCAGCTCATAATAAACCGTCGCTGGTGCGAGATCTGCTGCCACAAGTCCTGCCGACCATCTACGCTGAGACTAAAGTCAAGAAAGAACTGATAAGAGAAGTAGAGATGGGTCCGTTCAAGCACTCCGTGGACGACGGCCTCGACCTGCGCAAGGCGGCCTTCGAGTGTATGTACACTCTGCTGGGCACGTGTCTGGACCGCATCGACGTGTTCGAGTTCCTGAGGCACGTGGAGGACGGCCTCCGAGACCATTACGACATCAAAATGTTGACGTACCTCATGTGCGCCAGGCTCGCTCACTTGTGTCCGGCGGTGGTGCTGCAGAGACTGGAGAGTTTGGTGGAGCCGCTTCGTGCTACATGCACTATGAAGGTGAAGGCTAACTCTGTTAAGCAGGAGTATGAGAAGCAGGACGAGCTCAAGAGATCAGCTCTAAGAGCTGCCGCCGCCTTGCTACAGATACCTGACGCGGACAAAAACCCTCACTTAATGGACTTCGTAACTCAGATCAAGTCATTCCCAGACTTGCAGCCCATATTCGAGTCGATCCTGAAAGATTCTTCAGGCGGCGTGGATTCCAACCTCATGGATCAGAGCTAG

Protein sequence:

>DPOGS201752-PA
MASVSYQIANLLEKMTSNDKDFRFMATNDLMTELQKDSIKLDDDSERKVVKMLLRLLEDKNGEVQNLAVKCLGPLVNKVKECQVEGIVDTLCANMLSDTEQLRDISSIGLKTVISELPLGSNILAANVCKKITGRLSSAIEKQEDVSVQLEALDILADLLSRFGGLLITFHPMLLDSLLPQLASPRQAVRKRTIVGLSHLVMSCNTSLYNKLIDHLLEGLSTSTSSSVSRTHIQCIASVAHGIHRQAGHRFGEQLWRVAPQVLKHSTDQDEELREHCLQALEAFVLKCPKEVQPHIPTIIDLCLKMITYDPNYNYEDDEEGGGGGEDEEMEDESFELAEPESDSEEYSDDDDMSWKVRRAAAKCLESVISTRHELLAEMYVTVSPALIARFKEREENVKCDILSAYTALLRATRPPPALHTPLIPAADSPQALLLQRAPASVRARACALALLRELLAAAPGCLADHAARVSLFPCASSMKIETLVFVVWLVRGHAPDCMRPHVSALLPAVLACVHDPFYKVTAEALHVLQTLVKVMRPLEDISRGVSGVSGVGEREVGDWVRGMYDCTLVRLRATDMDQEVKERAIATAGQLICHFGDYLENELPVCLPIFLERLRNEITRLTTVKALTKIASSPLRIDLRPILSDAVPILGSFLRKNQRALKLSTLVLLDTLVQNYSNAISIELLSKVLMEVPALVCEADLHCAQTALTLVRGACERCPAALTPDARHALTPNILALARSPLLQGGALKAMVGVLSALVAADVAGCGLGVRPPRAPGAAGGPRARQPRPCCHSASYQTSNAYHSLAKCVAAVVVSGGSDALDITRGFLKDAAQPRSDTHHMFALLALAEIGRHLDLSSIPNLKEVLLSSFTPSSEEVKSAASYALGSVAVGNLPEFLPFILNEIEAQPKRQYLLLHSLKEIIACESCTPESVEALRPFIPEIWVQLSKHCQCAEEGSRNVVAECLGKLCLLEPQQLLPHLKEFLKSSEPLTRTTAVTAVKFTISDQPQAIDSMLRSCMSELLVPLRDCELGVRRVALVAFNSAAHNKPSLVRDLLPQVLPTIYAETKVKKELIREVEMGPFKHSVDDGLDLRKAAFECMYTLLGTCLDRIDVFEFLRHVEDGLRDHYDIKMLTYLMCARLAHLCPAVVLQRLESLVEPLRATCTMKVKANSVKQEYEKQDELKRSALRAAAALLQIPDADKNPHLMDFVTQIKSFPDLQPIFESILKDSSGGVDSNLMDQS-
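
The relationship between the two sequences above can be spot-checked by translating the ends of the transcript. The sketch below assumes the standard genetic code and renders the stop codon as `-`, as the protein record does; `translate` and the other names are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 27 and last 12 bases of DPOGS201752-TA, copied from the record.
first_codons = "ATGGCTAGTGTTTCGTATCAAATCGCT"
last_codons = "GATCAGAGCTAG"

print(translate(first_codons))  # MASVSYQIA
print(translate(last_codons))   # DQS-
```

Both fragments match the corresponding ends of DPOGS201752-PA, which is consistent with the transcript being a complete open reading frame from ATG to TAG.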