Monarch geneset OGS2.0

DPOGS208949
TranscriptDPOGS208949-TA4023 bp
ProteinDPOGS208949-PA1340 aa
Genomic positionDPSCF300009 + 341184-363581
RNAseq coverage44x (Rank: top 72%)
Annotation
HeliconiusHMEL0115500.077.45% 
BombyxBGIBMGA002419-TA0.064.22% 
DrosophilaTusp-PD0.049.55% 
EBI UniRef50UniRef50_H0RNI00.049.55%FI16909p1 n=8 Tax=Diptera RepID=H0RNI0_DROME
NCBI RefSeqXP_002425814.10.048.76%hypothetical protein Phum_PHUM221300 [Pediculus humanus corporis]
NCBI nr blastpgi|3205433610.049.55%tusp, isoform D [Drosophila melanogaster]
NCBI nr blastxgi|3205433610.046.75%tusp, isoform D [Drosophila melanogaster]
Group
Gene OntologyGO:00055152.8e-29protein binding
KEGG pathway 
InterPro domain[1256-1335] IPR0000071.5e-56Tubby, C-terminal
[45-362] IPR0159432.8e-29WD40/YVTN repeat-like-containing domain
[46-358] IPR0110463.3e-26WD40 repeat-like-containing domain
Orthology groupMCL14463 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208949-TA
ATGCATTTACATTTCGAGAGAAACGTTAACGCGAAGTGCGATTGTACTATATTGTCTCTGTCATGGATGGGTAAAGTACCAGACGAGCTGCCCGAGGAAGAAGGATGGAAGCTGAATCGTAACAATTATTATCAAGAAGGTTGGCTAGCGACTGGAAACGTGCGTGGGGTTGTGGGAGTTACATTCACATCTTCACACGCGAGAAGACCACACGAGCTTCCGTTACGAACCAATTACAATCTGCGAGGACATCGATCAGATGTGATTCTGGTGAAATGGAATGAGCCGTATCAGAAGCTGGCTTCATGTGACAGCTCGGGAGTAATCTTCGTGTGGATTAAGTACGAGGGCCGCTGGAGCATTGAGCTGATTAATGATCGGAGCACTCCAGTCACACACTTCTCGTGGTCTCACGATGGTCGGATGGCGCTAATATGCTATCAGGATGGCTTCGTGCTCGTTGGTTCTGTGGCCGGACAGAGGTACTGGTCTTCGATGCTGTCCCTGGATGCTCGTATTACATGTGGATGCTGGACTCCTGATGACAGCCAGGTTTACCTCGGCACTGCATCCGCACAGCTCGTTGTCATGGATGTTCACGGAGCTATGGTGTCCCAGGTGCAATTAGTAGAAGAAGGCGGTATAACTTCTATGGCTTGGTCTTGCGAAAAATTTAATATGGAGGAAGGTGAGGAGGTTGGGGAGAATAAAGGCGGGCATGTGCTGGCTGTAGCTCTTGGCAACGGAGAGATAGTACTATTACGAGGACATGATGACGTCAGTCCGATCAGGATACACACTGGTGTTAGAGGAAACACATTGGCCATGGAGTGGGCGAATTCTAGGGAACTGTTGGCCGTTGCTGGCACTTTAATTGCCGAACCAAATGAACCTGAAGACGAACCTCCCTTTAAGAACATAGTCAAGTTTTACGCGGACACAGGAGCTCTTATTTACACCGTGCCCGTCCCTTGTACCCAAGCGCGTGTGACAGCTCTGACGTGGGGTCATGCAGCGCGTCGCTTATTCGTGGGTGTGGGGGGCTCCGTTTGCACTGCTCGCGTGTGGCGAGTCGTGGCACCCCTGCAACTGTTGGCACGTGTTCGTGCCGCTGCCGCTCTACGTCACCCGCGCCTCGCTGCCAAACTGCCTCTACCACCTAGACTGCAGCCCGCTTTAGCGAGTTTATTCGCTCATACCATACGGTGCAACGTCCCTGAAGCGAACGAATTACGCCGGTTTGTATCTCGTCCACCGGCCTCCGGCGGCCGTCTTCACTGCACTATGCTTCGACATGACGATGAGGAGGCGGGTGCTTATACACTCTACCTCGAACATCTCGGGGGACTAGTGCCCTTACTCAAAGGAAGGAGGACCAGCAAGATCAGACCAGAGTTCGTGATATTCGATCCTCAGGGTGAAGCAAACCTTCCAGAAGGGGTCCCTCCGCCGCGTGAGTCAAGCAGCAGTAGCAGCACTAGCGCGTCTAGCAGCAGCGGTGGCGTCCGTGGCTCCCCCCGCCTGTCCCGGGCTCACGGCTCGCCCCGTTCCCGAGCCCCCCGACCAAGACCCGCGACACAACACGACTCTTGCTCCTCGGACACTGAACGAGAGGATGGTTGCTCTGGCTCTCCACGCTTACAAAGACGCAGACGAGCTCGAGAACGACGGAAAGCTCGCAATGCCAGTGAAAAAGATGACACCCCTGATGAATTGGCCTATATAGACTCGTTACCAGAAGATGTACGTCTAGTGGAAGTAACATCGAACATATGGGGGACCAAATTTAAAATGCACGGTTTGGCCAAAAACGTGCCGGCGAACCTCGGACAAGTGACCTATAAAACTTCATTACTACACCTCCAACCCAGACAGATGACATTGACAATAACAGAACTCAGAGATGACTACCCCGTCGGGCCAGATCCTAATTTCAATCCGAATATATTTTCCGAAGACGAAGAAGAAGTATTCCAGCCTAGCCAAACAAACTCTCCGTCACACACTAATAACAATGTTAGAAAAAAAATCAATCCATTAATAATGACTGATAGGATGACTAGCACTAATAATATAATAAATCAATCGGAAATATATTCACCCAACAACAACTCAAACCCCGCTATGGCTAGGGCGGAATCTTACGACGAATTTCCCTTTATAGATACCAACGAATCTGTTAACAATGTGACAGATAATGTTTTTAATTCGACTCCTGTTCGACATAGTAACCAGACGCCAGAAAGACGTGTCAGCAACACAGCCGGGGTGACCAACCGTCATGCTATATCACCTTTGAGGTGTGAGGGTTCCGTACCCACATTGCAGTCCCCTAAGAATGCTGTTGCACCGACCGATATAATTTTCGAGAGGCCCTCGCCTCAGACTGTCACTTGCGGCGGTAGGGGCGACTTTTGTGGAGGACGCACGGAGTATAGCGTCAGAGATAATGTTGCAATCAAAGGAAATCTACCCAACATGGACCATCAAGTGACCATAAAGAAGTGCGATAACCATGTGGCCGACAATTGTCTGTCAAAGTTAAGAAAAAATATATGCAATAGGGGAGAATCGTCATACGAAATTAATACTAACAGAATTTTAAAATCTCTCTGTAATAAGAATTTTGAACACGAAGTTCATCCTGACGCCATGTCTAAAAGATCAGAAACACTTAAAAATTTGCAAAAAGGTGAAGATTTAAAATTTATTGACGAAGAGACTCCAGTCGATACGAATATTAATAATTTAACAGATAAAGTTCGAGAAGGTAGAGTTCAAAGAACGACTACTGTAGTACCCATTAGTCCAGTGTGTGCAAACATTCCCGTATATGATACGATGACGCGCAGTTGCAGTGTGGGCTATCTTGACTTGGTCGACCCACAAGTACTACATGCTCAAGTCAGCCTGACCGCCTTGAGGGGAGAACCGCCACGTCGCCTGATACTCGTAAATACGAAACGACAACGACGCCCGAAAAGATACCTTAAAAACGATATGAAACAAATGGACAACAAAACACCAAGCCTGAAGAAATGTGGGAAATCGCGGAGCTTGGATTCTGGAGAACTGACCATCACAGCAGAGAAAAGTAAACGGCAATCGAAAACGGAGGCGAAAATCAAGCCCGACGTCTCCGCCAACTCCAGTAGTCGCTACAACAGTACAGGGGAAGAGGGCAGCGGCACCAGTACGGAGGAAGGGCGAACTAGAGGAGGATGCCAGCTTGGGACTAGACCCGCCGGCTCGGGCACACCGCCCGCAGCTCGGGCAGCGCCCCCGGCACCCCTCCTCACCGACAGTGATTCCGACTACAGCAAGTATTATAGTTCTTTGGAGCAGTTGGCTCTGCGGCTGCTGGCTTCGCGGTCATCGAAGCGTGCTGGCGAGGGGATCGCCGGTGGTGTCGCGGGGGTCGCGGGGTCATTATCTCGCGAAAGCAGCTCCGCACCAGCCTCTCCTCGTCCGGCGCGGGCCACCTCCCCCACCCCCGCGACACCAGCTCCGAGGACCAGGAGACAGAGATACTCCTCCGCGTCACCGATAAGACAATTGCTCAACTCGCCTCTACTAAATAGAAGAAGAAACAAGAAGCTGTCTGAGAGTTCAGACGACGAGTACTCGAACGGTTACAACGAAGTTAACAGCAAGAACTACAGAGACCTTGAAAGTTTCCAGAAGGCACAACTTAGAAATAAGCTAAAACGTGCTGGCGGTAACATGGGCGCGGGTGGAGGGGCGAGTGCGGGGGGTGCGGGGGAGGTCCGAGAGCGTCGTCAGCTGATGATGCACAACAAGGCGCCCATGTGGAACGAGAACAGCCAGGTGTACCAGCTAGATTTCGGCGGACGGGTCACGCAAGAGTCCGCCAAAAACTTCCAAATTGAGTACCACGGGAAGCAAGTGATGCAATTCGGTCGTATCGATGGCAACGCGTACACACTGGACTTCCAGTACCCGTTCTCGTCGCTGCAGGCCTTCGCGGTCGCGTTAGCGAACGTGACGCAACGGCTCAAATGA

Protein sequence:

>DPOGS208949-PA
MHLHFERNVNAKCDCTILSLSWMGKVPDELPEEEGWKLNRNNYYQEGWLATGNVRGVVGVTFTSSHARRPHELPLRTNYNLRGHRSDVILVKWNEPYQKLASCDSSGVIFVWIKYEGRWSIELINDRSTPVTHFSWSHDGRMALICYQDGFVLVGSVAGQRYWSSMLSLDARITCGCWTPDDSQVYLGTASAQLVVMDVHGAMVSQVQLVEEGGITSMAWSCEKFNMEEGEEVGENKGGHVLAVALGNGEIVLLRGHDDVSPIRIHTGVRGNTLAMEWANSRELLAVAGTLIAEPNEPEDEPPFKNIVKFYADTGALIYTVPVPCTQARVTALTWGHAARRLFVGVGGSVCTARVWRVVAPLQLLARVRAAAALRHPRLAAKLPLPPRLQPALASLFAHTIRCNVPEANELRRFVSRPPASGGRLHCTMLRHDDEEAGAYTLYLEHLGGLVPLLKGRRTSKIRPEFVIFDPQGEANLPEGVPPPRESSSSSSTSASSSSGGVRGSPRLSRAHGSPRSRAPRPRPATQHDSCSSDTEREDGCSGSPRLQRRRRARERRKARNASEKDDTPDELAYIDSLPEDVRLVEVTSNIWGTKFKMHGLAKNVPANLGQVTYKTSLLHLQPRQMTLTITELRDDYPVGPDPNFNPNIFSEDEEEVFQPSQTNSPSHTNNNVRKKINPLIMTDRMTSTNNIINQSEIYSPNNNSNPAMARAESYDEFPFIDTNESVNNVTDNVFNSTPVRHSNQTPERRVSNTAGVTNRHAISPLRCEGSVPTLQSPKNAVAPTDIIFERPSPQTVTCGGRGDFCGGRTEYSVRDNVAIKGNLPNMDHQVTIKKCDNHVADNCLSKLRKNICNRGESSYEINTNRILKSLCNKNFEHEVHPDAMSKRSETLKNLQKGEDLKFIDEETPVDTNINNLTDKVREGRVQRTTTVVPISPVCANIPVYDTMTRSCSVGYLDLVDPQVLHAQVSLTALRGEPPRRLILVNTKRQRRPKRYLKNDMKQMDNKTPSLKKCGKSRSLDSGELTITAEKSKRQSKTEAKIKPDVSANSSSRYNSTGEEGSGTSTEEGRTRGGCQLGTRPAGSGTPPAARAAPPAPLLTDSDSDYSKYYSSLEQLALRLLASRSSKRAGEGIAGGVAGVAGSLSRESSSAPASPRPARATSPTPATPAPRTRRQRYSSASPIRQLLNSPLLNRRRNKKLSESSDDEYSNGYNEVNSKNYRDLESFQKAQLRNKLKRAGGNMGAGGGASAGGAGEVRERRQLMMHNKAPMWNENSQVYQLDFGGRVTQESAKNFQIEYHGKQVMQFGRIDGNAYTLDFQYPFSSLQAFAVALANVTQRLK-