Monarch geneset OGS2.0

DPOGS202665
TranscriptDPOGS202665-TA4080 bp
ProteinDPOGS202665-PA1359 aa
Genomic positionDPSCF300039 + 171938-181535
RNAseq coverage83x (Rank: top 64%)
Annotation
HeliconiusHMEL0022221e-14670.15% 
BombyxBGIBMGA001293-TA0.069.57% 
DrosophilaOseg6-PA0.040.32% 
EBI UniRef50UniRef50_UPI00022CA03D0.044.32%UPI00022CA03D related cluster n=2 Tax=unknown RepID=UPI00022CA03D
NCBI RefSeqXP_975372.10.045.51%PREDICTED: similar to WD repeat domain 19 [Tribolium castaneum]
NCBI nr blastpgi|910807190.045.51%PREDICTED: similar to WD repeat domain 19 [Tribolium castaneum]
NCBI nr blastxgi|910807190.045.51%PREDICTED: similar to WD repeat domain 19 [Tribolium castaneum]
Group
Gene OntologyGO:00055154.1e-29protein binding
GO:00054885.2e-05binding
KEGG pathway 
InterPro domain[36-336] IPR0110464.1e-29WD40 repeat-like-containing domain
[23-167] IPR0159431.3e-16WD40/YVTN repeat-like-containing domain
[244-632] IPR0110476.2e-12Quinonprotein alcohol dehydrogenase-like
Orthology groupMCL12613 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202665-TA
ATGTCGACTCCCAGGCTCTTATACACTATTGAACAACCTCACGGGCTCGGCGAACTGTACTTTAAGTGGCAAAAAGGCGAATCTCATTCACTTCTTGCTACTACTGGTACTGACGCCACAGTAGCTATACACGATAGAGGCGGACAACTGCTTGAAAGACTTAAGCTGCCAGGTTTATGTTCTGGTATGGAATGGGATAACGATGGTGATTACTTAGCAGTCATTACACCAAACAGTAATACTGTGCTCCTTTGGGAGTGTCATTCTAATAAACGGATCAATATAGAGACAGGGCTCCGAGAGGCTCCATCATGTTTAGCTTGGTATTATGGAGAGCCATTGTTAGCTATTGGGACTCAGAAAGGAAATCTGGCTTTGTATAATCATCACACTACAAAGAGAATACCCATATTGGGTAAACATACAAAGAAGATTACATGTGCAGCATGGAATCGTGACAGCATATTGGTACTTGCCTCAGATGATAAGAACTTATCAATAAACAACTCCGACGGAGACACACTCAGAATGATATCTCTAAGAGACATACCAAATGATTTACAATTTTCAGAAATGAAGACCGACGAAAGAGTTGCAGGAGAGAATACTATAAGTCTCATTGTTGGAAAACGGACTTTGTACTTATACAACCTCTTGAATCCGGAGAATCCGATAGAGCTGGCATTCCAACAGAGATATGGAGTGATAGTATCATACAAATGGTATGGAGATGGTTACATCCTCATAGGATTCAGCGGTGGATATATCATAGCTATATCAACGCACATCAAGGAAGTGGGCGAGGAATTATTCCAAGTGAAGAATCACAAGGAGAATCTGTCCGACATGGCAGTGTTCAACGGACAAGCTGCTTCATGTGGGGATGGCCAACTTAAACTCATCTCGGTGTGGAACAGTGGTGAGGTCTGCGGTTCCATTGTTCCTGTGGGCGGTAGTGAGAGGGTCGAGTGGAGTGCAGACGGGCGACTGTTGGCCACGGCTGGTAGAGGATACCTCAGCGTGTATGTGTCTGCGCTGCCTCCGCTGCACGCCTCGCACGGCACCAGGGTTGTCACCCTCACTAACCTCACGGAAGTTACAGTCTACCAGTGTATCGGCGTTGGAGATGACAGCGACGCGGTTAATAAATCGGAGCCGGCAGCCCTGGCCACGTACAGCCTGCCGACGGAGCCGGAACTGCTTTCTCTGGGCGGATCTCACGTGTGCTGCTCGTCGGGTGTGGTGGGCTGGTTCGTTCCGCTGGGGGGCGGGTCCCCCCACCGCCGTCACTACCCGGGGGCTCCCAGCGCCCTCACAGCGACCGGGGGCTACGCTGCCGCACTCTTCGACGGCAAGGCTATGTTGCACTCGATCGAAGCGTCAGCCTCGTGCGAGCCGGAGCGCGAGGCTATGCTGTTTCCTGAGCAGCACATGAGCGGACTGAGGATCGTGGACATACATCTCACCGGCGACTTCTTTATATTCGTCACTGATCAAGGTCACATCGAGTATTTCAGTATAGAGAGCTGGTGTGCGTGTTCCCGGTTCCGCCACGCGGCGGGCGCGCGCTCCCTGCACTGCGACATCAGCGGCGCGCGGGCGTTGTTGGTGGACGGGCGAGGGAAGGCGCACGTGTACAGCGCCCCCTCCGGGGAGCTGTGCGCGCTAGTTGATACTCACAGCGTGAAATCGGCCGTATGGGACATCTGTCTGTCTGATCGTAACGTGTTCGCTGTGTTCACGGGCGAGGCCGTCCACACGTACTATTATTCCCCGAGCTCTATCGACGGGCCTCACGTCGATTTCGTGGCCACAACTGCCATGGAACAGCTGCCCCTCATCCTGTTCTCTGGAGACGTCTACTGTTATTCCAGCGGAGGCGGTGTTCTAAGAATACCTCTGGATTCTCACAACACAAGTGGTCTGGCCGACGCTGATACAGAGAGACGCCTCCTGAATCAGAGACAGCACATCCATAAATTGCTGCTCCTGAGGCGGTTCACTGAGGCCTGGCTGTTCTGTGACGCCGTGAATGAAGAGGACTTGTGGAGGAAACTTGGAGAGGCGGCCGTGGCTGAACTCGCTGTCGAATTTGCGATACGTGTCTACACTAGACTCAGCGACGTCGCCATGGTTTGGGCTCTAGAGGACGCGCTTCATATTGAGGAGATACCGGTGCTATGTGGGATGTTATGCGCGTGTCTGGATAGAGGAGAGGCGGCGGCTCGGTGGTTGGAAGGCGGACCACCGGCTAGGGCGTTAGAAATATATGCAGCGAGGGGAGAGTGGGGTCGAGCTGCTGACCTGGCCGCCACCACCTGCCCCTCACGAGTACCGTACACCGCTCTACATCAGGCACAGCACTTGGAACTCACAGCCGATTACCACGAAGCTCTCGCGAATTATGAGAAGAGTATTATAACAGAAAACTTAGACGACCTCAAAGTGAAAGAACACAATGAGAAGTGTGAAGCCGGAATAGCGCGCACGGCCATACGCTGTGGTGACGTCATGAGAGGGGTCACCACTGCTATGAAACTCGCGCACTATCCGAACCTACTTAAGGACTGCGCTCAGCTACTGGTAGAAGAGAAACAGTACAGCCACGCCGCGGCGTTGTACGACCACGCGGGGAACACGGAGAAGGCTGCCTCGCTGTACATCAAACTGAAATCCTGGCTGAAAGTAGAGGCTCTGTTACCAAAGATCCATTCGCCGAGCATTCACCTGCAATATGCGAAAGCCAAAGAGGCCGAAGGCAGATACGGAGACGCCCTGAAGTCGTATTTGAAAGCTCAAGACTTCGAATCGGCCATCCGCTTGAACTTGGACAAACTGGACGATATAGATGAAGCCGTTAATCTGGTCCAAGAGACGAAATCGGTCCAAGGGGCAAAAATGGTCGCCAACTATTTCCAAAACAGCGACGACCCGACGTCGGCGATAAAATTCCTAGTTATGTCCCTCTGTTACGACGAGGCGTTCCAACTGGCGCGGAAGAACGGCAAGTTGCATCTGTACGGTGAGATCCTGATACAAACGTCACAGGCGAGGCCCGAGGACTTCAGAAGCTTGGCGCTGCACTTCGAGGGGGAAAAGAACAGTCTGTTGGCCGGCAAGTACTACTTCCACGCCGGTGATCATAACAAGGCGATGACACACCTGCTGAGGGCGGGATCCTCGGAGAGCGAAGAAAACGAAGCCATCACCGTCGCCATAGACTCCGCGGCCGCCAGTGACGACGACAGGTTGACCAGGAGGTTAATAGAGTTTCTTCTGGGAGACTTGGACGGCAACCCTCGTGATCCTCGCCACCTCTTCCGTCTCTATATGGCCAAAAAGCAGTTCACGGAAGCTGCTAAAACAGCGGTAGTTGTATGTGCCAGCGAGTGTGCTGTGGGTAGGTACCGTGAGGCCCGTGACGTGTCCCGAGGGCTGGTCTGCGCGCTGAGGACGCGCCGAGCGACACCTCCCAGGGACCTTATGCACCACCTGGCGCTACTACACTCATATATACTGGTGCGGACTCACGTGCGCCGCTCCCGTCACGACCTGGCCGCTGTCCTCCTGCTCCGGACAGCAGCTGAAGTATCTTTCTTCCCCACAGAACAGCATCAAGTGTCGATACTGACGTCCACCGTGATCGAGTGTCTGCGGGCCGGGATGAAGCACCAGGCGTACCAGTGGGCGAGGGTGTTGATGGAGCCGCAGTATAGGAGCAAGATCGATCCGAAGTATATAAAGAAGGTGGAGTCAGTGGTGCGCCACCCGCCGCGAGGCTCGGCCCCGCCGGCGGACCGCTCCCCCTGTCCGAGGTGCGAGGCGCCGCTCGTCAGGTCCGCCCTCAACTGCGCGAGCTGCGAAGACGACGTGCCCTTCTGTATAGCGACCGGTCTTCATATTGAGAGGCAGGACCTGACGGCGTGTCCCGAGTGCGACTTCCCGGCCATATATTCAGAATTCAAAGAGATTCTCAGCGAGGACGGCAAGTGTCCGATGTGTAACGAGTCAGTGGATTACCGTCGCCTGGTCAGAATAGACGACGTGTCGCTCTACCTGGACTCTAAGACCGGGCAATAA

Protein sequence:

>DPOGS202665-PA
MSTPRLLYTIEQPHGLGELYFKWQKGESHSLLATTGTDATVAIHDRGGQLLERLKLPGLCSGMEWDNDGDYLAVITPNSNTVLLWECHSNKRINIETGLREAPSCLAWYYGEPLLAIGTQKGNLALYNHHTTKRIPILGKHTKKITCAAWNRDSILVLASDDKNLSINNSDGDTLRMISLRDIPNDLQFSEMKTDERVAGENTISLIVGKRTLYLYNLLNPENPIELAFQQRYGVIVSYKWYGDGYILIGFSGGYIIAISTHIKEVGEELFQVKNHKENLSDMAVFNGQAASCGDGQLKLISVWNSGEVCGSIVPVGGSERVEWSADGRLLATAGRGYLSVYVSALPPLHASHGTRVVTLTNLTEVTVYQCIGVGDDSDAVNKSEPAALATYSLPTEPELLSLGGSHVCCSSGVVGWFVPLGGGSPHRRHYPGAPSALTATGGYAAALFDGKAMLHSIEASASCEPEREAMLFPEQHMSGLRIVDIHLTGDFFIFVTDQGHIEYFSIESWCACSRFRHAAGARSLHCDISGARALLVDGRGKAHVYSAPSGELCALVDTHSVKSAVWDICLSDRNVFAVFTGEAVHTYYYSPSSIDGPHVDFVATTAMEQLPLILFSGDVYCYSSGGGVLRIPLDSHNTSGLADADTERRLLNQRQHIHKLLLLRRFTEAWLFCDAVNEEDLWRKLGEAAVAELAVEFAIRVYTRLSDVAMVWALEDALHIEEIPVLCGMLCACLDRGEAAARWLEGGPPARALEIYAARGEWGRAADLAATTCPSRVPYTALHQAQHLELTADYHEALANYEKSIITENLDDLKVKEHNEKCEAGIARTAIRCGDVMRGVTTAMKLAHYPNLLKDCAQLLVEEKQYSHAAALYDHAGNTEKAASLYIKLKSWLKVEALLPKIHSPSIHLQYAKAKEAEGRYGDALKSYLKAQDFESAIRLNLDKLDDIDEAVNLVQETKSVQGAKMVANYFQNSDDPTSAIKFLVMSLCYDEAFQLARKNGKLHLYGEILIQTSQARPEDFRSLALHFEGEKNSLLAGKYYFHAGDHNKAMTHLLRAGSSESEENEAITVAIDSAAASDDDRLTRRLIEFLLGDLDGNPRDPRHLFRLYMAKKQFTEAAKTAVVVCASECAVGRYREARDVSRGLVCALRTRRATPPRDLMHHLALLHSYILVRTHVRRSRHDLAAVLLLRTAAEVSFFPTEQHQVSILTSTVIECLRAGMKHQAYQWARVLMEPQYRSKIDPKYIKKVESVVRHPPRGSAPPADRSPCPRCEAPLVRSALNCASCEDDVPFCIATGLHIERQDLTACPECDFPAIYSEFKEILSEDGKCPMCNESVDYRRLVRIDDVSLYLDSKTGQ-