Monarch geneset OGS2.0

DPOGS205519
TranscriptDPOGS205519-TA1812 bp
ProteinDPOGS205519-PA603 aa
Genomic positionDPSCF300056 - 15775-21216
RNAseq coverage646x (Rank: top 20%)
Annotation
HeliconiusHMEL0037980.064.74% 
BombyxBGIBMGA000154-TA2e-17050.57% 
DrosophilaCG30349-PA4e-3924.75% 
EBI UniRef50UniRef50_D6WFF91e-5932.63%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WFF9_TRICA
NCBI RefSeqXP_001604509.13e-6329.30%PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|1565456946e-6229.30%PREDICTED: WD repeat-containing protein 43-like [Nasonia vitripennis]
NCBI nr blastxgi|1565456943e-6229.05%PREDICTED: WD repeat-containing protein 43-like [Nasonia vitripennis]
Group
Gene OntologyGO:00055151.5e-16protein binding
KEGG pathway 
InterPro domain[5-236] IPR0110461.5e-16WD40 repeat-like-containing domain
[456-557] IPR0071481.9e-16Small-subunit processome, Utp12
[5-236] IPR0159433.6e-16WD40/YVTN repeat-like-containing domain
Orthology groupMCL11557 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205519-TA
ATGGCAGAGGCCGCCTTTTCGGAGGACGGGAAATATTATTCATTAATTACACAGGACGGTAGATTGAGAATATGGGACACAGAAACTAATGTCCTCAAACAGGAGTATACTCCGGACCTCCATTTAACATCACCCCCCTCGTGTTTACAGTGGATATCTGTCACCTCAAATACCAGTCCCACAAAAGGTGTCAGAAGAAAATCATTCAGTGGAGTCGAAACACAATGTATAGCATTAGGAACAGCTAGCGGAAAAATACTTATATACTCAGTGAGCGAGGCGCAAATAGAGACAGTTTTAACAGACAAAAACAACGCTATTCACAGCAAAGTACAAAGTTTGGATTGGCATAGAAAATACGGTCTATTCAGTTGCACCAGTGACAGTCATGTGTGGGAATGGGACTTGCAAAGTTCTTGTGTTAGACATAAGTACAATATCCATGTATATAGTAAAAATAAACAGGGAAACAAGATCAGTGCTATCAGGATCGTACCACACAATCAGAACACTCCGGCAAAATTCCTCATATCTTCGTCGTGGCAGATACGTCTGTGGAGATTAATTGATTCCGAAGCCACTCTCATCAAATGTCTGGGGCACAATGCTGCTCCGCGAGCATTACTAACCGTTGCCACGGTTAATAATAGCAGTTGGTTAATCGAAGGCTCTCAAACGGAACGCCTCCTATCATTCTGGGATGTCACGATCACCGAGGAAAACTTGCCACAAGAAAACTCAGAGGAATCCACACCGTGTAAGAAACAAAGGAAGAAATCCTTGACCTCAGCTACCGTACCCGTCCCAACATATAACTTTGTTTTGGAAGACGCTCCCAGGATTATAGATGTGGAACTTAAGGTGGAGAACGGAGCCACGAGGTTGTCCCTGGCAGCGGCCACGAGGAGCGGGGTTGTACATTACTATGGCCATTTGCTCAATGGAGCGTCACACAAACCAGTGAAGCCGTCAGTGACTATCCAAGTCACGACCGAAGACGCTCAGCCGCTGCCCTTACAATGTGTCCAGCTACCGAAAACTGGTGACTTGCTGATTGGATACTGCAGTGGACCCGGTGTAGTGTTTGAGAAAGTCATACCAGATTTGAAGACAAAAACTCAGGTCCTTATACGAGGTGACAAGTCAAAGGAAGATAAGAATGAAGTTGTTCCGAAAGAAAATGATCAAATAACTTATGTTGAGCCGCTGGGTGGAGTTAGTAGGAAACGTGGCAATGTAGGAGGGAAGGTGGAGGTGACGATGGAATCCCGTTTGTCAAACCTGTCCCTGGATGTGAAGAGTCGGAGCAAAACAGCTGTTAACCAGAACTTAACCAAACTTTTAATACAAGGATTACACTCCAGAGATAAGAATTTAATACTAACTGTACTTCAACGTAACGACCCAGCGGTGGCCCACAACACGATATCATCCCTCCCGGTCAATTACATACCAGCTCTGTTGGAACAGTTAACAGATATGGCTACGAGGAAAACTAGCCAGTGTTCATCTGTTTGCACCTGGCTATGTGCTGTTCTTCAGTGTCACTCGGCATCTCTTCTGGCGACTTCGCCCCATCAGTCCGAACACCTCACAAAACTCCTGGCTATATTCACACAGAGACGCTCCAACCTGTGTCAGCTGCTGAGTCTGAAGGGCCGGCTGGACCTCACCTCGTGTAGGCGACTCGCTCCCGAAGGGGATTCGCATACGCTACTGCAATATGAAGACTCATCATCAGACGAAGACGTTCCTCCCGAGGGCTACGAGTCCGACAGCCGGCAGTCCTGGGACGACGGCGACACAGACTGA

Protein sequence:

>DPOGS205519-PA
MAEAAFSEDGKYYSLITQDGRLRIWDTETNVLKQEYTPDLHLTSPPSCLQWISVTSNTSPTKGVRRKSFSGVETQCIALGTASGKILIYSVSEAQIETVLTDKNNAIHSKVQSLDWHRKYGLFSCTSDSHVWEWDLQSSCVRHKYNIHVYSKNKQGNKISAIRIVPHNQNTPAKFLISSSWQIRLWRLIDSEATLIKCLGHNAAPRALLTVATVNNSSWLIEGSQTERLLSFWDVTITEENLPQENSEESTPCKKQRKKSLTSATVPVPTYNFVLEDAPRIIDVELKVENGATRLSLAAATRSGVVHYYGHLLNGASHKPVKPSVTIQVTTEDAQPLPLQCVQLPKTGDLLIGYCSGPGVVFEKVIPDLKTKTQVLIRGDKSKEDKNEVVPKENDQITYVEPLGGVSRKRGNVGGKVEVTMESRLSNLSLDVKSRSKTAVNQNLTKLLIQGLHSRDKNLILTVLQRNDPAVAHNTISSLPVNYIPALLEQLTDMATRKTSQCSSVCTWLCAVLQCHSASLLATSPHQSEHLTKLLAIFTQRRSNLCQLLSLKGRLDLTSCRRLAPEGDSHTLLQYEDSSSDEDVPPEGYESDSRQSWDDGDTD-