Monarch geneset OGS2.0

DPOGS206701
TranscriptDPOGS206701-TA1278 bp
ProteinDPOGS206701-PA425 aa
Genomic positionDPSCF300048 + 1722657-1725424
RNAseq coverage1x (Rank: top 93%)
Annotation
HeliconiusHMEL0173136e-8657.14% 
BombyxBGIBMGA008533-TA1e-16366.75% 
DrosophilaWDR79-PA2e-8340.39% 
EBI UniRef50UniRef50_Q16ZK81e-9946.68%Putative uncharacterized protein n=2 Tax=Culicinae RepID=Q16ZK8_AEDAE
NCBI RefSeqXP_001658989.13e-10046.68%hypothetical protein AaeL_AAEL008174 [Aedes aegypti]
NCBI nr blastpgi|1571180625e-9946.68%hypothetical protein AaeL_AAEL008174 [Aedes aegypti]
NCBI nr blastxgi|1571180625e-9946.55%hypothetical protein AaeL_AAEL008174 [Aedes aegypti]
Group
Gene OntologyGO:00055154.1e-34protein binding
KEGG pathwayphu:Phum_PHUM3628008e-11 
 K03130 (TFIID4, TAF5)maps-> Basal transcription factors
InterPro domain[57-372] IPR0110464.1e-34WD40 repeat-like-containing domain
[61-373] IPR0159433.6e-30WD40/YVTN repeat-like-containing domain
[240-280] IPR0016805.2e-06WD40 repeat
[244-280] IPR0197816e-06WD40 repeat, subgroup
Orthology groupMCL11852 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206701-TA
ATGGAAGAAAACTTTATATTAGAGAATAATGACGAAAACTGTGAAACTTCTCACATGGATACAGAAGAAACAGATATTTTTACGTATCCTACTCTGTTTAGTAGCAAATCCTTAGTAGAATTGTGCAACTCTTCTTGGTCGAGATCTCAAAACGCGAAACAGAATGTACAGCCCTATTTAAGAGGATGTAAATGGTCTCCAGACGGTACTTGCTGTCTCACAGTGGTTAATAATGACGGGGTTCACGTAACAGAGCTACCAAGAGATCTTTATTCTGGATCCATATCCCCTGATCGAACAATTAATATATTGGATTCTGTTATTCACGTCAAAGAGGCAGGTCTTGTCTATGATTTCTGTTGGTATCCTGGTATGAACAGTAGCATACCTGAGACTTGCTGTTGGTTGACCACTCGTCAGAATGCACCACTGCAATTTTGGGATGCTTTTGACGGTTCTTTGAGATGTTCATACAGAGGCTTCAATGCAGTGGATGAAATGGAGCCAGCACTCACGGTCACATTTAATAGTGAGGGAGATAGAATTGTAGCTGGATATAAGAAATATTTGAGAACTTTTGACGTCGAAAGACCAGGAAGAGATTTTGCTGAGCATAAGATCAATTCACCGGCTTCTTGTTTTGCTACACATGATAATCTATTAGCTATGGGCTCATGGAATACAACTATAACTTTATACAATACCAGTGAATTTGGAACATATAAGAGTATTGGGAAAATGCATGGCCACTCAGGGGGCGTCACTCACTTGAAGTTTACTCAAGATGGTCAAAAATTAGTGTCGGGAGCGAGAAAGGATCACAGGCTACTCATTTGGGATATTCGTTATTATCAAAGGCCGCTGAATGTATTAAGTAGAGTCGTTGACACAAACCAAAGGATATATTTTGATATATCACCATGCGGTAAATATTTGGTTACCGGAGGTACGGATGGTGTGATAAAAGTATGGGATGCGGATAACATTGATTGGATTAATAGATTAGATGCTACCGATGACAAAGATAATGCTACATACAGGTTTCCATTGCATAAAGACTGCTGCAATAGCCTGTCAATACATCCGTTAAGACCAATATTAGCTACCGGCTCCGGTCAATATCATTTCGAGGATCCTGCCCAGGATTTGGAAGAGAATTTCGGAGTACAGGAAGACATTACTGAGACTGATAAAGGGTTAAGAAATGATAAATACTCAAAAATAGCTGAAAATAGCTTAGGGTTTTGGTGGATCGGGGATATTCCTCAAGTAACTTAG

Protein sequence:

>DPOGS206701-PA
MEENFILENNDENCETSHMDTEETDIFTYPTLFSSKSLVELCNSSWSRSQNAKQNVQPYLRGCKWSPDGTCCLTVVNNDGVHVTELPRDLYSGSISPDRTINILDSVIHVKEAGLVYDFCWYPGMNSSIPETCCWLTTRQNAPLQFWDAFDGSLRCSYRGFNAVDEMEPALTVTFNSEGDRIVAGYKKYLRTFDVERPGRDFAEHKINSPASCFATHDNLLAMGSWNTTITLYNTSEFGTYKSIGKMHGHSGGVTHLKFTQDGQKLVSGARKDHRLLIWDIRYYQRPLNVLSRVVDTNQRIYFDISPCGKYLVTGGTDGVIKVWDADNIDWINRLDATDDKDNATYRFPLHKDCCNSLSIHPLRPILATGSGQYHFEDPAQDLEENFGVQEDITETDKGLRNDKYSKIAENSLGFWWIGDIPQVT-