Monarch geneset OGS2.0

DPOGS206702
Transcript: DPOGS206702-TA, 1278 bp
Protein: DPOGS206702-PA, 425 aa
Genomic position: DPSCF300048 + 1732791-1735480
RNAseq coverage: 1x (Rank: top 93%)
Annotation
Heliconius: HMEL017313  7e-86  57.14%
Bombyx: BGIBMGA008533-TA  7e-164  67.00%
Drosophila: WDR79-PA  8e-84  40.63%
EBI UniRef50: UniRef50_Q16ZK8  7e-100  44.54%  Putative uncharacterized protein n=2 Tax=Culicinae RepID=Q16ZK8_AEDAE
NCBI RefSeq: XP_001658989.1  1e-100  44.54%  hypothetical protein AaeL_AAEL008174 [Aedes aegypti]
NCBI nr blastp: gi|157118062  2e-99  44.54%  hypothetical protein AaeL_AAEL008174 [Aedes aegypti]
NCBI nr blastx: gi|157118062  6e-100  44.42%  hypothetical protein AaeL_AAEL008174 [Aedes aegypti]
Group
Gene Ontology: GO:0005515  1.5e-34  protein binding
KEGG pathway: phu:Phum_PHUM362800  6e-11
  K03130 (TFIID4, TAF5) maps-> Basal transcription factors
InterPro domain: [57-372] IPR011046  1.5e-34  WD40 repeat-like-containing domain
  [61-373] IPR015943  1.6e-30  WD40/YVTN repeat-like-containing domain
  [240-280] IPR001680  5.2e-06  WD40 repeat
  [244-280] IPR019781  6e-06  WD40 repeat, subgroup
Orthology group: MCL11852  Single-copy universal gene

Nucleotide sequence:

>DPOGS206702-TA
ATGGAAGAAGACTTTATATTAGAGAATAATGACGAAAACTGTGAAACTTCTCACATGGATACAGAAGAAACAGATATTTTTACGTATCCTACTCTGTTTAGTAGCAAATCCTTAGTAGAATTGTGCAACTCTTCTTGGTCGAGATCTCAAAACGCGAAACAGAATGTACAGCCCTATTTAAGAGGATGTAAATGGTCTCCAGACGGTACTTGCTGTCTCACAGTGGTTAATAATGACGGGGTTCACGTAACAGAGCTACCAAGAGATCTTTATTCTGGATCCATATCCCCTGATCGAACAATTAATATATTGGATTCTGTTATTCACGTCAAAGAGGCAGGTCTTGTCTATGATTTCTGTTGGTATCCTGGTATGAACAGTAGCATACCTGAGACTTGCTGTTGGTTGACCACTCGTCAGAATGCACCACTGCAGTTTTGGGATGCTTTTGACGGTTCTTTGAGATGTTCATACAGAGGCTTCAATGCAGTGGATGAAATGGAGCCAGCACTCTCGGTCACATTTAATAGTGAGGGAGACAGAATTGTAGCTGGATATAAGAAATATTTGAGAACTTTTGACGTCGAAAGACCAGGAAGAGATTTTGCAGAGCACAAGATCAATTCACCGGCTTCTTGTTTTGCTACACATGATAATCTATTAGCTATGGGCTCATGGAATACAACTATAACTTTATACAATACCAGTGAATTTGGAACATATAAGAGTATTGGGAAAATGCATGGCCACTCAGGGGGCGTCACTCACTTGAAGTTTACTCAAGATGGTCAAAAATTAGTGTCGGGAGCGAGAAAGGATCACAGGCTACTCATTTGGGATATTCGTTATTATCAAAGGCCGCTGAATGTATTAAGTAGAGTCGTTGACACAAACCAAAGGATATATTTTGATATATCACCATGCGGTAAATATTTGGTTACCGGAGGTACGGATGGTGTGATAAAAGTATGGGATGCGGATAACATTGATTGGATTAATAGATTAGATGCTACCGATGACAAAGATAATGCTACATACAGGTTTCCGTTGCATAAAGACTGCTGCAATAGCCTGTCAATACATCCGTTAAGACCAATATTAGCTACCGGCTCCGGTCAATATCATTTCGAGGATCCTGCCCAGGATTTGGAAGAGAATTTCGGAGTACAGGAAGACATTACTGAGACTGATAAAGGGTTAAGAAATGATAAATACTCAAAAATAGCTGAAAATAGCTTAGGGTTTTGGTGGATCGGGGATATTCCTCAAGTAACTTAG

Protein sequence:

>DPOGS206702-PA
MEEDFILENNDENCETSHMDTEETDIFTYPTLFSSKSLVELCNSSWSRSQNAKQNVQPYLRGCKWSPDGTCCLTVVNNDGVHVTELPRDLYSGSISPDRTINILDSVIHVKEAGLVYDFCWYPGMNSSIPETCCWLTTRQNAPLQFWDAFDGSLRCSYRGFNAVDEMEPALSVTFNSEGDRIVAGYKKYLRTFDVERPGRDFAEHKINSPASCFATHDNLLAMGSWNTTITLYNTSEFGTYKSIGKMHGHSGGVTHLKFTQDGQKLVSGARKDHRLLIWDIRYYQRPLNVLSRVVDTNQRIYFDISPCGKYLVTGGTDGVIKVWDADNIDWINRLDATDDKDNATYRFPLHKDCCNSLSIHPLRPILATGSGQYHFEDPAQDLEENFGVQEDITETDKGLRNDKYSKIAENSLGFWWIGDIPQVT-
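
The transcript and protein lengths listed above can be cross-checked against each other: 1278 bp is 426 codons, which is 425 amino acids plus one stop codon. The following is a minimal Python sketch of that check. It assumes the standard genetic code (NCBI translation table 1) and follows this record's convention of writing the stop codon as "-". The names CODON_TABLE and translate are illustrative and are not part of the database. Only the first ten and last four codons of DPOGS206702-TA are used here, to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), built from the
# conventional TCAG ordering. "*" marks a stop codon in this string.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nt, stop_symbol="-"):
    """Translate a coding sequence in frame 1, codon by codon.

    Stop codons are written as `stop_symbol` ("-" in this record).
    Trailing bases that do not fill a whole codon are ignored.
    """
    nt = nt.upper()
    protein = []
    for pos in range(0, len(nt) - len(nt) % 3, 3):
        aa = CODON_TABLE[nt[pos:pos + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First ten codons and last four codons of DPOGS206702-TA, copied from the record.
first_codons = "ATGGAAGAAGACTTTATATTAGAGAATAAT"
last_codons = "CAAGTAACTTAG"

# Length check: transcript codons minus the stop codon.
transcript_bp = 1278
expected_aa = transcript_bp // 3 - 1
```

With these inputs, translate(first_codons) gives the start of DPOGS206702-PA, translate(last_codons) gives its end including the stop symbol, and expected_aa matches the listed protein length.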