Monarch geneset OGS2.0

DPOGS213513
TranscriptDPOGS213513-TA2091 bp
ProteinDPOGS213513-PA696 aa
Genomic positionDPSCF300033 - 965024-970303
RNAseq coverage317x (Rank: top 36%)
Annotation
HeliconiusHMEL0077781e-14955.42% 
BombyxBGIBMGA011794-TA1e-12956.31% 
DrosophilaCG5018-PA1e-8731.70% 
EBI UniRef50UniRef50_D6WEU43e-12836.53%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WEU4_TRICA
NCBI RefSeqXP_974522.15e-12936.53%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|910784061e-12736.53%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|910784063e-12636.39%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene OntologyGO:00055152.6e-35protein binding
KEGG pathwayago:AGOS_AEL246C1e-08 
 K03130 (TFIID4, TAF5)maps-> Basal transcription factors
InterPro domain[7-312] IPR0110462.6e-35WD40 repeat-like-containing domain
[10-292] IPR0159432.9e-32WD40/YVTN repeat-like-containing domain
Orthology groupMCL14062 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213513-TA
ATGCCTTGTAAGCTTCATCGAGTGAGATATTATAATCCAAAACCGGTACAAATAAATTGTGTCTCATTTAATAAAAGCAGTAAAAGAATCGCATTGGCTAGACAGGACGCGTCTATAGAAATATGGGATTTAAATTTTGCACCTCTCCTTGTGCAATGCATTTCTGGCACGGAAGATACTTCGGTGGAAGCTCTGGGTTGGGTGCACGACAGGCTTTTATCAACAGGCCTTGGCGGTGCTCTTATAGAATGGGATCTTGAAAGTCTGACGATAAAATTTTCAGTAATGCTCACGGGTTACGCTGCGTGGTGTCTTGATGTTAACTCGGCAAACACTGTGGTAGCTGTTGGAACAGAGCAAGGCTACATCAACTTATACTCTGTAGAAAATAATGAAATTGTTTATAAAAAACTCTTTGATAAACAGGAAGGCAGGATTATGTGTTGTAAATTTGATAAAACCGGCAACACTCTGGTTACAGGCTCGGTGGACACCATAAGAGTTTGGAATGTAGAGATGGGATATGCAACTTGCAGGATTTCTGTTAACAGAAGGGGGAAAGAGACTATAGTGTGGTCATTAGCCGTACTGTCTGATAACACAGTTGTGTCCGGTGATAGTCATGGAAGACTTACATTCTGGGATGGCAATCTCGGAGATCAGATTGAATCCTACACAACCCACAAGGCTGACATTCTATCCATTGTTGTGTCTGATGATGAGAGGAGTCTATACTGCAGTGGAGTAGATCCGGTCATAACCAACTTTGTCAAAGTCAATAACAGCGCGGGCAAACAGACATGTGCTCGCTGGGTGAAAAATGTTCAGAGGAATATTCACGAACACGATGTAAGAGCCCTTGTTCTGAATGGAGAAAAATTATTATCAGTTGGAGCGGATGGATATTTGACGCTGTCAAGTTACCCGCCAAAGTGGGTGATGCGAATCCCACCCATGATACCAGCACCGAGATCGTGCGTCAGCGCTCGCAACAAGTTACTACTACTTAGATACAGCAATTACCTTGAAATATGGAAATTGGGCTCGTATGCCATCAACAAGAATGGGAATGTCACAGTGAATAGTGTCAACGTGGAGCCGAGTGTCAACTCAGGCAGCAACCAGTTGGAGCAAGATTCACAGTTCATCAGTCAAGTAGGAAAACACACTCAGAAACAGAGCTTGAAGCTGATAGAGCAGCCGACGCGCCTGGTCTGTATACAGACTAAGGGAAAAAAACAGATAAGATGGTGTGAGATGTCACCCAGTGGGGAATTGGTTGTGTATTCAACGGACAGCGATTTGAGAATGCTGAAGTTGGATTGTGACGATGATCAAAGTAACGTATCCCTCACCAAAGTATTCATAAACGGCATATCACACTGTGATCGTGTTGCCTTCACCGCTGACTCGAGGACGTTGGTGGCATACGGAGATGGGACAGTGTTCATACTGCAGGTTGACCCGGAATCTGGTGCCACGGTGGTGCAGACCTTACCATGCGACCAATATCTTAAGATGAAGTCTATTCTGCACCTTGTTGTATCGAAGGATGTTTCCAGACATATATATGTGGTCGTATCGGATACTCAGGGGAACATAGCTGTGTTTGTGAAGAACTATCTCAAGTTTGAGTTCCACGCTTCATTACCTAGATACCATTGTTTGCCATCGGCTATGTCAATTGCTGGAAAGACTCTCATCATTGCTTATGTGGATCAGAAGATAATACAGTACGATCTGTCTCGTAAAAAACTAAGCAAGACAAATTTCTATGACGTTCCAAGTTTGAGCAAGAGGACTTGGGCTATAACTGGAGTGACGTCACATCCCGTGAGACCAGCAACAATCTTTTATGATGAGACTTCATTATCGGTCATGGAGAAAACCTTGGAGAATGGATCCTACGAACCAGCGCCAAAGATGAAGAACAAAAACGATAACAAGTACCACGGATTAAAAATTATTCCCTTCAAGTATTTGGTTGGATTCCACTGGCTGGGTGACGACGAGGCTGTATCGTTGGAAGTCTTACCCGAGAATATTATATCACAGCTACCGCCGGTGTTGGCTATGAAAAGACATTCATGA

Protein sequence:

>DPOGS213513-PA
MPCKLHRVRYYNPKPVQINCVSFNKSSKRIALARQDASIEIWDLNFAPLLVQCISGTEDTSVEALGWVHDRLLSTGLGGALIEWDLESLTIKFSVMLTGYAAWCLDVNSANTVVAVGTEQGYINLYSVENNEIVYKKLFDKQEGRIMCCKFDKTGNTLVTGSVDTIRVWNVEMGYATCRISVNRRGKETIVWSLAVLSDNTVVSGDSHGRLTFWDGNLGDQIESYTTHKADILSIVVSDDERSLYCSGVDPVITNFVKVNNSAGKQTCARWVKNVQRNIHEHDVRALVLNGEKLLSVGADGYLTLSSYPPKWVMRIPPMIPAPRSCVSARNKLLLLRYSNYLEIWKLGSYAINKNGNVTVNSVNVEPSVNSGSNQLEQDSQFISQVGKHTQKQSLKLIEQPTRLVCIQTKGKKQIRWCEMSPSGELVVYSTDSDLRMLKLDCDDDQSNVSLTKVFINGISHCDRVAFTADSRTLVAYGDGTVFILQVDPESGATVVQTLPCDQYLKMKSILHLVVSKDVSRHIYVVVSDTQGNIAVFVKNYLKFEFHASLPRYHCLPSAMSIAGKTLIIAYVDQKIIQYDLSRKKLSKTNFYDVPSLSKRTWAITGVTSHPVRPATIFYDETSLSVMEKTLENGSYEPAPKMKNKNDNKYHGLKIIPFKYLVGFHWLGDDEAVSLEVLPENIISQLPPVLAMKRHS-