Monarch geneset OGS2.0

DPOGS205616
Transcript: DPOGS205616-TA (1161 bp)
Protein: DPOGS205616-PA (386 aa)
Genomic position: DPSCF300023 - 997942-1008064
RNAseq coverage: 1132x (Rank: top 11%)
Annotation
Heliconius       HMEL007347         1e-103   74.43%
Bombyx           BGIBMGA001129-TA   1e-96    93.75%
Drosophila       sd-PF              9e-146   65.40%
EBI UniRef50     UniRef50_D6WAX5    3e-164   74.56%   Putative uncharacterized protein n=5 Tax=Coelomata RepID=D6WAX5_TRICA
NCBI RefSeq      XP_001601555.1     0.0      80.20%   PREDICTED: similar to Transcriptional enhancer factor TEF-1 (TEA domain family member 1) (TEAD-1) (Protein GT-IIC) (Transcription factor 13) (NTEF-1) [Nasonia vitripennis]
NCBI nr blastp   gi|350420343       2e-180   77.32%   PREDICTED: transcriptional enhancer factor TEF-1-like [Bombus impatiens]
NCBI nr blastx   gi|363987164       2e-175   77.18%   scalloped [Thermobia domestica]
Group
Gene Ontology:
  GO:0005634   1.6e-154   nucleus
  GO:0006355   1.6e-154   regulation of transcription, DNA-dependent
  GO:0003700   1.6e-154   sequence-specific DNA binding transcription factor activity
KEGG pathway:
InterPro domain:
  [5-386]   IPR000818   0          TEA/ATTS
  [1-386]   IPR016361   5.5e-233   Transcriptional enhancer factor
Orthology group: MCL10563 (Multiple-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205616-TA
ATGTCAGCTGCGGATGCGGAGGGAGTCTGGAGTCCTGACATCGAGCAAAGCTTCCAAGAGGCGCTGGCGATATATCCGCCATGCGGAAGGCGCAAAATCATACTCTCCGATGAGGGCAAGATGTACGGTCGTAACGAGCTGATAGCGAGATATATTAAACTAAGGACGGGAAAGACGCGCACAAGGAAACAGGTCTCCTCGCATATACAGGTGTTAGCGAGGCGGAAACTCAGAGAGATACAGGCCAAACTGAAAGTGGACGGTGGTATGAAAGACAAGGCGATGCAGTCAATGAGCACGCTGTCTAGTGCACAGATAGTGGCTGGATTGCCGCATCCGGCGTACCATCACACGCAATTCTGGCAACCGGGACTACAAGCGGGAACGTCACAAGATGTGAAGCCTTTCCCTGGTGCGTCGTATAAGGGTGTGGGGGTGGGGCCCGTGGGGGTGTCTGGAGGTGCGGAGGTGGCGCCGCCGCCATGGGAGGGTAGAGCCATAGCCACACACAAGCTAAGACTAGTTGAATTCTCAGCGTTCGTGGAACACCCAAGAGATCCTGATACGTTCCCTCCCAGTGCTGGAGCACAACACCTGTTCGTTCATATCGGAGGAACCGTGACGTATGCGGACCCACTACTGGAGTCGGTGGATGTGCAGCAAATAAACGATAAATTCCCAGAGAAGAAAGGCGGTTTGAAGGAATTGTATGAGAAGGGACCAAGGAACGCTTTCTTCCTTGTGAAGTTCTGGGCTGACCTTAACACCAACAACTTGGATGATCCGGGGGCGTTCTATGGGGTCACCAGTGTTTACGAGAGCAACGAGAACATGACTATAACCTGCAGCACCAAGGTGTGTTCGTTCGGCAAGCAGGTGGTGGAGAAGGTGGAGACGGAGTACGCGAGGTTCGAGGGCGGGCGGTTCGTGTACCGCATCACGCGCTCGCCCATGTGCGAGTACATGGTCAACTTCATACACAAGCTGAAGCACCTGCCCGAGAAGTACATGATGAACAGCGTGCTGGAGAACTTCACCATACTACAGGTGGTGTCAAACCGCGACACTCAAGAGACGTTGTTATGTGCCGCGTTCGTGTTCGAGGTGTCCAACAGTGAGCACGGGGCGCAGCATCACATCTACCGGCTCGTCAAAGACTGA

Protein sequence:

>DPOGS205616-PA
MSAADAEGVWSPDIEQSFQEALAIYPPCGRRKIILSDEGKMYGRNELIARYIKLRTGKTRTRKQVSSHIQVLARRKLREIQAKLKVDGGMKDKAMQSMSTLSSAQIVAGLPHPAYHHTQFWQPGLQAGTSQDVKPFPGASYKGVGVGPVGVSGGAEVAPPPWEGRAIATHKLRLVEFSAFVEHPRDPDTFPPSAGAQHLFVHIGGTVTYADPLLESVDVQQINDKFPEKKGGLKELYEKGPRNAFFLVKFWADLNTNNLDDPGAFYGVTSVYESNENMTITCSTKVCSFGKQVVEKVETEYARFEGGRFVYRITRSPMCEYMVNFIHKLKHLPEKYMMNSVLENFTILQVVSNRDTQETLLCAAFVFEVSNSEHGAQHHIYRLVKD-