Monarch geneset OGS2.0

DPOGS211868
TranscriptDPOGS211868-TA1116 bp
ProteinDPOGS211868-PA371 aa
Genomic positionDPSCF300011 - 980116-984358
RNAseq coverage343x (Rank: top 34%)
Annotation
HeliconiusHMEL0177736e-8355.97% 
BombyxBGIBMGA001239-TA7e-8956.90% 
DrosophilaCG6907-PA1e-3841.76% 
EBI UniRef50UniRef50_Q2F6A51e-12059.42%Elongation protein 4-like protein n=2 Tax=Obtectomera RepID=Q2F6A5_BOMMO
NCBI RefSeqNP_001040120.12e-12159.42%elongation protein 4-like protein [Bombyx mori]
NCBI nr blastpgi|1140530235e-12059.42%elongation protein 4-like protein [Bombyx mori]
NCBI nr blastxgi|1140530238e-11658.89%elongation protein 4-like protein [Bombyx mori]
Group
Gene OntologyGO:00063573.7e-95regulation of transcription from RNA polymerase II promoter
GO:00335883.7e-95Elongator holoenzyme complex
KEGG pathway 
InterPro domain[11-371] IPR0087283.7e-95Elongator complex protein 4
Orthology groupMCL15276 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211868-TA
ATGTCGAGTTTTCAAAAAGCAGTTCCTCGATCTAGTTTAAGTGGTACAAAATTAAAAAATAACATTCCCTATGTATCTTCCGGAATACCATCCTTGGACTTTATAGTCGGAGGTGGCCTACCTCTCGGTTCTATTTTCGCCGTAGAGGAAGATGTCTTAGGCAGTTATAGCAGAGTTCTCTCAAAGTATTTCCTAGCTGAAGGAGTTGTGTGTAAACATGGCTTATTTGTTGCTTCTGCAGATGAAAATCCCAAGTTAATTATCCAAGAGCTCCCCACACCCTGTACAGTACAAGAAGACAAAACAAATCAAGATGTTAACAGTGAGATGAAAATCGCCTGGAGGTATGAAGGTCTGGGCCAAGTGGAGTCATCGTTTGGAAGCAACACCAACTTCGGCCATCACTTTGATCTAAGCAGGCACATGGAGGAGCTGGCTCTTAGAGATGTTGATTTAACTTATTGTCACTTGAAACCGAACAATGGAAAGAGCAATGGTGACTTATATACTGTTATTTTATATATATGGTTTAGGAACCGTCTGTATCATGATCTGTTGAAAGAAATCGGAAAGGTTTTATCCAAAGAAGAATATCGGAGCGGTAGTAAAAACAAAAATATACTTCGGATCAGCATTCAGTGTCTGGGTTCGCCTGTGTGGTTGGCCACTGACTGTGACCATGACAGCAAGTACGGACAGGATCTCATTAAGTTGATATATAGCTTAAGGGTTCTGATCAGAGACACCAATGCAGTTGTCTTCATCACAATTCCCGAGCATCTGTTTGAAAATAATCAAATAATGAAGAGGCTTCTCTACTCTATCGACAACGCGGTGCGGATCGAGTCGTTCGCCGGTTCCAGTAAGGAGACCAACCCGGTGTACAGCGAGTACCACGGGCTGTTCCACATCAGCAAGTTGTCGGGGGTCAGCTCCCTGGTGGCGTTCGTTCCTCCCAGCCTGGACCTGGCCTTCAAGCTGAAGAGGAAGAAGTTTGTCATCGAAAAGTTGCACCTGCCGCCAGAACTCCAAGAGTCCAGCGAGCGCGAGCAGGACGACGTCACGGCCACGCCCGGCACCTGCGGCGGCTTCAAGAAGAAAGATATAGACTTCTAA

Protein sequence:

>DPOGS211868-PA
MSSFQKAVPRSSLSGTKLKNNIPYVSSGIPSLDFIVGGGLPLGSIFAVEEDVLGSYSRVLSKYFLAEGVVCKHGLFVASADENPKLIIQELPTPCTVQEDKTNQDVNSEMKIAWRYEGLGQVESSFGSNTNFGHHFDLSRHMEELALRDVDLTYCHLKPNNGKSNGDLYTVILYIWFRNRLYHDLLKEIGKVLSKEEYRSGSKNKNILRISIQCLGSPVWLATDCDHDSKYGQDLIKLIYSLRVLIRDTNAVVFITIPEHLFENNQIMKRLLYSIDNAVRIESFAGSSKETNPVYSEYHGLFHISKLSGVSSLVAFVPPSLDLAFKLKRKKFVIEKLHLPPELQESSEREQDDVTATPGTCGGFKKKDIDF-