Monarch geneset OGS2.0

DPOGS211201
Transcript: DPOGS211201-TA, 1266 bp
Protein: DPOGS211201-PA, 421 aa
Genomic position: DPSCF300007 + 851213-854530
RNAseq coverage: 22672x (Rank: top 0%)
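
The lengths and coordinates in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are not part of the database, and it assumes that the genomic coordinates are 1-based and inclusive and that the listed transcript is coding sequence only (including the stop codon). The record states neither assumption.

```python
# Values copied from the record; variable names are illustrative.
transcript_bp = 1266
protein_aa = 421
start, end = 851213, 854530  # assumed 1-based, inclusive

# Assumption: the transcript is coding sequence only, so its length
# should equal 3 * (number of residues + 1 stop codon).
codons = transcript_bp // 3
has_whole_codons = transcript_bp % 3 == 0
matches_protein = codons == protein_aa + 1

# Genomic span of the gene model under the inclusive-coordinate assumption.
genomic_span = end - start + 1
# Bases in the span that are not in the transcript (presumably intronic).
non_transcript_bp = genomic_span - transcript_bp

print(codons, has_whole_codons, matches_protein)
print(genomic_span, non_transcript_bp)
```

Under these assumptions the transcript length is consistent with a 421-residue protein plus one stop codon, and the genomic span is longer than the transcript.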
Annotation
Heliconius: HMEL012453, E-value 0.0, identity 95.97%
Bombyx: BGIBMGA003186-TA, E-value 0.0, identity 93.35%
Drosophila: eIF-4a-PA, E-value 0.0, identity 79.70%
EBI UniRef50: UniRef50_Q14240, E-value 5e-174, identity 74.14%, Eukaryotic initiation factor 4A-II n=630 Tax=root RepID=IF4A2_HUMAN
NCBI RefSeq: NP_001037376.1, E-value 0.0, identity 93.11%, eukaryotic translation initiation factor 4A [Bombyx mori]
NCBI nr blastp: gi|164683438, E-value 0.0, identity 93.84%, eukaryotic initiation factor 4A [Plutella xylostella]
NCBI nr blastx: gi|164683438, E-value 0.0, identity 93.84%, eukaryotic initiation factor 4A [Plutella xylostella]
Group
Gene Ontology: GO:0005524, E-value 2.5e-47, ATP binding
GO:0008026, E-value 2.5e-47, ATP-dependent helicase activity
GO:0003676, E-value 2.5e-47, nucleic acid binding
GO:0004386, E-value 2.2e-33, helicase activity
KEGG pathway 
InterPro domain: [67-264] IPR014001, E-value 1e-61, DEAD-like helicase
[73-236] IPR011545, E-value 2.5e-47, DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[301-382] IPR001650, E-value 2.2e-33, Helicase, C-terminal
Orthology group: MCL11315 Multiple-copy universal gene
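
The bracketed InterPro coordinates in the annotation can be turned into domain lengths and checked against the 421 aa protein. This is a minimal sketch, assuming the coordinates are 1-based, inclusive residue positions (the record does not say so). The dictionary and the helper `domain_length` are hypothetical names introduced only for illustration.

```python
# Domain coordinates copied from the InterPro rows; names are illustrative.
protein_length = 421
domains = {
    "IPR014001": (67, 264),   # DEAD-like helicase
    "IPR011545": (73, 236),   # DEAD/DEAH box type, N-terminal
    "IPR001650": (301, 382),  # Helicase, C-terminal
}

def domain_length(start, end):
    """Length of a domain, assuming 1-based inclusive residue coordinates."""
    return end - start + 1

lengths = {acc: domain_length(s, e) for acc, (s, e) in domains.items()}

# Every domain interval should fall inside the protein.
all_within = all(1 <= s <= e <= protein_length for s, e in domains.values())

# Check whether the IPR011545 interval lies inside the IPR014001 interval.
outer, inner = domains["IPR014001"], domains["IPR011545"]
nested = outer[0] <= inner[0] and inner[1] <= outer[1]

print(lengths, all_within, nested)
```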
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211201-TA
ATGTCCTATTCACCTGAAAGAAGATCGGAAGATTGGCCGGAGGATCCTAAGAATGGGCCGTCAAAGGATCAAGGCAGTTACGATGGACCTCCGGGTATGGAGCCCGGCGGCGCCCTCGACACAAACTGGCATGAAGTCGTGGAAAGCTTTGATGACATGAAGTTGAAAGAGGAATTGTTGCGAGGAATTTACGCTTACGGTTTTGAGAAACCTTCAGCCATCCAACAACGCGCAATCATGCCCTGCATCAAGGGTCGTGACGTTATAGCTCAAGCCCAGTCTGGTACTGGCAAAACTGCTACTTTCTCAATTTCGATTCTACAACATATTGATACTAGTGTTCGCGAGTGTCAAGCTTTGATTTTAGCTCCGACTCGGGAATTAGCTCAACAAATTCAGAAGGTTGTGATTGCCCTGGGTGATCATCTGAATGCTAAGTGCCATGCTTGTATCGGTGGTACTAATGTACGTGAGGATATACGTCAGCTGGAGAGTGGTGTCCATGTTGTTGTGGGCACCCCTGGTCGTGTATATGATATGATCACTCGTCGGGCACTCCGCGCTAACACGATTAAGCTGTTTGTGCTCGACGAGGCTGATGAAATGTTGTCAAGAGGATTTAAAGATCAAATCCATGATGTCTTCAAGATGTTGTCATCTGATGTTCAAGTCATTTTGCTATCTGCTACTATGCCTGATGATGTGTTAGAGGTCTCCCGCTGCTTTATGAGAGATCCTGTTCGTATTCTTGTGCAGAAGGAAGAGCTCACACTGGAAGGTATTAAACAGTTCTTCATCTCAATTGATATTGAAGATTGGAAATTGGACACACTCTGTGACTTGTATGATACTCTTTCTATCGCCCAAGCCGTCATCTTCTGTAACACACGTCGCAAGGTCGACTGGCTGACCGAGTCCATGCATCAGCGTGATTTCACGGTATCTGCCATGCACGGTGACATGGACCAGCGCGAGCGTGAAGTGATCATGCGTCAGTTCCGCACAGGCTCGTCCCGTGTACTGATCACAACCGATCTGTTGGCGCGAGGCATCGACGTGCAGCAAGTGTCCTGCGTCATCAACTACGATCTACCCACCAACCGAGAGAATTACATCCATCGTATCGGACGAGGTGGTCGTTTCGGTCGTAAGGGGATCGCCATCAACTTTGTGACTGAAGCTGACAAGAGGGCGCTGAAGGATATCGAGGAGTTTTACCATACTACTATTACGGAAATGCCCAACGATGTGGCTAACCTCATCTGA

Protein sequence:

>DPOGS211201-PA
MSYSPERRSEDWPEDPKNGPSKDQGSYDGPPGMEPGGALDTNWHEVVESFDDMKLKEELLRGIYAYGFEKPSAIQQRAIMPCIKGRDVIAQAQSGTGKTATFSISILQHIDTSVRECQALILAPTRELAQQIQKVVIALGDHLNAKCHACIGGTNVREDIRQLESGVHVVVGTPGRVYDMITRRALRANTIKLFVLDEADEMLSRGFKDQIHDVFKMLSSDVQVILLSATMPDDVLEVSRCFMRDPVRILVQKEELTLEGIKQFFISIDIEDWKLDTLCDLYDTLSIAQAVIFCNTRRKVDWLTESMHQRDFTVSAMHGDMDQREREVIMRQFRTGSSRVLITTDLLARGIDVQQVSCVINYDLPTNRENYIHRIGRGGRFGRKGIAINFVTEADKRALKDIEEFYHTTITEMPNDVANLI-
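
As a spot check that the protein sequence is the translation of the transcript, the sketch below translates the first 36 and the last 15 nucleotides of DPOGS211201-TA. It is not a general translator: the codon table is deliberately partial and contains only the codons that occur in these two excerpts (standard genetic code), and the stop codon is rendered as "-" to follow the convention used in this record. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Partial codon table (standard genetic code): only the codons that
# occur in the two excerpts below. The stop codon is written as "-".
CODON_TABLE = {
    "ATG": "M", "TCC": "S", "TAT": "Y", "TCA": "S", "CCT": "P",
    "GAA": "E", "AGA": "R", "TCG": "S", "GAT": "D", "TGG": "W",
    "GCT": "A", "AAC": "N", "CTC": "L", "ATC": "I", "TGA": "-",
}

def translate(nt):
    """Translate an in-frame nucleotide string, codon by codon."""
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt), 3))

# Excerpts copied from the start and the end of the transcript sequence.
first_36_nt = "ATGTCCTATTCACCTGAAAGAAGATCGGAAGATTGG"
last_15_nt = "GCTAACCTCATCTGA"

n_term = translate(first_36_nt)
c_term = translate(last_15_nt)
print(n_term, c_term)
```

The two translated fragments can be compared by eye with the beginning and the end of DPOGS211201-PA. For a whole-sequence check, a complete codon table or an established library would be the more appropriate tool.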