Monarch geneset OGS2.0

DPOGS214941
TranscriptDPOGS214941-TA1218 bp
ProteinDPOGS214941-PA405 aa
Genomic positionDPSCF300280 - 123365-125446
RNAseq coverage1889x (Rank: top 7%)
Annotation
HeliconiusHMEL0155940.099.75% 
BombyxBGIBMGA004822-TA0.099.01% 
DrosophilaeIF4AIII-PA0.094.90% 
EBI UniRef50UniRef50_P389190.086.67%Eukaryotic initiation factor 4A-III n=170 Tax=root RepID=IF4A3_HUMAN
NCBI RefSeqNP_001106217.10.099.01%eukaryotic initiation factor 4A-III [Bombyx mori]
NCBI nr blastpgi|1638386740.099.01%eukaryotic initiation factor 4A-III [Bombyx mori]
NCBI nr blastxgi|1638386740.099.01%eukaryotic initiation factor 4A-III [Bombyx mori]
Group
Gene OntologyGO:00055242.6e-43ATP binding
GO:00080262.6e-43ATP-dependent helicase activity
GO:00036762.6e-43nucleic acid binding
GO:00043865.9e-35helicase activity
KEGG pathwayaag:AaeL_AAEL0144140.0 
 K13025 (eIF-4A3, EIF4A3)maps-> Spliceosome
InterPro domain[51-248] IPR0140012.1e-59DEAD-like helicase
[57-219] IPR0115452.6e-43DNA/RNA helicase, DEAD/DEAH box type, N-terminal
[285-366] IPR0016505.9e-35Helicase, C-terminal
Orthology groupMCL12174 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214941-TA
ATGACGGCATCGGAAGTTTCATATAATCGTAAAATAATATCAGAGGATTTATCTAATGTTGAGTTCGACACCAGCGAAGATGTGGAAGTTATCCCGACATTTGATTCCATGGGCCTTCGAGATGAATTATTACGCGGCATTTATACTTACGGTTTCGAGAAACCTTCCGCCATTCAGCAGAGAAGTATACTACCTATTGTGAAGGGCCGCGATGTTATAGCTCAGGCTCAGTCCGGTACGGGCAAAACAGCTACATTTTCCATATCCATACTTCAGTCCTTGGATACAACTCTTCGTGAAACACAGGTCCTGATTTTGTCCCCTACTCGAGAATTGGCCACACAGATTCAAAAGGTTATTCTGGCTCTGGGAGACTTTATGAATGTACAGTGCCATGCATGTATTGGTGGTACTAATCTTGGGGAAGATATCAGAAAACTTGATTATGGGCAGCATGTTGTATCAGGAACACCCGGCAGAGTTTTCGATATGATCAGAAGGAGAGTGCTCCGTACAAGGTCTATTAAGATGCTAGTCCTCGATGAAGCTGACGAAATGTTGAATAAAGGATTCAAGGAACAAATTTATGATGTCTACCGTTATCTACCTCCGGCTACACAAGTTGTTCTTATATCCGCAACTCTACCCCATGAGATTCTGGAAATGACATCAAAGTTTATGACCGATCCTATTAGGATTCTTGTAAAACGTGATGAGTTGACATTGGAAGGTATTAAACAGTTCTTTGTTGCCGTGGAAAGAGAGGAGTGGAAATTTGATACTCTCTGTGACTTGTATGACACATTAACAATCACTCAGGCGGTGATCTTCTGTAACACTAAAAGAAAGGTTGACTGGCTCACTCAGAAAATGCAGGAAGCCAACTTCACAGTGAGCTCCATGCATGGAGATATGCCTCAAAAGGAAAGAGACAATATCATGAAGGAGTTCCGTTCAGGACAAAGCCGGGTGCTTATAACAACAGATGTATGGGCCAGGGGTATAGATGTACAGCAGGTGTCCCTAGTCATCAACTATGACTTGCCAAATAATCGTGAATTGTACATTCATAGAATCGGCAGATCAGGTCGTTTCGGTCGTAAGGGTGTTGCGATTAACTTTGTGAAGTCCGACGACATAAGGATCCTTAGAGACATAGAGCAGTACTACTCCACACAGATTGACGAGATGCCAATGAATGTTGCTGACTTAATATAA

Protein sequence:

>DPOGS214941-PA
MTASEVSYNRKIISEDLSNVEFDTSEDVEVIPTFDSMGLRDELLRGIYTYGFEKPSAIQQRSILPIVKGRDVIAQAQSGTGKTATFSISILQSLDTTLRETQVLILSPTRELATQIQKVILALGDFMNVQCHACIGGTNLGEDIRKLDYGQHVVSGTPGRVFDMIRRRVLRTRSIKMLVLDEADEMLNKGFKEQIYDVYRYLPPATQVVLISATLPHEILEMTSKFMTDPIRILVKRDELTLEGIKQFFVAVEREEWKFDTLCDLYDTLTITQAVIFCNTKRKVDWLTQKMQEANFTVSSMHGDMPQKERDNIMKEFRSGQSRVLITTDVWARGIDVQQVSLVINYDLPNNRELYIHRIGRSGRFGRKGVAINFVKSDDIRILRDIEQYYSTQIDEMPMNVADLI-