Monarch geneset OGS2.0

DPOGS207638
Transcript: DPOGS207638-TA; 1161 bp
Protein: DPOGS207638-PA; 386 aa
Genomic position: DPSCF300199 + 126107-129104
RNAseq coverage: 7241x (Rank: top 2%)
Annotation
Heliconius: HMEL011986; 5e-148; 70.73%
Bombyx: BGIBMGA006141-TA; 0.0; 81.35%
Drosophila: eIF-3p40-PB; 3e-132; 60.10%
EBI UniRef50: UniRef50_E0VY38; 9e-144; 67.73%; Eukaryotic translation initiation factor 3 subunit, putative n=10 Tax=Neoptera RepID=E0VY38_PEDHC
NCBI RefSeq: NP_001036848.1; 0.0; 81.35%; eukaryotic translation initiation factor 3 subunit H [Bombyx mori]
NCBI nr blastp: gi|112983906; 0.0; 81.35%; eukaryotic translation initiation factor 3 subunit H [Bombyx mori]
NCBI nr blastx: gi|112983906; 3e-169; 80.57%; eukaryotic translation initiation factor 3 subunit H [Bombyx mori]
Group
Gene Ontology: GO:0005515; 2.2e-20; protein binding
KEGG pathway 
InterPro domain: [20-152] IPR000555; 2.2e-20; Mov34/MPN/PAD-1
Orthology group: MCL15032; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207638-TA
ATGGCGAGTCGCGGTGGCCCATCTAGACGCGTTCCTGAAAATGAAGATGCCATTACATACGTTCAGTGTGATGGACTGGCGGTGATGAAAATAGTGAAACACTGTCATGAAGAATCATGCAGTAATATGGAAGTGGCTCAGGGTGCCCTACTCGGGCTTGTGGTTGAAAATCGATTGGAGATAACCAATTGCTTTCCATTCCCCAAACATGATGATACGATGGACGAGGAGGAGTATCAGCTCGACATGATGCGGAGATTACGTAGAGTTAACGTTGATCACTTCCATGTTGGATGGTACCAGAGTGCAGACGTGGGTAACTTCTTAAGTGAATCACTGTTAGAGTCCCAGTATCACTATCAGACATCCATCGAGGAAAGTGTTGTTGTTATTTACGACACTAAGAAGTCCGCTAGAGGCTTCTTGACTTTGAAAGCTTATCGTCTTACTCCTCAGGCGATTGCCATGTACAAAGAGAAGGATTACACGCCAGAGGCATTGCGTAACCTTAAAATAGGTTATGAGAACCTGTTCATTGAGGTTCCCATTGTGATCAGGAATTCACCGCTCACTAACATTATGATGTCCGAGATATCCGAGATGATCCCAGAAGAAGAAGGATCTAAGTTCCTTGATTTAGGAACAGCTTCCGTGCTTGAAGGGATTATACTGTCTCTTGAGTTTCACAAACTGCCTTCTGATACACCACTAATAGCTAGTGAAAAATATATGCTGTTACAGAAGACAGTTGGTTGTAACATAGAAAGCAGTCATAGACAGAACATGCGAAGATCTGGTACGACAGTTGGACAACTCCGAAGTCTAATGGAGCGTGTGGACGAATTGAACCAGGAAGCAATAAAGTTCAACCGTTACCAATTATCAGTTGTCCGTCAACAGCAAGAGAAGCATCGTTGGCTCTTGAAGCGGGCTCAGGAGAACGCTGCTCGAGCTGCCAAGGATGAAGCGCCTCTACCGGAAGAGGATGTGAACAAGCTCTTCAAACCCCTGCCGGTACCACCAAGGATGGTCCCAATGATAGTGGCCGGCCAAATCAACACTTACAGCCAACACATCAGCCAATTCTGCTCCCAAAGCCTGGCGAAACTGTATCTGACACAAGCATTACAGAACGCTAAGGAATCCAAGCAGAACAACTAA

Protein sequence:

>DPOGS207638-PA
MASRGGPSRRVPENEDAITYVQCDGLAVMKIVKHCHEESCSNMEVAQGALLGLVVENRLEITNCFPFPKHDDTMDEEEYQLDMMRRLRRVNVDHFHVGWYQSADVGNFLSESLLESQYHYQTSIEESVVVIYDTKKSARGFLTLKAYRLTPQAIAMYKEKDYTPEALRNLKIGYENLFIEVPIVIRNSPLTNIMMSEISEMIPEEEGSKFLDLGTASVLEGIILSLEFHKLPSDTPLIASEKYMLLQKTVGCNIESSHRQNMRRSGTTVGQLRSLMERVDELNQEAIKFNRYQLSVVRQQQEKHRWLLKRAQENAARAAKDEAPLPEEDVNKLFKPLPVPPRMVPMIVAGQINTYSQHISQFCSQSLAKLYLTQALQNAKESKQNN-