Monarch geneset OGS2.0

DPOGS208904
Transcript: DPOGS208904-TA  1056 bp
Protein: DPOGS208904-PA  351 aa
Genomic position: DPSCF300009 - 617702-622241
RNAseq coverage: 623x (Rank: top 21%)
Annotation
Heliconius: HMEL014675  0.0  90.31%
Bombyx: BGIBMGA002478-TA  1e-175  84.10%
Drosophila: CG11334-PB  3e-156  73.18%
EBI UniRef50: UniRef50_Q9V9X4  4e-154  73.18%  Methylthioribose-1-phosphate isomerase n=55 Tax=cellular organisms RepID=MTNA_DROME
NCBI RefSeq: NP_001037650.1  3e-174  84.39%  methylthioribose-1-phosphate isomerase [Bombyx mori]
NCBI nr blastp: gi|112983334  6e-173  84.39%  methylthioribose-1-phosphate isomerase [Bombyx mori]
NCBI nr blastx: gi|112983334  1e-165  84.39%  methylthioribose-1-phosphate isomerase [Bombyx mori]
Group
Gene Ontology: GO:0044237  1.5e-169  cellular metabolic process
    GO:0044249  2e-147  cellular biosynthetic process
KEGG pathway: drm:Dred_2064  1e-77
    K08963 (mtnA) maps-> Cysteine and methionine metabolism
InterPro domain: [1-344] IPR000649  1.5e-169  Initiation factor 2B-related
    [5-342] IPR005251  2e-147  Putative translation initiation factor, aIF-2BI/5-methylthioribose-1-phosphate isomerase
    [32-343] IPR011559  3.4e-110  Initiation factor 2B alpha/beta/delta
Orthology group: MCL14459  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208904-TA
ATGAGCTTAGAATCAATAAAATATGAGAGAGGTAAATTGGAAATTTTGGATCAGCTGTTACTGCCACTCCAAACTCGATATGTAAAAGTGCAAGGTGTTGAAGATGGCTGGAAAGTTATTAATAAAATGCAAGTCAGGGGAGCACCAGCAATAGCTATAGTTGGATGCCTATCATTGGCTATAGAGCTTATGAAAGATGACTGCTCCGACAAAAAAATTATGAGGCAAGAGATTGAAGGAAAATTAAATTATCTTGTGTCGGCGCGACCGACAGCTGTTAATATAAAATTAGCCGGTGATGAGCTTATTAATTTCGCAAATAGTTTGTGTGCGGATGAAAATGTAACCGCTGATGATTTCAGAGAGAGGTTTATAAGGAGCATAGAAAATATGTTGATAAAAGACATTGAGGACAACAAGGCTATTGGCAAGTATGGTTGTGAAGCTATACTCAAGACCATAGAGGGTGATGGTCATGTTAGAATTCTGACACATTGCAACACTGGGTCCCTAGCCACGGCTGGATATGGGACCGCATTGGGAGTTATAAGATCATTGAATGCTGCCAAGAAATTAGAACATGTATACTGTACAGAGACCCGTCCATACAATCAGGGAGCGAGGCTTACTGCGTATGAATTAGTTCATGAGAAAATACCCGCAACACTAGTTGTAGATAGCATGGTAGCAGCAATCATGCATGCAAGGAATATTAGTGCAGTGGTTGTTGGAGCTGATCGGGTCGCTGCCAATGGTGATACTGCAAATAAAATAGGAACATACCAAATAGCAATAGTGGCAAAACACCACAATGTGCCTTTTTATGTAGCAGCCCCGCTCACATCTATAGACATGTCATTGAAGTCAGGAGACAGGATCAAGATAGAGGAACGGCCGGATAGAGAAATGACACATATTGGTGAACATAGAATTGCTGCTCCAGGTATAAATTGTTGGAATCCCTCATTCGATGTGACGCCAGCGGCACTTATTACAGGAATTATTACTGAGAAAGGCGTCTTCGCCCCGGATAAATTATATGAAGCAGTTCCCTAA

Protein sequence:

>DPOGS208904-PA
MSLESIKYERGKLEILDQLLLPLQTRYVKVQGVEDGWKVINKMQVRGAPAIAIVGCLSLAIELMKDDCSDKKIMRQEIEGKLNYLVSARPTAVNIKLAGDELINFANSLCADENVTADDFRERFIRSIENMLIKDIEDNKAIGKYGCEAILKTIEGDGHVRILTHCNTGSLATAGYGTALGVIRSLNAAKKLEHVYCTETRPYNQGARLTAYELVHEKIPATLVVDSMVAAIMHARNISAVVVGADRVAANGDTANKIGTYQIAIVAKHHNVPFYVAAPLTSIDMSLKSGDRIKIEERPDREMTHIGEHRIAAPGINCWNPSFDVTPAALITGIITEKGVFAPDKLYEAVP-
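
The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, assuming the transcript is a complete coding sequence translated with the standard genetic code and that the trailing "-" in the protein entry marks the stop codon. The function names `translate` and `lengths_consistent` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code in TCAG order (assumption: this nuclear gene
# uses the standard code). "*" marks a stop codon in this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`, matching the "-" that
    ends the protein entry in this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def lengths_consistent(transcript_bp, protein_aa):
    """Check that a CDS length equals 3 x (residues + one stop codon)."""
    return transcript_bp == 3 * (protein_aa + 1)


if __name__ == "__main__":
    # First 12 codons and last 4 codons of DPOGS208904-TA.
    print(translate("ATGAGCTTAGAATCAATAAAATATGAGAGAGGTAAA"))
    print(translate("GCAGTTCCCTAA"))
    # 1056 bp transcript versus 351 aa protein, as listed in the record.
    print(lengths_consistent(1056, 351))
```

Passing the full DPOGS208904-TA sequence to `translate` and comparing the result with the DPOGS208904-PA entry would extend the same check to the whole record.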