Monarch geneset OGS2.0

DPOGS200141
Transcript: DPOGS200141-TA, 975 bp
Protein: DPOGS200141-PA, 324 aa
Genomic position: DPSCF300128 - 363976-365100
RNAseq coverage: 562x (Rank: top 22%)
Annotation
Heliconius       HMEL007555         2e-120  67.21%
Bombyx           BGIBMGA014593-TA   2e-131  68.77%
Drosophila       CG4036-PA          7e-79   45.81%
EBI UniRef50     UniRef50_Q7PLZ9    2e-90   51.89%  AGAP009588-PA n=13 Tax=Pancrustacea RepID=Q7PLZ9_ANOGA
NCBI RefSeq      XP_623503.1        1e-93   50.33%  PREDICTED: similar to CG4036-PA [Apis mellifera]
NCBI nr blastp   gi|332028068       5e-98   53.57%  Alkylated DNA repair protein alkB-like protein 4 [Acromyrmex echinatior]
NCBI nr blastx   gi|332028068       1e-99   53.57%  Alkylated DNA repair protein alkB-like protein 4 [Acromyrmex echinatior]
Group
Gene Ontology    GO:0016706   3.8e-05  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors
                 GO:0055114   3.8e-05  oxidation-reduction process
                 GO:0016491   3.8e-05  oxidoreductase activity
KEGG pathway 
Orthology group: MCL14251, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200141-TA
ATGAAGCCTCGCCCTTGTGGCTGTAAAGGCTGCAGAACGTGTCTCATTTGTGAAACTTATTATGGGGCCGAAGAACTTAAGAATCTGGTAAAGCTTGACAAAGACAAAGGTTATGTATTTTGCCCATTCTGTAACAAAGCTTGGAGTGGCTGGGATATAGACGTATATAAACAACATCCATATCACGAGGGCGAATCGATTGACTACCCAGGTGTTTATATAAAATTAGATTTTATATCTGAATATGAAGAAACAGAACTAATGAGAAACATAGATGAAGTGCCGTGGGATATATCGCAGAGCGGCCGGCGTAAACAGAACTATGGACCAAAAACTAATTTTAAAAAGAAAAAAATCGTTCCTGGACAATTCAATGGGTTTCCTAAGTTTTCCCAGTATTTGCAGGACAGATTCAAAACTTTCGATATATTAAAAGGTTATGAAGTCATTGAGCAGTGTTCTCTTGAGTATGATCCTATGAAGGGGGCGTCTATTGACCCTCATATCGATGACTGCTGGGTCTGGGGTGAGAGAATACTCACAGTTAACTGTTTGTCAGACTCCGTTCTTACAATGACTCCATTCAAAGGCGACACAATAAAGTATAATCTATATTGTGCAAAGGAGTATCCTCCTGTTGTCCAGGACGATGGCACTGTCGATATGGATTTTACGAGCAAACAAAACATGTTTGAAGCTAGTAAACCAAAAGAGGATTTAGATGTAATAATAAGGATTCCGTTGATAAGGCGATCTCTACTAATAATCTACGGTGAATCGAGGTACCATTGGGAGCACCGGGTTTTACGTGAGGACATTGTGTCAAGAAGAGTTTGTATCGCATATCGCGAGTTCACTCCACCTTTCATGAACAACGGTGTCCATGAGATCCTCGGCAGAGAAATCCGAGACCGAGCAAAGTTGTTCTGGGACCACAGACAGAGATACAAGGAGCAGATCAAATGCGTACAATAA

Protein sequence:

>DPOGS200141-PA
MKPRPCGCKGCRTCLICETYYGAEELKNLVKLDKDKGYVFCPFCNKAWSGWDIDVYKQHPYHEGESIDYPGVYIKLDFISEYEETELMRNIDEVPWDISQSGRRKQNYGPKTNFKKKKIVPGQFNGFPKFSQYLQDRFKTFDILKGYEVIEQCSLEYDPMKGASIDPHIDDCWVWGERILTVNCLSDSVLTMTPFKGDTIKYNLYCAKEYPPVVQDDGTVDMDFTSKQNMFEASKPKEDLDVIIRIPLIRRSLLIIYGESRYHWEHRVLREDIVSRRVCIAYREFTPPFMNNGVHEILGREIRDRAKLFWDHRQRYKEQIKCVQ-
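
The transcript and protein entries above are related by translation: 975 bp is 325 codons, that is, 324 amino acids plus a stop codon. The following is a minimal Python sketch of that check and is not part of the record. It assumes the standard genetic code; the function name `translate` is hypothetical, and `-` is used for the stop codon only to mirror the trailing `-` in the protein sequence above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest, bases in TCAG order).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as `stop_symbol`; "-" is an assumption chosen
    to match the convention seen in the protein sequence of this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE[cds[i:i + 3]].replace("*", stop_symbol)
        for i in range(0, len(cds), 3)
    )


# First five and last three codons of DPOGS200141-TA, copied from the record.
print(translate("ATGAAGCCTCGCCCT"))  # start of the transcript
print(translate("GTACAATAA"))        # end of the transcript, including the stop
```

Passing the full DPOGS200141-TA sequence to `translate` and comparing the result with DPOGS200141-PA would be the complete form of this check.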