Monarch geneset OGS2.0

DPOGS214331
Transcript: DPOGS214331-TA, 1074 bp
Protein: DPOGS214331-PA, 357 aa
Genomic position: DPSCF300020 - 558192-562147
RNAseq coverage: 321x (Rank: top 36%)
Annotation
Heliconius: HMEL007606, 7e-172, 80.56%
Bombyx: BGIBMGA003989-TA, 5e-147, 69.97%
Drosophila: CG10512-PA, 2e-111, 54.26%
EBI UniRef50: UniRef50_A7UUE0, 3e-111, 55.65%, AGAP006576-PA n=26 Tax=Eumetazoa RepID=A7UUE0_ANOGA
NCBI RefSeq: XP_316604.4, 4e-112, 55.65%, AGAP006576-PC [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|158296085, 8e-111, 55.65%, AGAP006576-PC [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|158296085, 9e-109, 55.65%, AGAP006576-PC [Anopheles gambiae str. PEST]
Group
Gene Ontology: GO:0008152, 5.5e-137, metabolic process
               GO:0055114, 5.5e-137, oxidation-reduction process
               GO:0016491, 5.5e-137, oxidoreductase activity
KEGG pathway: pab:PAB1791, 1e-55
              K05884 (E1.1.1.272) maps-> Cysteine and methionine metabolism
InterPro domain: [1-357] IPR003767, 5.5e-137, Malate/L-lactate dehydrogenase
Orthology group: MCL13445, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214331-TA
ATGGGTAAAGTCGCTACTACAGAGGCGCTTCGTTTTATGACTGATTGCTTAAAAGCTGCGGGAGCTCGTGCTTATCCTGCACACCAACAAGCAGAATTATTACTGGAAGCCGATCGACTTGGACATCCGAGCCACGGACTTAACAGATTAGAATATTACGTCAATGATATCTTAGGCGGTGGCTGTCAACCCAACAATCAGCCAAAGATCTTAAAGCAGAGCCCGTCGACGGCGTGGGTTGATGCCCAAAATGTTCTGGGGGCCACAGTCAGTCATTTTGCAATGGACATCGCTATTGAGAAAGTAAAACAGACAGGCGTCGGATGGGTAACTGTTAAAGCATCAAATCATAATGGCATGGCGGGTTTCTGGGCAAAAAAGGCTGCAGATCAGGGCTTAATTGGTATGGCATTCACAAATACATCCCCGCTACTAGCACCAACGCGAAGCAAAAAGTCTGCTCTCGGTACTAACCCGTTGTCAGTTGTGGCGCCCGGAGCTGATGGAGAGACCTTTTACCTGGACATGGCCACCACAGCTGTTGCCGTGGGAAAGATAGAAATGAAAAGGCGGAAAGGTGAGACATTGCCCAATGGTTGGGCTCAGGGTCCTGACGGCAAAGAAACACAAGACGCTGAACTGGCTTTCAACACGGGATGTTTGTTCCCGCTAGGTGGTAGGGAAGAAACCTCAGGCTACAAAGGCTACGGTTTGGCCGCCATGGTAGAACTCTTCTGCGGTATCTCGTCAGGTTCAAACTACGGGCACCACATCCGCTCGTGGTCCCACAGCGGCGAGGGCGGATCATCCAACATTGGACACTGCTTCGTTGCAGTCAACTTCGAGAACTTCGCGCCTGGCTTTAAAGACAGGCTTGCGGATTGTATGACGCATTGGAGGAATTTGGATCCGGCGGATGAAAAGCTGCCAGTGTTAGCGCCCGGGGACAAAGAGAAGGAAGCAGCACAACTGACTGATGCGAGCGGCACGGTCTCCTACGTCGAGCAGCAGATAAAGTCTAGCTCCGCTCTGGCTGAACGATTAAAAGTTACTCCCATGGAGTTATGCCGTTAA

Protein sequence:

>DPOGS214331-PA
MGKVATTEALRFMTDCLKAAGARAYPAHQQAELLLEADRLGHPSHGLNRLEYYVNDILGGGCQPNNQPKILKQSPSTAWVDAQNVLGATVSHFAMDIAIEKVKQTGVGWVTVKASNHNGMAGFWAKKAADQGLIGMAFTNTSPLLAPTRSKKSALGTNPLSVVAPGADGETFYLDMATTAVAVGKIEMKRRKGETLPNGWAQGPDGKETQDAELAFNTGCLFPLGGREETSGYKGYGLAAMVELFCGISSGSNYGHHIRSWSHSGEGGSSNIGHCFVAVNFENFAPGFKDRLADCMTHWRNLDPADEKLPVLAPGDKEKEAAQLTDASGTVSYVEQQIKSSSALAERLKVTPMELCR-