Monarch geneset OGS2.0

DPOGS215541
Transcript        DPOGS215541-TA  1329 bp
Protein           DPOGS215541-PA  442 aa
Genomic position  DPSCF300129 - 254920-264811
RNAseq coverage   4042x (Rank: top 3%)
Annotation
Heliconius      HMEL011623        1e-110  60.93%
Bombyx          BGIBMGA002295-TA  2e-123  79.55%
Drosophila      growl-PA          7e-49   33.56%
EBI UniRef50    UniRef50_F4WP28   6e-80   40.49%  Methenyltetrahydrofolate synthetase domain-containing protein n=7 Tax=Formicidae RepID=F4WP28_ACREC
NCBI RefSeq     XP_395864.3       6e-83   38.20%  PREDICTED: similar to CG14648-PA, isoform A [Apis mellifera]
NCBI nr blastp  gi|350412922      1e-87   41.24%  PREDICTED: methenyltetrahydrofolate synthase domain-containing protein-like [Bombus impatiens]
NCBI nr blastx  gi|350412922      3e-84   40.49%  PREDICTED: methenyltetrahydrofolate synthase domain-containing protein-like [Bombus impatiens]
Group
Gene Ontology    GO:0005524  3.8e-27  ATP binding
                 GO:0030272  3.8e-27  5-formyltetrahydrofolate cyclo-ligase activity
                 GO:0009396  3.8e-27  folic acid-containing compound biosynthetic process
KEGG pathway     tca:654948  8e-78
                 K01934 (E6.3.3.2) maps-> One carbon pool by folate
InterPro domain  [24-206]  IPR002698  3.8e-27  5-formyltetrahydrofolate cyclo-ligase
                 [23-207]  IPR024185  2.5e-20  5-formyltetrahydrofolate cyclo-ligase-like domain
Orthology group  MCL12725  Single-copy universal gene

Nucleotide sequence:

>DPOGS215541-TA
ATGCAGAACGGAGACGAAACTCCGGCAGCTACCGAAACAGCAAAGAAGCCACTACCAGAGGAGGTAACGAAACAATCGTTCAGGCAGAAGATATGGCGTCATTTGGAGACCAACGGACTGGCAATGTTCCCACGACCGGTGTACAACAGGATACCGAACTTCAAGGGGGCTCTGGAAGCAGCGGCTAAGTTGGCAGAATTGGATGTGTTCAAGAATGCCAACACGGTGAAGGTCAATCCTGATAAGCCTCAAGAACCAGTCAGGGTGCTATGCCTGGAGAAGCACAAGACTCTATACGTGCCGGTTCCTCGTCTTCAGTCGGGCTTCCTGAACCGAATCGTCCTCCCGGAGGGTGAAGCTAAGCCGGGCACACTGAGGAAGGCCGTCTCAAGGAACGGAATGGAATCGTTCGGACAACCACTGACCATAGAAGATTCGGTCTCCTTGGACTTGGTCGTGATGGGATCCGTTGCTGTCTCCAAGGAGGGATATCGCATTGGAAAAGGAAAAGGGTACGGGGATCTAGAGTTTGGCCTGATGATGCACATGAAGGCTATCAAACCTAACACGCTGGTTGTGACAACTGTGCACGACTGTCAGGTGTTTGATACACTTCCAGCTGCACTGAAGCCAGGTGTGATAGAGACCCAGCGTATGAGCCAACGGCCAGTCAGTATATTGTGGCATCTGCTGTCACAGCGACGCCTGGAGATGATGCCAGTGCTGGGGCAACTCAGAGACATTGAGATGCTTGCTGGGCGTTCGTGTACTCTTCGTGAGGAGGACAGCGCTGGCGAGGAGGAACGAGCGAGGCCGCGCCGACAGAGAAGGAGGACCAGGAGCCATAAGAGTCATAGCGAGGGAGAGGGCAACACGACGGAGGGCGAGGACGGTAAAAATAATAAGCCTCGGCGTCCGCGGCGCCGCAGCACCAAGTCTCTGAGTAAGGACGGCGAGGGGAGGGAGGGGAAGGAGGGGAGGGAGGGGAAACCCAAACGACCACGACGTCCGAGACCGGTCATTGACTTCACTGTTAAGATTTCAAACATCAGTCCCAACACTCGTGTGCGTGACATCAAATCGGCTCTCTTCGAACGCGGTGTTAAACCGCACGTTATGATTTGGAAGGGTGAAGGGGACATCCCAGCAGCGAACATGGACAGTGTGCTAGCGGCTCTGGCTCAGATGTCTGTGGGTGGTTCAGGGGGAGCACCTGACGAGCGCGAGGAGAAGCCCCGCCTGCTGACAGTGGAGCCGGCGCCGCCGAGGCACGCGGCCGCACCCGCCGCCGCACCAGCCGCGGCTGAGGCGCCGCCCGCGGTACATTAA

Protein sequence:

>DPOGS215541-PA
MQNGDETPAATETAKKPLPEEVTKQSFRQKIWRHLETNGLAMFPRPVYNRIPNFKGALEAAAKLAELDVFKNANTVKVNPDKPQEPVRVLCLEKHKTLYVPVPRLQSGFLNRIVLPEGEAKPGTLRKAVSRNGMESFGQPLTIEDSVSLDLVVMGSVAVSKEGYRIGKGKGYGDLEFGLMMHMKAIKPNTLVVTTVHDCQVFDTLPAALKPGVIETQRMSQRPVSILWHLLSQRRLEMMPVLGQLRDIEMLAGRSCTLREEDSAGEEERARPRRQRRRTRSHKSHSEGEGNTTEGEDGKNNKPRRPRRRSTKSLSKDGEGREGKEGREGKPKRPRRPRPVIDFTVKISNISPNTRVRDIKSALFERGVKPHVMIWKGEGDIPAANMDSVLAALAQMSVGGSGGAPDEREEKPRLLTVEPAPPRHAAAPAAAPAAAEAPPAVH-
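
Length check:

The transcript and protein lengths listed in this record (1329 bp and 442 aa) can be cross-checked with a short calculation. The sketch below is illustrative only and is not part of the geneset. It assumes the transcript is a pure coding sequence with no UTRs that ends in a single stop codon, which matches the ATG start and TAA end of the nucleotide sequence above. The function name `expected_protein_length` is hypothetical.

```python
def expected_protein_length(transcript_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a coding sequence.

    Assumes the sequence is CDS only (no UTRs) and in frame from its first base.
    """
    if transcript_bp <= 0 or transcript_bp % 3 != 0:
        raise ValueError("coding sequence length must be a positive multiple of 3")
    codons = transcript_bp // 3
    # A terminal stop codon occupies one codon but contributes no residue.
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    # Lengths taken from the record: 1329 bp transcript, 442 aa protein.
    print(expected_protein_length(1329))  # prints 442
```

1329 bp gives 443 codons. Subtracting the stop codon leaves 442 residues, which agrees with the protein length reported for DPOGS215541-PA. The trailing "-" in the protein sequence marks that stop.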