Monarch geneset OGS2.0

DPOGS209129
Transcript  DPOGS209129-TA  573 bp
Protein  DPOGS209129-PA  190 aa
Genomic position  DPSCF300061 - 1264916-1266983
RNAseq coverage  651x (Rank: top 20%)
Annotation
Heliconius  HMEL020206  2e-62  63.10%
Bombyx  BGIBMGA002071-TA  1e-61  62.72%
Drosophila  CG34424-PA  1e-25  36.81%
EBI UniRef50  UniRef50_UPI000224702E  1e-34  47.47%  UPI000224702E related cluster n=4 Tax=unknown RepID=UPI000224702E
NCBI RefSeq  XP_974831.2  3e-32  42.86%  PREDICTED: similar to 5,10-methenyltetrahydrofolate synthetase (5-formyltetrahydrofolate cyclo-ligase) [Tribolium castaneum]
NCBI nr blastp  gi|332374448  3e-34  50.99%  unknown [Dendroctonus ponderosae]
NCBI nr blastx  gi|312381375  5e-36  48.39%  hypothetical protein AND_06331 [Anopheles darlingi]
Group
Gene Ontology  GO:0005524  1.4e-48  ATP binding
               GO:0030272  1.4e-48  5-formyltetrahydrofolate cyclo-ligase activity
               GO:0009396  1.4e-48  folic acid-containing compound biosynthetic process
KEGG pathway  tca:663703  9e-32
              K01934 (E6.3.3.2)  maps-> One carbon pool by folate
InterPro domain  [16-168]  IPR002698  1.4e-48  5-formyltetrahydrofolate cyclo-ligase
                 [24-174]  IPR024185  4.5e-36  5-formyltetrahydrofolate cyclo-ligase-like domain
Orthology group  MCL11259  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209129-TA
ATGTTGTTACGGAATATCAGGGATTGTTATAGCAAAGTGTTCAGAATGAGTCGTGTACCAAACCCAGCGAAGGCTGCTATCAGAGAGGAGATCGAAAAGAGATTGGCAACTCTCACAAACGAGGAAAAGAAAAGACAGTCGGATATTGTTTTTAATAAGTTAATCAAGCATCCGTTTTATAAATCCGCGAATAGGATCTCGGTATTCATGAGCACGCCCACCGAGGTGGACACGGCGCCCATCATAGAGCACGTTAAGGGCAAGGGCGGGGAGGTGTTCGTGCCGCAGTACGCCGGCGGCGTCATGAAGATGTTGAAGCTGGAGACGGGAGACGAGCGCGACATGCCGGCCACGAGGCACGGCATACGACAACACGCGAAACACGCGCCCAGAGAAGACGCCTTGGAGAAAGGATTAGATCTGATCATAGCTCCTGGCGTCGCCTTCTCCCAGGACGGTGGTCGCGTGGGTCACGGAGGGGGATACTATGACAAATACATCAGCAACCTCAGATCCCACCCCGGAACAGCTCCTAAAGTGAGATCCAACGATTATTGTTGTTCAAATCAATAA

Protein sequence:

>DPOGS209129-PA
MLLRNIRDCYSKVFRMSRVPNPAKAAIREEIEKRLATLTNEEKKRQSDIVFNKLIKHPFYKSANRISVFMSTPTEVDTAPIIEHVKGKGGEVFVPQYAGGVMKMLKLETGDERDMPATRHGIRQHAKHAPREDALEKGLDLIIAPGVAFSQDGGRVGHGGGYYDKYISNLRSHPGTAPKVRSNDYCCSNQ-
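
The transcript and protein records above can be cross-checked against each other: 573 bp is 191 codons, i.e. 190 amino acids plus one stop codon. The following is a minimal sketch of such a check, assuming the standard genetic code and assuming that the trailing "-" in the protein record marks the stop codon. The function name `translate` and the `stop_symbol` parameter are illustrative and not part of the geneset itself.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to be "-",
    matching the last character of the protein record above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First six and last six codons of DPOGS209129-TA, copied from the record.
print(translate("ATGTTGTTACGGAATATC"))  # MLLRNI
print(translate("TGTTGTTCAAATCAATAA"))  # CCSNQ-

# Length check: 190 residues plus one stop codon account for 573 bp.
print(3 * (190 + 1) == 573)  # True
```

Passing the full nucleotide sequence to `translate` and comparing the result with the DPOGS209129-PA record would extend the same check to the whole gene.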