Monarch geneset OGS2.0

DPOGS204504
TranscriptDPOGS204504-TA2736 bp
ProteinDPOGS204504-PA911 aa
Genomic positionDPSCF300002 + 1727001-1733098
RNAseq coverage520x (Rank: top 24%)
Annotation
HeliconiusHMEL0130770.076.15% 
BombyxBGIBMGA007848-TA0.078.49% 
DrosophilaCG6385-PA0.049.83% 
EBI UniRef50UniRef50_A1ZAZ20.049.83%CG6385 n=27 Tax=cellular organisms RepID=A1ZAZ2_DROME
NCBI RefSeqXP_308638.40.056.31%AGAP007123-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582862370.056.31%AGAP007123-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582862370.056.31%AGAP007123-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00551143.1e-68oxidation-reduction process
GO:00164913.1e-68oxidoreductase activity
GO:00065461.8e-43glycine catabolic process
GO:00040471.8e-43aminomethyltransferase activity
GO:00057371.8e-43cytoplasm
KEGG pathwaydpo:Dpse_GA195550.0 
 K00314 (E1.5.99.1)maps-> Glycine, serine and threonine metabolism
InterPro domain[36-394] IPR0060763.1e-68FAD dependent oxidoreductase
[537-770] IPR0062221.8e-43Glycine cleavage T-protein, N-terminal
[775-870] IPR0139773.1e-14Glycine cleavage T-protein, C-terminal barrel
Orthology groupMCL12721 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204504-TA
ATGTTTAAAGTAATTAGAGATAGGTGTATAAGAACGAAAGCGGCGAGTTACTTGAAGAGCACAAACGGGAGAAAGTATTACAGCAATGAAGTAGTTACATCAGCCGATATTGTTGTCATAGGTGGTGGTATCGCGGGCTGTAATACTTTATATCAGTTGTCGAAAAGAGGAGTCAACGCTGTTCTCTTGGAAAGGAACAAATTGACGAGTGGTACAACATGGCATACGGCTGGTCTGGTATGGTCTCTCCGACCCAGTGATTTGGAAATTAAATTGCTACAAGATTCTAGACAAGTTTATAGTTCACTAGAGCAAGAGACAGGAGATTATGCAGGATGGATCAATAATGGAGGCATGTTTATATCGCGAAGCAAGCTTCGAACGGAAGAATACTTAAGGTTACACACATTAGGGAAAGCAATGGGTATTCCGAGCGAAATTCTTGATCCAAATGAAGCTCAGAAGTTATTCCCACTTTTGGACCCATCAGTATTTAAAATGGCACTCTATTCACCTCTCGACGGCACTATAGATCCCGCAATGGCTTGTAACGCCCTCGTCAAAGCAGCCTCCAAGAATGGTGGAAAGATATACGAAGATTGTCCGGTAATTGATATTCATTACGCTCATAACTTGCTCGGACATAAAGAAGTCACCGGGGTTCATACAGAAAAGGGATTTATCAGGACCAAGTGTGTAGTAAATTGTGGTGGTGTATGGGGTGCTCGTATAGCAAGGTTCGCTGGAGTGCCATCTTTGCCACTGATTCCATTTAAACACGCCTACGTAGTATCCGACGCCATCCCAGAAATCAGAGGTTGCCCGAACGTTAGGGACCATGATGTCAATTTATATTTCAAAATACAAGGCGAAAGCTGTAATATTGGTGGATATGAAAATAATCCAATAATGCTTGATCAGGTTGCAGATAGTCAAAGCTTCCATTTATACGATTTGGACTGGGATGTGTTCAGTGTTCATATGAACAGTGCTACGTCACTTTGTCCGAAACTCGGAAAAGTAGGGATAAAAAGCACAGTTTGTGGTCCCGAGTCATTTACACCCGACCACAAGCCGCTAATGGGAGAAGATTGCAATGTTTTTGGTTTGTACCATAACTGTGGATACAATTCCTCCGGCATGATGTTTTCTGCTGGGACAGCCATACAATTAGCTGAATGGATCATTAGTGGAAGGCCACACTACAACATGTTTACATTCGACATTGCTCGTTTTACGTCCGGCCAACTAGCACGACCGCACTGGGTCCGTGAGAGCAGTCACGAGGCGTATGTTAAGAACTACAGTATCGTGTTCCTGAACGACGAACCCCTCGCTGGTAGAGATGCGAGTCATGACGCTCTTCATCAGGAGCTGATAGATGATGGCGCTGTGATGCAGGCGAGAGCCGGCTGGGAACGTCCAGGGTTCTTTGTACCCGGGGAAAAGATCAGGGTCCAACAATATGACTGGGGTGGCGTGAATGACTACCCTCGTAATTTGGATCAAAGATATGAAGATCTTCTTAGAGGAGATTACACGTTCGGATTTTCAAAACATCATGATATCATCGGGTCGGAGGCGTTGGCGTGTAGGAATGCTGCGGCTTTATTCAATATGTCTTACTACGGAAAGTTTTACTTAACAGGACCTGATGCTCAGAGAACTGCTGAACTAGCTTTTACCGCTGACTTGAGCAAGAAACATGATGGTGTTGTTTATACTCTTATACTCAATGAGAAAGGTGGAGTGGAGGCAGATTTGACTGTCAGCGTCCTTGATGGAGGAAGTGGGCAGCTACATGAACCGATATTTAAAGGTCGTGGTTACTACGTGGTGACAAGCGGCTTCAGTGCGAATCACACAGCGTCTATCATCCGTCACATTATTTACAAACACAAACTTCGCGCCAATCTCACTGACGTTAGCAAACAGCTTTGCATTCTAGCCATTAATGGTCCTAACAGCCAGCGCATACTGCAGGGATACACAAGCGCGGGTCTATCAAATGATGCTTTCCCGTTGTACTCACATCGCAGCATCAAGGTATCCAAAGGGCCTCACTCTCCTGACAATAAGGCCTACACTTGTCGTGCTTTAAGAGTGTCTTGGACAGGCGAGCTCGGCTGGGAGCTTCATGTGCCTTCCTCGCACGCAGTTCAAGTCTACAAAGCTTTAACTCAGAACAGCGGACTGAAAAACGCTGGCTGGAGATCACTTACATCTTTAAGTACTGAAAAAGGTTTCCATCTTTGGAATGCTGATCTGAGAACCGATGATAATCCGGTAGAAGCAAACCTGTCATTTGCATGTCGCAAGGATGGGGAGTACATTGGCAACGAAAGCGTAACAAGGGCTAGGCAAAATGGAGTAACAAAGAAATATGCCTTCTTCACCCTCGACGATAAGGTCGCATTATTTGGACAAGAAGCTATATACAGAAATGGAGAGCCTGTCGGCCACCTTCGAAGAGGGGATTATGGCTTCTTCCTTGACAAATCCATTGGTGTAGGCTACGTTACCAATAATGGTTCAATGGTCACTAAAAATTACTTACAGGATGGTGAATACGAAATTGAAGTTATGGGAAAAAGATACAAAGCCAACCTTCACCTAAAGTCTCCATTTGATCCAAAAGGACAAAGGATGCTTGGTAATTATGGAGAAATGGGCATGGATGAAAACACACACGAACCTCATGCTGGACAAAATGAAAGAGCTGGTGGTAGCGAATAG

Protein sequence:

>DPOGS204504-PA
MFKVIRDRCIRTKAASYLKSTNGRKYYSNEVVTSADIVVIGGGIAGCNTLYQLSKRGVNAVLLERNKLTSGTTWHTAGLVWSLRPSDLEIKLLQDSRQVYSSLEQETGDYAGWINNGGMFISRSKLRTEEYLRLHTLGKAMGIPSEILDPNEAQKLFPLLDPSVFKMALYSPLDGTIDPAMACNALVKAASKNGGKIYEDCPVIDIHYAHNLLGHKEVTGVHTEKGFIRTKCVVNCGGVWGARIARFAGVPSLPLIPFKHAYVVSDAIPEIRGCPNVRDHDVNLYFKIQGESCNIGGYENNPIMLDQVADSQSFHLYDLDWDVFSVHMNSATSLCPKLGKVGIKSTVCGPESFTPDHKPLMGEDCNVFGLYHNCGYNSSGMMFSAGTAIQLAEWIISGRPHYNMFTFDIARFTSGQLARPHWVRESSHEAYVKNYSIVFLNDEPLAGRDASHDALHQELIDDGAVMQARAGWERPGFFVPGEKIRVQQYDWGGVNDYPRNLDQRYEDLLRGDYTFGFSKHHDIIGSEALACRNAAALFNMSYYGKFYLTGPDAQRTAELAFTADLSKKHDGVVYTLILNEKGGVEADLTVSVLDGGSGQLHEPIFKGRGYYVVTSGFSANHTASIIRHIIYKHKLRANLTDVSKQLCILAINGPNSQRILQGYTSAGLSNDAFPLYSHRSIKVSKGPHSPDNKAYTCRALRVSWTGELGWELHVPSSHAVQVYKALTQNSGLKNAGWRSLTSLSTEKGFHLWNADLRTDDNPVEANLSFACRKDGEYIGNESVTRARQNGVTKKYAFFTLDDKVALFGQEAIYRNGEPVGHLRRGDYGFFLDKSIGVGYVTNNGSMVTKNYLQDGEYEIEVMGKRYKANLHLKSPFDPKGQRMLGNYGEMGMDENTHEPHAGQNERAGGSE-