Monarch geneset OGS2.0

DPOGS210312
TranscriptDPOGS210312-TA1275 bp
ProteinDPOGS210312-PA424 aa
Genomic positionDPSCF300025 - 991143-994743
RNAseq coverage90x (Rank: top 63%)
Annotation
HeliconiusHMEL0217384e-17670.18% 
BombyxBGIBMGA011878-TA0.073.30% 
DrosophilaCG3706-PA1e-5832.86% 
EBI UniRef50UniRef50_E9H6414e-10143.99%Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9H641_DAPPU
NCBI RefSeqXP_002731927.11e-6733.58%PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
NCBI nr blastpgi|3214617122e-10043.99%hypothetical protein DAPPUDRAFT_308079 [Daphnia pulex]
NCBI nr blastxgi|3214617122e-9844.20%hypothetical protein DAPPUDRAFT_308079 [Daphnia pulex]
Group
KEGG pathwaybfo:BRAFLDRAFT_1210212e-15 
 K01007 (E2.7.9.2, ppsA)maps-> Reductive carboxylate cycle (CO2 fixation)
    Pyruvate metabolism
Orthology groupMCL17578 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210312-TA
ATGACACCTTTATTCATCACAGCTGTTATATTTATAGTATTTATTGTGATTTATCTAAAGAGAAAAGATCCCGAACCGATATTCGGCGTGTATACTGTAGCAGGAAAGTGGTACTATTTGAAATATGTCGCCTTTTCCTGTATTTATTATTATAGACGGTACTCGAATAAGAGCAAAGCGGTCGGTGCTGATGGTGGAGCGGGTCAAGGTGTGAAGGCTATCTCAGACCCGAAAGAAATGGACAAGGCTCAGCCCCTGAGTGACCACGCTAAGGCTTTCGATGCGGTATTTTTCATATCCGCAAAGAAGGACGAACATGACAAGGGAATATACGTAATCGCTGGTTGTGAGAGACGACCTTTGGGAATGTGCAATGGACTTTTCTACATTGGGTTGCCAGGAAAAGGACTTTTGTGCAGTAAAAAGATTCCGGACACGGTCCTTTTCGGTGCACAAATCGGTGAATTTGGAGCTGAAGGGGTTCTTATTACCCCCGCGGAACCGATGAAGAAATGGACCGTCTCTTATAAAGGACCTATGTGGTATCAAAATGAGCCCAGCAAAATAGTAGAAGTAGAATTCAATGGTGAATGGACAGCGACGAGCAACTACTTTGATTACGACACCGATTTACACCCTCCAGCTGTAATCCGATCAATTGCTAGAGAAAAGTGGAGTCGAAAATACTTTAATAACCTGAAAACAGCTCACCAATCTCACTATGAACAGTTCGGCGTAATGAAGTGTAAATTTACTATTGAAAAGGAATCCTTTGAATTCACCTTACCCTCCTTCAGGGATCACAGCTTTGGTCAAAAGCGGGACTGGACGCTTATGCACAGATACGCCTTCCATCATATTTTCTTACATGACGGCACCAACATCAGCGTTGGAGTCATCTGTCAGCCTTCCACCGCGACACGCATGGAGGTCGGCTACGTTAGTCTTCCGAGCGGTGAGACTTTGCCTGTCGAGTGGGTGGAGATGCAGTTGTACCAACACGGGGAGGGCGGCGCCGCGCCTAAAGACTACGCGTTCAGGATAAAGGCTGGAGATGTTGTTTACATTGTTCAGGTGTTGGTGGAATACGAGTCTATACACTTTGTGTCTCAAGATTGGGACGCCCGAATGGTGGAGCGCTTCTGCAAGTTTGTGGTGAACGGCGTCCCGGGGCGAGGGGTGTCTGAGTTCCATTACAGACACCACGGAGGACGGCCAGATGAGGTCGCGCAGAATGACCCCGAGTGGTACAGGAAGATGTGCCATAAGATATAG

Protein sequence:

>DPOGS210312-PA
MTPLFITAVIFIVFIVIYLKRKDPEPIFGVYTVAGKWYYLKYVAFSCIYYYRRYSNKSKAVGADGGAGQGVKAISDPKEMDKAQPLSDHAKAFDAVFFISAKKDEHDKGIYVIAGCERRPLGMCNGLFYIGLPGKGLLCSKKIPDTVLFGAQIGEFGAEGVLITPAEPMKKWTVSYKGPMWYQNEPSKIVEVEFNGEWTATSNYFDYDTDLHPPAVIRSIAREKWSRKYFNNLKTAHQSHYEQFGVMKCKFTIEKESFEFTLPSFRDHSFGQKRDWTLMHRYAFHHIFLHDGTNISVGVICQPSTATRMEVGYVSLPSGETLPVEWVEMQLYQHGEGGAAPKDYAFRIKAGDVVYIVQVLVEYESIHFVSQDWDARMVERFCKFVVNGVPGRGVSEFHYRHHGGRPDEVAQNDPEWYRKMCHKI-