Monarch geneset OGS2.0

DPOGS210336
TranscriptDPOGS210336-TA1203 bp
ProteinDPOGS210336-PA400 aa
Genomic positionDPSCF300025 - 459278-460859
RNAseq coverage108x (Rank: top 60%)
Annotation
HeliconiusHMEL0138272e-16573.25% 
BombyxBGIBMGA011977-TA5e-17172.79% 
DrosophilaCG5274-PA6e-6136.17% 
EBI UniRef50UniRef50_D6WGD53e-7742.61%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WGD5_TRICA
NCBI RefSeqXP_971716.15e-7842.61%PREDICTED: similar to CG5274 CG5274-PA [Tribolium castaneum]
NCBI nr blastpgi|910920949e-7742.61%PREDICTED: similar to CG5274 CG5274-PA [Tribolium castaneum]
NCBI nr blastxgi|910920942e-7442.61%PREDICTED: similar to CG5274 CG5274-PA [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL14083 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210336-TA
ATGTCTGCATCAGATTTAAAACATACTTTGAGGAAATTTGAATTTCCAGCATGTGCTAAAGAAGCGCTTATTAAGATTGAGCAATTATTAGTAGGCCGGGCGGCACCGACTAGTAAACAACTCGACATAGCAATGGACATAATATCAGAGTTCGTATTTTGTGAAACCGATCGTAGAAGTAACCCACGTCGAGGTCTCAATCCTCTTCAAGAATTACAACTTATTGATATTATATGCGATTATTTATGCGCTTGTACAAATGATACAACTAAAAATACAATATTCCTATCTTTATTCGGGGGCAAAGAGAGTCAGCGAAAATTAAAAATTCTTAGTATTCTAGCAAGTATGGCAGTATCGGCATCTAGTACACCAGTTTTACTGGCTGTAGGTGTGTGGCTCCAACAAATGGGTTGTTCATCTTCCCAATCCCTACAACTAGCAGAGAATATTGTTAGGGATCACTTCTACCTGAACACATCCAATCAGAATGTTTTAAAGACACTAGCAACAGATGCTCCTCAGTTTGTATCCAATTTTATAACTGCTGTTACTGAGTTATATCTACATGACATACAGAAACCTAAAAAAATGCCCCCAAAGAATCTCTTGGAAGTAATTACTTCTTGGGTTTATACAAATCCCTCTCTCTGTATGTCGGCACAACTCAATCCGGCTGCCCTGCCTATCGGCTCCATTCCAATGGCTGCCGTTACTCCTCTGGCTGGATTAATACACTGGTGTGCACTTGCACCACTTTATGTGGATGAAGATATTGACACCGATCCTATTCCTATCAAGAAAATCAAACTTGAAGAAGAAAAACATACAATTGTAAAGTCAGTTACTAGTAAATCTCTCTCCGAGACAGAGTTGTATATAAAATTGCACCTCGGAGTCTTACATAGCTTGCGTGCTGGCAAGAGAACGCACGGTCCACCTACAGCAGTAAATGCACAGCATCTTGTGGCTCTGACTCCAATAGTGCAGGCCTACGCACATAATTTAATAAAATGTGGAGTTAAATTTCAGTCCGACAATAGGTTACAGGACTGTTTAGATCGAATTGGTCAAGCTGTTCAAGTAGCACTTGCAAATGGATGTGTGTATGGAAACATAAACAATCTCCTCGCCGCCCTGAATTCTTTACCAGAGAACAGATTGCTCTGCATCATCATAAAGAGCCATCACCAGTCTATTTGA

Protein sequence:

>DPOGS210336-PA
MSASDLKHTLRKFEFPACAKEALIKIEQLLVGRAAPTSKQLDIAMDIISEFVFCETDRRSNPRRGLNPLQELQLIDIICDYLCACTNDTTKNTIFLSLFGGKESQRKLKILSILASMAVSASSTPVLLAVGVWLQQMGCSSSQSLQLAENIVRDHFYLNTSNQNVLKTLATDAPQFVSNFITAVTELYLHDIQKPKKMPPKNLLEVITSWVYTNPSLCMSAQLNPAALPIGSIPMAAVTPLAGLIHWCALAPLYVDEDIDTDPIPIKKIKLEEEKHTIVKSVTSKSLSETELYIKLHLGVLHSLRAGKRTHGPPTAVNAQHLVALTPIVQAYAHNLIKCGVKFQSDNRLQDCLDRIGQAVQVALANGCVYGNINNLLAALNSLPENRLLCIIIKSHHQSI-