Monarch geneset OGS2.0

DPOGS215975
TranscriptDPOGS215975-TA1332 bp
ProteinDPOGS215975-PA443 aa
Genomic positionDPSCF300078 - 520067-523813
RNAseq coverage60x (Rank: top 68%)
Annotation
HeliconiusHMEL0048151e-10651.34% 
BombyxBGIBMGA000939-TA5e-8951.28% 
DrosophilaCG14689-PA6e-3528.11% 
EBI UniRef50UniRef50_D6WC581e-4633.72%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WC58_TRICA
NCBI RefSeqXP_972004.13e-4733.72%PREDICTED: similar to CG14689 CG14689-PA [Tribolium castaneum]
NCBI nr blastpgi|910780785e-4633.72%PREDICTED: similar to CG14689 CG14689-PA [Tribolium castaneum]
NCBI nr blastxgi|910780787e-4533.82%PREDICTED: similar to CG14689 CG14689-PA [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL11818 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215975-TA
ATGTTTGAAGATGACGAGGACGAGATTTATGAAGCCATTTATGAAAACGGGCACTGGGTGTGGGATACCCAGGATGAAGCATTAAAATTTATAAGAAATGTGGTATCGGAATCGGATGTATATGCGCCACTATCAACCACTAAGCTCAAAACCATCGAGTTTAAAGATGACAAAGATCTTTGGGAGCAGCAAAAATTTCGAAAACGCATGCAGAGAAAAACAACTGATTTCGATGTCGTAACGCTTCAGAGGTGTCTCCCACAGGACGTGAAGGATGTGGTGTTATTTACAGCGCCAACTTCAATATTGTCTCCAGCAGTGATAAACATGTTACACCTGCCCACCACAGAGCGGTTTTTACGAGCTTTAATATTGTGCTGCCAGTACTACTTACAGGTATCAGAAATAATGACTAATCGAACTATAGAGCTGGAAACAAAGCTACGTACGGAAAATAGCGCGGAAATAGAAGCCAAATACGGAGATGATATGGAAGATCTGAGATTGCTCATCGCTAAGGATTATTATATGATGCTTCTGGGTGAGGGAGATTTTGCTAAATACCATCACATGGGTACTCAAAAATTACACTCACTTTCCAAAAAAGAGGCTGCTTTTTTCGAAACTATATTAAAGATAGCCATTCAAATTGTCTGGATTGCTCTAGGTCGAAAATATTACAATCAAATCGAATTGGAGGTAAATAGAATATTTAAATCTGATATTTACAACTCCGCGGAGCACAAATTAAATACAAACTATCAAGTTAAAATGAATGACAGAGAGCAGTCTGTTTTATTTGGACACTGCCTGCATCTTGGTAAGAAAGTTAACAGCAACTCACCTCTTATAAACGAGGTATACTGCCATCGTGGCATCGACTACCGTTTGTTTGGACTTGGGGCTATAAAATATCCCGGATTAAAAAGACGCTTAAGTTTTTTGGAAGGTATACTATCAGAACCAGAGGAAAAGTTTACAGAATATGGCTTTACTCTTGGTATTTTAGGACTCGCTCGTTCGAGGTTTGATATAATGTTGAAAGAAATCAAAGCCCCTGCAGGCGCAGGCAGCGTGTCTTCAGCTAGCATCAGGCATAGTTTATCAAGGACATCAAGAGTAATGTCTCGTAAAAGCACGGCATCCGCTATTCCACAGAAGCTGTATCCAAACATTTATATACCAAGGAAGGACGAAATAAATGACATCCTTGCCGACTTTTGTAAAGAATCTCTTCCAAAGCTAAAAAGGAATGAAGAACAACGTTTGAAATGGATCCACCGCATCAGTGGAAGAAAAGCTCTTAGGAAAGTTATAAAAAAAGTTCCTTGA

Protein sequence:

>DPOGS215975-PA
MFEDDEDEIYEAIYENGHWVWDTQDEALKFIRNVVSESDVYAPLSTTKLKTIEFKDDKDLWEQQKFRKRMQRKTTDFDVVTLQRCLPQDVKDVVLFTAPTSILSPAVINMLHLPTTERFLRALILCCQYYLQVSEIMTNRTIELETKLRTENSAEIEAKYGDDMEDLRLLIAKDYYMMLLGEGDFAKYHHMGTQKLHSLSKKEAAFFETILKIAIQIVWIALGRKYYNQIELEVNRIFKSDIYNSAEHKLNTNYQVKMNDREQSVLFGHCLHLGKKVNSNSPLINEVYCHRGIDYRLFGLGAIKYPGLKRRLSFLEGILSEPEEKFTEYGFTLGILGLARSRFDIMLKEIKAPAGAGSVSSASIRHSLSRTSRVMSRKSTASAIPQKLYPNIYIPRKDEINDILADFCKESLPKLKRNEEQRLKWIHRISGRKALRKVIKKVP-