Monarch geneset OGS2.0

DPOGS200699
Transcript: DPOGS200699-TA, 1416 bp
Protein: DPOGS200699-PA, 471 aa
Genomic position: DPSCF300343 - 119656-124161
RNAseq coverage: 23x (Rank: top 78%)
Annotation
Heliconius: HMEL017468, 5e-173, 86.02%
Bombyx: BGIBMGA008308-TA, 4e-157, 74.21%
Drosophila: Con-PA, 6e-54, 34.77%
EBI UniRef50: UniRef50_D1ZZZ7, 5e-78, 38.43%, Putative uncharacterized protein GLEAN_08134 n=1 Tax=Tribolium castaneum RepID=D1ZZZ7_TRICA
NCBI RefSeq: XP_001816009.1, 9e-79, 38.43%, PREDICTED: similar to AGAP002006-PA [Tribolium castaneum]
NCBI nr blastp: gi|189236506, 2e-77, 38.43%, PREDICTED: similar to AGAP002006-PA [Tribolium castaneum]
NCBI nr blastx: gi|189236506, 6e-76, 36.72%, PREDICTED: similar to AGAP002006-PA [Tribolium castaneum]
Group
KEGG pathway:
Orthology group: MCL25463 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200699-TA
ATGGAACAAGTTGGCAAGTACTTCGTTATAATTGCATCATTAATGTCTGTTGAACTAACTGTCACGGAAAGTAAAAATGAAACTAAACCGAAAATTGATAGACGGCACTATCAAATGCCACTCTTTATCAATGTCTGCGATATTGTCGATCGAAGATCCAAAGTACACTGCTACTGTGACAGTAATAAGCCCAAAATGGCAACTAGAGCAGACTGCTGGATATTCGGTGGCGGATTAACCGAGGATGATCCGATTTGGCCAAGCTTTTCCTCTCAATCCGAAATAGAACATCTCATGTTTAACGTTCGAACCGATGATGCGTTGAGTTTTGTACCAGCTAAAGCCATAGTTAGGCTTGATAGACTGAAACACTTGTCTATACAGTTTGGAACGATCAAAAGAATTTATCCATATGCATTTACAAACTCATCAACTTTAAAACAGATTGTTTTGAAGTCTAATAAAATTGCACATTTGGAAAAACATGCTTTTGCCCAAATGATGATGCTCTCAAACTTGAGTCTGGATGATAATCAAATAACTGAACTTAAGAGAGAGGTATTTTTTAATTTACCAAATTTGCACATATTAAATTTATCAAATAATAATTTAAGTCTTTTGCACGAAGTTATTACAAGAGAAACGTTTGAGGGGTTGGAAAATCTATCTCGTTTAAACTTGAGAAACAATAAGTTGGCTATGGTCGGTAACTTAGCGTTTAGCGAGTTGTGGGGATTGAAAGAATTGATGTTAGATAACAACGGTATCGAATACATCTCAGAAAGGGCTTTCGGAGGACTAACACAGTTAAAGAAGTTGACCTTATCTGGGAACAAGTTGGCAACTTTCTATGATGACATTCTGGAAGATATGAGGAGTCTTAGTGTCTTGGATCTGAGAGACAACCTATTGACGACAATATCATATGAAACCATACGGCCCATTTTGAACAACGAAAAATCTCAGTCATCAGTAGTTTATTTGGATGGTAACCCACTTAGCTGTAACTGCCGTTTATCTTGGATATATGTGCTCCGTAATGAAACTCAGGATACCACTATGAAGCATGCTCTTGAAAAAATATCTTGCGTCTCAGACCCCACAAATGATAGACGAATAAGTGAACCAAAAGAGGAAGAGGTCGAAAATAGTAACATTTTAGCAGATGATCATTATGACTACTATGATAAAAGTGACGATTATAGTAATGACAAAGAAAAAACTAAAATAGGTACAGCCATTAGGCTGATAGATATACCCTTAGAGACTCTCCCTTGTCCGAAAGAATTAATGCAGTCGATTGAGGAAACGTACGGCCATCCAGTGCAAAATGAGATACGATTAAAAGCCTTCTCTAGAGTTGGGAGAGATTTACCTAACTTTCTATTCTTCTTGACATTGTTACTATTATTTTAA

Protein sequence:

>DPOGS200699-PA
MEQVGKYFVIIASLMSVELTVTESKNETKPKIDRRHYQMPLFINVCDIVDRRSKVHCYCDSNKPKMATRADCWIFGGGLTEDDPIWPSFSSQSEIEHLMFNVRTDDALSFVPAKAIVRLDRLKHLSIQFGTIKRIYPYAFTNSSTLKQIVLKSNKIAHLEKHAFAQMMMLSNLSLDDNQITELKREVFFNLPNLHILNLSNNNLSLLHEVITRETFEGLENLSRLNLRNNKLAMVGNLAFSELWGLKELMLDNNGIEYISERAFGGLTQLKKLTLSGNKLATFYDDILEDMRSLSVLDLRDNLLTTISYETIRPILNNEKSQSSVVYLDGNPLSCNCRLSWIYVLRNETQDTTMKHALEKISCVSDPTNDRRISEPKEEEVENSNILADDHYDYYDKSDDYSNDKEKTKIGTAIRLIDIPLETLPCPKELMQSIEETYGHPVQNEIRLKAFSRVGRDLPNFLFFLTLLLLF-
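
Consistency check (illustrative):

The transcript and protein lengths listed above can be cross-checked against each other. The short Python sketch below is one way to do this, assuming the standard genetic code applies and that the trailing "-" in the protein record marks the stop codon. It checks that 1416 bp corresponds to 471 codons plus one stop codon, then translates only the first 18 and last 9 nucleotides of DPOGS200699-TA, copied from the record above. The names `translate`, `CODON_TABLE`, `cds_start` and `cds_end` are hypothetical helpers introduced for this example and are not part of the database.

```python
from itertools import product

# Standard genetic code (an assumption here), built from the conventional
# T, C, A, G ordering of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Lengths as listed in the record.
transcript_bp = 1416
protein_aa = 471

# 471 amino acids plus one stop codon should account for every base.
lengths_agree = transcript_bp == 3 * (protein_aa + 1)

# First 18 and last 9 nucleotides of DPOGS200699-TA, copied from the record.
cds_start = "ATGGAACAAGTTGGCAAG"
cds_end = "TTATTTTAA"

print(lengths_agree, translate(cds_start), translate(cds_end))
# prints: True MEQVGK LF*
```

The translated ends agree with the protein record, which begins with MEQVGK and ends with LF followed by the stop marker. Translating the full sequence with the same function would extend the check to the whole record.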