Monarch geneset OGS2.0

DPOGS209723
TranscriptDPOGS209723-TA1554 bp
ProteinDPOGS209723-PA517 aa
Genomic positionDPSCF300105 - 15179-20292
RNAseq coverage47x (Rank: top 71%)
Annotation
HeliconiusHMEL0080170.070.83% 
BombyxBGIBMGA008934-TA0.068.88% 
DrosophilaCG14590-PA1e-9539.40% 
EBI UniRef50UniRef50_D6WX492e-12544.57%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WX49_TRICA
NCBI RefSeqXP_971416.12e-12745.59%PREDICTED: similar to CG14590 CG14590-PA [Tribolium castaneum]
NCBI nr blastpgi|910888573e-12645.59%PREDICTED: similar to CG14590 CG14590-PA [Tribolium castaneum]
NCBI nr blastxgi|910888573e-13345.59%PREDICTED: similar to CG14590 CG14590-PA [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL16512 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209723-TA
ATGGGAAATAACGAATATGAGATCAGAAAATCAGAGGAACTTGGTCGGTACTTAGTAGCAGCAAGAGATCTTACCCCAGATGACGTAGTTCTTACAGAGTTGCCCCTTGTTTATGGACCTAAATCCATGCCTGACCCAGAGGCTTTGATGCCGTGTGTGGGATGTTATAAACCTATATTTACTGATGTCGGTGAAAGATGCTCCAGATGTGGCTGGCCGGTTTGTTCTGGCAACTGTCCAGGTCTGAAAGATCCTCTCCATCACGGTGTTGAATGTGAAATACTGAGCGCGCGTCCAGAATGTGTTTTGGACAACATGGCTGATTATTACCGACACGATGCTCTACTTCCATTACGCTGTGCTCTTCTTCAATATACAGATGATGATAAATGGAAAAAACTTTTAGAACTTCAGTCACATATGGAATGTCGAGTACCGGGAACTGATGCTTATGATGAAGCAGATGAATTCACTGTTAAATATCTTATGAACGTTTTCATCAACAAGATTGATAAAAATGTTAAAAATAAATATCTTAATCTTATATCTGGAGAACTGTTGCACAAAATTTGTGGCATAATCGACACTAATGCGTTAGAAATAAGACTACCAAATGGATCTGAACTTAATGCATTATACGCAACAACATGCATGATGGAACACAGCTGTGTCCCAAACACAAAGCATTTATTCAACACATCAGGAAAAGACGTTAAAGACAAATATAAAATTACAGTGAAAGTCGTCGTACCCATTAACAAAGGGGATCACGTAGCGACAATGTACAGCCATGCTTTATGGGGAACACAAGCCAGACGGCAACATTTGAAAGATACCAAATATTTCTCCTGCAAGTGCATACGTTGTAGTGACCCTACCGAGTTGGGAACTTATTTAAGCGCCATGAAGTGTTTTGGCGATGATAAGGGTCCATGTGATGGAATTCATTTACCTGAAGACCCGCTAGATGAAGAAACCGACTGGGCTTGTAATAAGTGTACAGTAAAAGTAAATAATTCTCAAATAAATATCCTCATATCTGAAATGGGTGAAGAGGTAGAGAATGTGCAAATGATGGGAGGTTCTGTGAACATGTTAGAAAATATTTTATGTCGATTATCGACATTTCTACATCCAAATCATTATCACCTGTATTCAATTAAACATTCACTTATACAGTTGTATGGGAGACAATCTAGTTATATGTCTGAAGAAATATTGGACAAGAAAATTAAGATGTGCAAGGATTTGATTTTCATAACGAAGACTTTGGATCCAGGTAACGCAAGACTTAGTTTATACTCCGCAATCCTGCATCATGAGTTACATTCAGCTTTGGTACTGAAGTCAAAAAAGGCTACTAAAGATGGATCACTTAGAACTGTCGATGAAATTAGACCATTGATTGATGAAGCGAAATTGTCAATAGAAGCAGCTTTATCATCCCTAAAAGATGATGTTGAGGAGACATCAGGTAAAAAATTGCATGAAGTTATCGAAAAAAGTAGACTTGACTTCGTCAACTACTGTGAATCAAAAAACATCAACATTTAA

Protein sequence:

>DPOGS209723-PA
MGNNEYEIRKSEELGRYLVAARDLTPDDVVLTELPLVYGPKSMPDPEALMPCVGCYKPIFTDVGERCSRCGWPVCSGNCPGLKDPLHHGVECEILSARPECVLDNMADYYRHDALLPLRCALLQYTDDDKWKKLLELQSHMECRVPGTDAYDEADEFTVKYLMNVFINKIDKNVKNKYLNLISGELLHKICGIIDTNALEIRLPNGSELNALYATTCMMEHSCVPNTKHLFNTSGKDVKDKYKITVKVVVPINKGDHVATMYSHALWGTQARRQHLKDTKYFSCKCIRCSDPTELGTYLSAMKCFGDDKGPCDGIHLPEDPLDEETDWACNKCTVKVNNSQINILISEMGEEVENVQMMGGSVNMLENILCRLSTFLHPNHYHLYSIKHSLIQLYGRQSSYMSEEILDKKIKMCKDLIFITKTLDPGNARLSLYSAILHHELHSALVLKSKKATKDGSLRTVDEIRPLIDEAKLSIEAALSSLKDDVEETSGKKLHEVIEKSRLDFVNYCESKNINI-