Monarch geneset OGS2.0

DPOGS201198
TranscriptDPOGS201198-TA1632 bp
ProteinDPOGS201198-PA543 aa
Genomic positionDPSCF300262 + 237080-245352
RNAseq coverage178x (Rank: top 50%)
Annotation
HeliconiusHMEL0047441e-8053.80% 
Bombyx% 
Drosophila% 
EBI UniRef50UniRef50_C3YCT96e-1238.19%Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3YCT9_BRAFL
NCBI RefSeqXP_001807190.12e-1440.17%PREDICTED: hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|1892402143e-1340.17%PREDICTED: hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|1892402141e-1442.15%PREDICTED: hypothetical protein [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL34387 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201198-TA
ATGGGAGCGCCGTCGTTACCTTCGGTGCTGCCACTGGCACGACATTTCCCTCAGTGGAGCCGAACTGCGATACTCACCATTACCAATGGAACATCAGAGACCGATGAGCTCAAGAAATCGTCTCCTGAGTCTACGACCAGCGGGTTGTTCATAACAGGTGATGCAGCGGCCCGCGTGGTTCACCTGGAGCACAGTGTGCGCTTCCTCCAAGAACAACATAGACTGATGCTGAGCGGACTCCACGCAGAGATAGAGGCCTTGAGAGAGAGAAATAGAGACTTGCAATTCCAATTAATATTCAACAAGGAGACGCCGAAGTCCGCTACGGCTGTCGAGGACGAAACTAATGAGGAGCGTCTTCGTAAGGAGGTGAGTCGTCTGGAGCGAGAGGCGAGTGTGGCGCGGGGGGAGGCGCGGGCGGCCGGGGTCAGGGAGCTCCAGCTGCAGAGGTTACTCGACGAGCAGACTGAAAAGCTGCGGGAGCTGGAGCTGCGGCAGGTCGGTGGGGTGACCCCGGGCGGGGGCGGGGGTGCGGAGGAGGAGAGCCACGCCGAACTGCGCGCCTGCGGGCCGACGCCGAGCGGCAGCGCAGAGAGGTACTCGGGGGCGCGGGCGTCACCAACCTACTGTTCGACGTATGTTACTGACGCGACCCTCTCCGTCTCTCTTCAGCTACAGTGTATGAAGAACTCTCTCCACGCGAGTCTGCGCGCCAGCGGCCTGGACGCATTAGGTTACCAGAATACTTACGCCTTTCCACCAGTATACGGCGAGTTTTGGAGAGAGCCGCTCAGAGAAGAGTACAGTATGGGATCAAGAAATAGAAATAATAGAAGACCGCTGACACTGCCAGAGCTAAACTCTGCTGCACGTTCGACAGTGTATGCGAATAATCATACGCGCGTTCGAGGTTACGGTTACGACAAAAATAAAAAGAGCCAGCCAACAAATGGAGAAACAACAAACGGGAAGAATCCCGAAGATCCGGCGGCATCAGAAATGTCACGATTAAAAACAATAAAGAACGATAATTTAAATAGAACCAAAACGACTGAGCCATTGTCCACATACCCCGAGTTGAAAATAGTGTATGTTACTGACGCGACCCTCTCCGTCTCTCTTCAGCTACAGTGTATGAAGAACTCTCTCCACGCGAGTCTGCGCGCCAGCGGCCTGGACGCATTAGGTTACCAGAATACTTATGCCTTTCCACCAGTATACGGCGAGTTTTGGAGAGAGCCGCTCAGAGAAGAGTACAGTATGGGATCAAGAAATAGAAATAATAGAAGACCGCTGACACTGCCAGAGCTAAACTCTGCTGCACGTTCGACAGTGTATGCGAATAATCATACGCGCGTTCGAGGTTACGGTTACGACAAAAATAAAAAGAGTCAGCCAACAAATGGAGAAACAACAAACGGGAAGAATCCCGAAGACCCGGCGGCATCAGAAATGTCACGATTAAAAACAATAAAGAACGATAATTTAAATAGAACCAAAACGACTGAGCCATTGTCCACATACCCCGAGTTGAAAATAGTGGTAACTAAATTAATACATCACACCGACGTGAGCGCGGGCGAGTCTCGGTCGCGGCGTCGCACGCACCGCAAACACTCGGACCACACGTAG

Protein sequence:

>DPOGS201198-PA
MGAPSLPSVLPLARHFPQWSRTAILTITNGTSETDELKKSSPESTTSGLFITGDAAARVVHLEHSVRFLQEQHRLMLSGLHAEIEALRERNRDLQFQLIFNKETPKSATAVEDETNEERLRKEVSRLEREASVARGEARAAGVRELQLQRLLDEQTEKLRELELRQVGGVTPGGGGGAEEESHAELRACGPTPSGSAERYSGARASPTYCSTYVTDATLSVSLQLQCMKNSLHASLRASGLDALGYQNTYAFPPVYGEFWREPLREEYSMGSRNRNNRRPLTLPELNSAARSTVYANNHTRVRGYGYDKNKKSQPTNGETTNGKNPEDPAASEMSRLKTIKNDNLNRTKTTEPLSTYPELKIVYVTDATLSVSLQLQCMKNSLHASLRASGLDALGYQNTYAFPPVYGEFWREPLREEYSMGSRNRNNRRPLTLPELNSAARSTVYANNHTRVRGYGYDKNKKSQPTNGETTNGKNPEDPAASEMSRLKTIKNDNLNRTKTTEPLSTYPELKIVVTKLIHHTDVSAGESRSRRRTHRKHSDHT-