Monarch geneset OGS2.0

DPOGS208465
Transcript: DPOGS208465-TA (1398 bp)
Protein: DPOGS208465-PA (465 aa)
Genomic position: DPSCF300064 - 1607161-1609689
RNAseq coverage: 407x (Rank: top 30%)
Annotation
Heliconius       HMEL008053              2e-10  55.41%
Bombyx           BGIBMGA010649-TA        2e-91  51.88%
Drosophila       CG15544-PA              9e-22  63.77%
EBI UniRef50     UniRef50_UPI000224647F  4e-59  56.34%  UPI000224647F related cluster n=1 Tax=unknown RepID=UPI000224647F
NCBI RefSeq      XP_001848419.1          6e-46  45.83%  conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp   gi|383856790            3e-59  56.74%  PREDICTED: uncharacterized protein LOC100883063 [Megachile rotundata]
NCBI nr blastx   gi|345481618            2e-60  39.67%  PREDICTED: hypothetical protein LOC100680541 [Nasonia vitripennis]
Group
KEGG pathway:
Orthology group: MCL25752, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208465-TA
ATGGCTGATTTAATATTAATATTGCTTTACACATTAGTGGCTTCCCAACCGGTGCCGATAGCCGAAAACCATTATGTGAAACAGAATGTACCCCAACACGTCCACAACGCTCAAAGTATACCAAAGCAAGGAGATACTTTAAATGAAAACGACGAACAAATTATATCTGTAAGAATAACGTCTTCTGTAGCTGTTGGTAGAACAAAACCAAGACCAATAATAACAATTAATGAGAACAGAAAAGATGTCATGAATCAGGATACTCATATAACTACGCCTGCGATTGAAGTAAGCCAACCATTTGCCTCTACAACCGACATTGATGATGTAACGACAGCTCTCCCAACGACTATTTTAGAAGATTTCGAAGCTGTCACAGATATACCGCCAAATTTAATCGGGGCTAACATAGAGTATATAAATCAGTTAAAAGCTAAAAAAGGAACAAGAGGTCTTAGTAATTATGAAAGACTTCATTCATTTACACCGAACCATGGCTATCCAAGTAGTAATAAGACGAATGATTCGGTCGCTGAATCTTCTATATTAGGACAGGACGATGAAGTAGATGAAGTACCGATAGCAAGAAATGTGCCGTCGTCATATTACAAAAACAATATTCAGGAACCAACCCGAGCGAATACTAGATTGACAACTGTTAATTTTAATGTAGTACACGAAAACCAAAAGGTTCCCAAATATAATGACCGTTTACCTCCATACGTAAATGCAAATCAGAATGTTGGCGATCAGAAGTTTTATAATACACCATCGCAAATATACAGCCAGCCGGCACAAGTGTACAGCGAACCAGCTAAGTTTTATAGCGAACCAGCTAAAATATACAGTGAACCAGCAAAGATATACAGCGAGCCGGCCAAAGTATACGGTGAACCTTCAAAGATATACAGTGAACCGGCTAAATTTTATAGCCAACCCGCGTCTTTACATTTGTCTCCCGAACATGCTCAGGAACGCCGACCGTGGCAGCCACAGAAATTGAAATCCACCACCATTAGTACGACACAAACTGCCTCAACTAGCCCATATAAACAGAACTCTCAGAGAATCACCGATCAGGAGCCTGAGAGGAACTACGAGGTGGACGAAAAGGTCAGTGTGATGACAGACGGCAGGAGTCACGGCGAGCAGACCACTGATGTAGATAACCCAGAGAATTGCAAACAGGAGAATTGCAAAGTTGGTTACGTAGTGGAAGGCCGTCAGTTTAGAAAATATAGGGTTGAAGAGAGAACTCCTGACGGCTTCATCGTGGGAGAGTACGGGGTGGTGCGGAACGAGGACGGCGCGCTGAGGGGTGTGAGGTACACAGCAGACAGCGACGCCAGCCCCCGCCTCATACACGACGCCCTCATGAAGTTCCTACAGCTGAAGTAG

Protein sequence:

>DPOGS208465-PA
MADLILILLYTLVASQPVPIAENHYVKQNVPQHVHNAQSIPKQGDTLNENDEQIISVRITSSVAVGRTKPRPIITINENRKDVMNQDTHITTPAIEVSQPFASTTDIDDVTTALPTTILEDFEAVTDIPPNLIGANIEYINQLKAKKGTRGLSNYERLHSFTPNHGYPSSNKTNDSVAESSILGQDDEVDEVPIARNVPSSYYKNNIQEPTRANTRLTTVNFNVVHENQKVPKYNDRLPPYVNANQNVGDQKFYNTPSQIYSQPAQVYSEPAKFYSEPAKIYSEPAKIYSEPAKVYGEPSKIYSEPAKFYSQPASLHLSPEHAQERRPWQPQKLKSTTISTTQTASTSPYKQNSQRITDQEPERNYEVDEKVSVMTDGRSHGEQTTDVDNPENCKQENCKVGYVVEGRQFRKYRVEERTPDGFIVGEYGVVRNEDGALRGVRYTADSDASPRLIHDALMKFLQLK-