Monarch geneset OGS2.0

DPOGS207223
Transcript: DPOGS207223-TA, 1194 bp
Protein: DPOGS207223-PA, 397 aa
Genomic position: DPSCF300235 - 417359-420838
RNAseq coverage: 896x (Rank: top 14%)
Annotation
Heliconius: HMEL012654, 5e-177, 72.96%
Bombyx: BGIBMGA008554-TA, 7e-170, 71.50%
Drosophila: no hit listed
EBI UniRef50: UniRef50_UPI000179270F, 1e-44, 43.00%, UPI000179270F related cluster n=1 Tax=unknown RepID=UPI000179270F
NCBI RefSeq: XP_001946771.1, 3e-45, 43.00%, PREDICTED: hypothetical protein [Acyrthosiphon pisum]
NCBI nr blastp: gi|193643397, 5e-44, 43.00%, PREDICTED: uncharacterized protein C20orf72 homolog isoform 1 [Acyrthosiphon pisum]
NCBI nr blastx: gi|193643397, 6e-43, 43.00%, PREDICTED: uncharacterized protein C20orf72 homolog isoform 1 [Acyrthosiphon pisum]
Group
KEGG pathway 
Orthology group: MCL16378, Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207223-TA
ATGTTTCGTCCACTTAGTGCTTTTCATCGAAAACTCATATTACAAAAAGAAACTGTCAGAACTAAAGTGGTTCGACCTGCCTCTATCCTAAAACCGGCTGAAAAAATAAAACTATACAATAGAGAAAATAAAGATCTTTTCGGGCCATTGCTGGAAACTAATAGGCAAAGGAAGAGTCGTTTAAAGAAAGAAGCTAAAAATAGTTCTCATCAAAACAACGAGTCAATTGGATCGAATGAAAATACGCCGAAGTTTGTGGATGCTAGTAAACATGATATAGTTCAAAGACAAGTGAAGTCTAGTTATTTAACGAACTTAGGGATAATTCAACACAAAATGTGGCAAAATACGAGGACTTGTTTCAATTTTCTGGGTGTTGTATTCAATGGTGTCCCAATCAGATGTTTTAGGACCAATGCAATTAACTATTCAGCTCGGAGCTCGCCTGCACAGCCAGGTTATGCCAGTGAGGACAAAATAATCCAAATAAAACAACACCAAAATGATTTTACTAAACAGTTTCCATCAGTTACATTGATATTAAACAAAACAATGACGGAGGAATCTAGAAAAGCTCTTGATAAATGGAAACAAGAAAGGATAGCAGAAATGGGGTTAGAAGAATTTAATAGATTTTATGAAGCACAACTAGCTGTTGGCACGAAGTTCCATAATACCCTTAAAAACTATTTCACACAACCTCAAACCCAATTACGTATTGAAAAGGATGTTGAAGCTGTCTGGGTGTCTGTGAGCGAGGTCTTAAAGAGTATATCTTCTCCGAAAGCGATAGAATCTAATGTAGTTCATCCCCTATTGAAATATAGAGGAATTTTTGATGCTATAGCAGATTATGAAGACAAGCCTACATTGATTGAATGGAAGAAGTCAGATAAGCCTCGCAAATCGTTACAAATGACATATGACAATCCTGTTCAATTAGCTGCCTATTTCGGGGCAGTGTGCAATGACCTTAACTATAAACATTTCAATGTCCGAGATGCATTATTAGTAATTGCTTACACAGATGGATCTAAGGCAGATGCATATCATTTATCGACGGACAAGTTGAGGGAACATTGGGCCCAATGGTTAATAAGACTGGAGGAATATACGATCAAATACAATAATGATTCTGAAAAAATACTAAAAGGTGGCAAACGTTTGTTTGAAGAAGAAATTGGGAATCTCTGA

Protein sequence:

>DPOGS207223-PA
MFRPLSAFHRKLILQKETVRTKVVRPASILKPAEKIKLYNRENKDLFGPLLETNRQRKSRLKKEAKNSSHQNNESIGSNENTPKFVDASKHDIVQRQVKSSYLTNLGIIQHKMWQNTRTCFNFLGVVFNGVPIRCFRTNAINYSARSSPAQPGYASEDKIIQIKQHQNDFTKQFPSVTLILNKTMTEESRKALDKWKQERIAEMGLEEFNRFYEAQLAVGTKFHNTLKNYFTQPQTQLRIEKDVEAVWVSVSEVLKSISSPKAIESNVVHPLLKYRGIFDAIADYEDKPTLIEWKKSDKPRKSLQMTYDNPVQLAAYFGAVCNDLNYKHFNVRDALLVIAYTDGSKADAYHLSTDKLREHWAQWLIRLEEYTIKYNNDSEKILKGGKRLFEEEIGNL-
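
The transcript and protein lengths in this record are consistent with each other: 1194 bp is 398 codons, which is 397 amino acids plus one stop codon. The following is a minimal sketch of that check, together with a translation of the first and last codons of DPOGS207223-TA. It assumes the standard genetic code (NCBI translation table 1); the function names `expected_protein_length` and `translate` are hypothetical helpers written for this illustration, not part of any database tooling. The sketch prints the stop codon as `*`, whereas the protein record above marks it with `-`.

```python
# Standard genetic code (NCBI translation table 1), assumed here.
# Codons are ordered with each base cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def expected_protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS of cds_bp bases that ends in a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # the final codon is the stop, not an amino acid


def translate(nucleotides: str) -> str:
    """Translate a DNA string codon by codon; stop codons become '*'."""
    seq = nucleotides.upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        first, second, third = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(protein)


if __name__ == "__main__":
    print(expected_protein_length(1194))  # length implied by the 1194 bp transcript
    print(translate("ATGTTTCGTCCACTT"))   # first five codons of DPOGS207223-TA
    print(translate("GGGAATCTCTGA"))      # last four codons, including the stop
```

The same `translate` helper can be applied to the full nucleotide sequence above to compare the result against DPOGS207223-PA residue by residue.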