Monarch geneset OGS2.0

DPOGS208052
TranscriptDPOGS208052-TA1797 bp
ProteinDPOGS208052-PA598 aa
Genomic positionDPSCF300203 + 294482-313056
RNAseq coverage107x (Rank: top 60%)
Annotation
HeliconiusHMEL0178101e-14362.16% 
BombyxBGIBMGA001479-TA5e-10550.20% 
DrosophilaCG42404-PE2e-3645.99% 
EBI UniRef50UniRef50_Q9VFA83e-3445.99%CG42404 n=7 Tax=melanogaster group RepID=Q9VFA8_DROME
NCBI RefSeqNP_650437.36e-3545.99%CG42404 [Drosophila melanogaster]
NCBI nr blastpgi|3867658301e-3345.99%CG42404, isoform D [Drosophila melanogaster]
NCBI nr blastxgi|3454881912e-3829.50%PREDICTED: hypothetical protein LOC100121790 [Nasonia vitripennis]
Group
KEGG pathway 
Orthology groupMCL20715 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208052-TA
ATGGCTCAAGGCGAGTGTATAGACTTCCGTGGCAGACATATATCAAACGGTTTGCATTATGTGCCCGGCCCCGATACATGTACGCTTTGCATCTGTGAAAATGGTTTGCCTAAAGTGTGTAAGGCTGTCTTGTGTTCACCCCCACAAGATTGCCGATCATTCCGTTTGGGCAATACTTGCTGTGAGTTCATTTGTCTTGATGATGTAGTGAAACCCACTGATGGGGCGGAGGCGAATATCAGGGTTGCGGCATCAGGAGCTGCTTCAGTGGTCCTGTTAACTATAGCCCTTGTTGTATATAGAGTGAGGAAACAAAAACGACGTAGGCCTCTGCATGCCGAAGACCAAAGAAGTTTGACTAGTATTGGATATATTAGTGGCAGTATGGGGTACATGGGAGGTACTTGTGAAACAGCTCAACTGGGTGCCTGGAAGCCTCCTTCGAACTATCTTCCTAGAGGAGAAGCACCACCACCATATGAAGAGGCTATGGCACAATGCAGATCTGATCCTATGAGAGTAACAAACGAAACATCTTTCCACCGTTCATACCCTTTGGAGCCTCGAGATGAAGTGTGCGCGACCCACGCCTATGTCAATCTACCACGACCACTCGTGCAGATGGCCAATACCCAGAATTGCTACAACGCTCCACTCCTACAACCAGTACACCCGCCTGAAGCCAGGGAGGCGATCGTCATGCCGCATCCATTGGTAGCTAATATGGGGGTTGGGGGTCGCCTGACGGGGTCGCTGGGGGCTATCAGCGCCCGCCTCAGCGTACCCCGCGATGACCACGAACGTGACAGAGTCGAACGTGTCGAACGACCCCACAATGTCCCCGGCTTCTACACCACACACACGGCATTACACCGCACTATACCACGTATATCGACAGCCCTGGACACGTCCACACTAGAAGCGATGGGTTTCAGTCGAGCGGACCGGTCCCTGGCGGGGGAGGTGAGGAGATCCTTCCACAGACCAGACGCCAGGGAGAGAGACAGACCTCACACCGGGAGGAGCGTGCCCAGGAACCTGAACATCGCGTCCGCCTCGCCCGCACACGAACAGGAAGACGATCTCGCTGTGCGGCCCCGTCTCCACAGTGTCCACAACGATGAAGACAAAAAGGAGTTGACACCACACAACCACCCGACACTACCTCTTACTGTGGACAAACCATCCTGCGAGTGTAGCTACGAGGCAGCTCGCGACGCTGACGATTATCGCAGCGAGTGTGAGAACTGTCACTCAACCAACAACTCGGGTTGGGAGGAGGGCGGTAGTGAGTGGTGCGGCGGAGGAACACAGACGCTGCAGAGACGAGCGCCGCCGCCGCCCGCACAGCCCGCCGCCACCGCTACACTGCCGCAGCCGGTCACTAAGGGACAGGGTCTCGCTGTGCGGCCCCGTCTCCACAGTGTCCACAACGATGAAGACAAAAAGGAGTTGACACCACACAACCACCCGACACTACCTCTTACTGTGGACAAACCATCCTGCGAGTGTAGCTACGAGGCAGCTCGCGACGCTGACGATTATCGCAGCGAGTGTGAGAACTGTCACTCAACCAACAACTCGGGTTGGGAGGAGGGCGGTAGTGAGTGGTGCGGCGGAGGAACACAGACGCTGCAGAGACGAGCGCCGCCGCCGCCCGCACAGCCCGCCGCCACCGCTACACTGCCGCAGCCGGTCACTAAGGGACAGGGCCGTAACGGAATGGGCAACCCTTCGAACTGGGAGAACTGGTTCAATACGATCCCAGACTCCGACAGCGAGTCAGAAGAGGAATGA

Protein sequence:

>DPOGS208052-PA
MAQGECIDFRGRHISNGLHYVPGPDTCTLCICENGLPKVCKAVLCSPPQDCRSFRLGNTCCEFICLDDVVKPTDGAEANIRVAASGAASVVLLTIALVVYRVRKQKRRRPLHAEDQRSLTSIGYISGSMGYMGGTCETAQLGAWKPPSNYLPRGEAPPPYEEAMAQCRSDPMRVTNETSFHRSYPLEPRDEVCATHAYVNLPRPLVQMANTQNCYNAPLLQPVHPPEAREAIVMPHPLVANMGVGGRLTGSLGAISARLSVPRDDHERDRVERVERPHNVPGFYTTHTALHRTIPRISTALDTSTLEAMGFSRADRSLAGEVRRSFHRPDARERDRPHTGRSVPRNLNIASASPAHEQEDDLAVRPRLHSVHNDEDKKELTPHNHPTLPLTVDKPSCECSYEAARDADDYRSECENCHSTNNSGWEEGGSEWCGGGTQTLQRRAPPPPAQPAATATLPQPVTKGQGLAVRPRLHSVHNDEDKKELTPHNHPTLPLTVDKPSCECSYEAARDADDYRSECENCHSTNNSGWEEGGSEWCGGGTQTLQRRAPPPPAQPAATATLPQPVTKGQGRNGMGNPSNWENWFNTIPDSDSESEEE-