Monarch geneset OGS2.0

DPOGS204383
Transcript        DPOGS204383-TA    1524 bp
Protein           DPOGS204383-PA    507 aa
Genomic position  DPSCF300002 - 1714832-1717991
RNAseq coverage   0x (Rank: top 96%)
Annotation
Heliconius        HMEL013078          1e-143    65.23%
Bombyx            BGIBMGA003662-TA    2e-37     29.23%
Drosophila        CG14325-PB          2e-30     25.45%
EBI UniRef50      UniRef50_D6WNC5     1e-52     31.91%    Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WNC5_TRICA
NCBI RefSeq       XP_971580.1         3e-53     31.91%    PREDICTED: similar to AGAP002751-PA [Tribolium castaneum]
NCBI nr blastp    gi|158290580        2e-44     30.65%    AGAP002751-PA [Anopheles gambiae str. PEST]
NCBI nr blastx    gi|270007902        3e-52     31.68%    hypothetical protein TcasGA2_TC014646 [Tribolium castaneum]
Group
KEGG pathway
Orthology group   MCL29959    Lepidoptera specific

Nucleotide sequence:

>DPOGS204383-TA
ATGAAATTTCCTGTTGGAATTAATAAAAACACATTCATAACTTATAAAAATAAGGAAAACTTAGAGAAGGATCGAAAAATAAAATCTGAAGTTTTTGATTGGGATTCTAGACACCCAAAAAGTCTTGTAGAAATGTGTGTAGAAAAATTAAGCTTAAATTGGATGGGTGAACCTAAATTAGAGGATCTTGTTGCCAAGGACAGAGAATACTTTTTGCAAATATTAGATACAAATATACCACTACAAATTCTAGTTGATAACATTCAAAATGACATTTTTTGGAAAAAATGCTACATCTCTAAATGGTCTGATATCCCAATAGAAATAAATGATAAATCTTGGATTACTATTTTTATGGAGCGCCATTATGCGGACTTTTTGGAACATTTGAATCCAAGACACTACGATCCAGAAAAAGTCCGCAATTTAGTAAAACTATGTGGTCCATTCATACAAACGTTATCAATACGAAGTTTAGTCCCTTCAGATTTAAATGATCAGCGAAACACATCTCCAGAAATGGCTGGTTTAATATCTGGCCAAAGACAAGATATGAGTAAAATAAATAATAAAACGATGGAAACTATGCCAAGAGATCATATATCTTTGCATGCAGCTTTAAGCAGCTTAACTCATTTATCAGAATTACATATAACGTTTCAACAACGTTTTGTTGGTATTCAATATAGAAAAGACCAATTTCAGTTCACTATTAATGATGCAAAAAACTTAGCTCGTGGTTTGGAAAAGTGCACTCAACTGAAAATTCTAAGAATTACGCGAAGTAATATGAATTGTCAAAAACTTAAGTACATACTCCGGGGATTATTAGATAACCAAAATATTGACACCTTAGATTTTAGCCACTGCAAAATTGCTGATGACGGGGCATCTTCCATCGCAAAATCCATATCAAAAAGGGACAAAATTCGAAGTTTGATTCTTGCTGATAACATTTTCGGTCCTGTAGGAATTGAATATATATCACAAGTTCTGAACCATACAAGCTGTGGACTACGGACATTAGACTTAAGACTAAATAATAAACTTGGATCTGAAGGAATAGCTCAATTAGCTGTTGCTATAGCAAGAGGGTGTAATCTGACTTCATTAAATATTTCAGGTTGTGGTATAAAACCTCATGCATTATTAAAACCACCAGCTGGTGTATGGTCTGCTATAAATGCTGATAACCCACCAACATGTGGAGACCTACTCGCTAGAGCAATCGGCTTAGTGAAAACGCCCTTGAGATCCTTAGATATAAGCATTAATAACATTGGGACGCCCAGTGATAACGCCTTATCAAATGCAATTTGTTTGAGTTACCTAGTTGACATTAACTTAAAGAGATCCGGAATGGGATCTATGGCTATGGCTATTGCTGAATCAGCAGCCGCAGCTCAAAGATTGCGGAGAGAAGCAGAGAAAGGAATCCGTTTCAGGCGAAGTGCTGGACGTCTAGTTCAGGCTCGGAGAGTTGTTAAAGGAATGAATATAGGAGGCAAAGATCCTTACAATTAG

Protein sequence:

>DPOGS204383-PA
MKFPVGINKNTFITYKNKENLEKDRKIKSEVFDWDSRHPKSLVEMCVEKLSLNWMGEPKLEDLVAKDREYFLQILDTNIPLQILVDNIQNDIFWKKCYISKWSDIPIEINDKSWITIFMERHYADFLEHLNPRHYDPEKVRNLVKLCGPFIQTLSIRSLVPSDLNDQRNTSPEMAGLISGQRQDMSKINNKTMETMPRDHISLHAALSSLTHLSELHITFQQRFVGIQYRKDQFQFTINDAKNLARGLEKCTQLKILRITRSNMNCQKLKYILRGLLDNQNIDTLDFSHCKIADDGASSIAKSISKRDKIRSLILADNIFGPVGIEYISQVLNHTSCGLRTLDLRLNNKLGSEGIAQLAVAIARGCNLTSLNISGCGIKPHALLKPPAGVWSAINADNPPTCGDLLARAIGLVKTPLRSLDISINNIGTPSDNALSNAICLSYLVDINLKRSGMGSMAMAIAESAAAAQRLRREAEKGIRFRRSAGRLVQARRVVKGMNIGGKDPYN-
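
The transcript and protein lengths listed in this record can be cross-checked with a little arithmetic. The sketch below assumes that the listed transcript is coding sequence only (no UTRs) and ends in a single stop codon; the function names `expected_protein_length` and `has_start_and_stop` are hypothetical helpers introduced for illustration and are not part of the database.

```python
# Lengths copied from the record above.
TRANSCRIPT_BP = 1524
PROTEIN_AA = 507

STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS that ends in one stop codon.

    Assumption: the whole transcript is coding sequence.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # the terminal stop codon encodes no residue


def has_start_and_stop(cds: str) -> bool:
    """True if the sequence begins with ATG and ends with a stop codon."""
    cds = cds.upper()
    return cds.startswith("ATG") and cds[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Compare the length implied by the transcript with the listed protein length.
    print(expected_protein_length(TRANSCRIPT_BP) == PROTEIN_AA)
```

For the listed values, 1524 bp corresponds to 508 codons, that is 507 residues plus the stop codon, which agrees with the 507 aa protein. The nucleotide sequence above also begins with ATG and ends with TAG, and the trailing `-` in the protein sequence marks that stop.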