Monarch geneset OGS2.0

DPOGS203211
TranscriptDPOGS203211-TA1185 bp
ProteinDPOGS203211-PA394 aa
Genomic positionDPSCF300035 + 754149-759399
RNAseq coverage1137x (Rank: top 11%)
Annotation
HeliconiusHMEL0109767e-17183.52% 
BombyxBGIBMGA011094-TA1e-14472.83% 
DrosophilaCG8927-PA1e-4066.97% 
EBI UniRef50UniRef50_E9INK82e-4360.26%Putative uncharacterized protein (Fragment) n=1 Tax=Solenopsis invicta RepID=E9INK8_SOLIN
NCBI RefSeqXP_001688348.12e-5140.32%AGAP003037-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1107594154e-4537.33%PREDICTED: hypothetical protein LOC552217 isoform 2 [Apis mellifera]
NCBI nr blastxgi|3123719974e-5641.67%hypothetical protein AND_20724 [Anopheles darlingi]
Group
KEGG pathway 
Orthology groupMCL17854 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203211-TA
TTGGTTACCTTGCAATATGTATGGTGGTCGCGTCGGGCGCAGGGCGAGAGTGGGCGACCTCATATAGCTGCGCAGACAGCATTTCCTTTATTTGTTGTTCGTACGCGCCGTGATAGTGTCGTGCCACAGGACATCATGCGGACATTACTGATTTGTATGCTGTGCGTAGTCACAGTTTATGCACAATATGATTCACCAGCACCACCAAGGATCCAAATCCCAGGAGCAATTTTGGGAGCACCAGTGGGAAGACAGAGTGGTTTCAGACAGGGAAGATTACAAGCTATCCCGATTACAGGCCCTATCGCCAGAATACGCAGACCAGGACTCGCTCGACCATCTTTTAAATCTGTAGATGCAGCACCGCGCCCCAGCCTTCAATCTCTTGAAGAGCCTTCAAAGCCCGTCACAGAGGAGCCGGAAGATGACGCAGATGTCAACACACCAACATCTTTTGTATCTAGTATTTTCTCTACAGCAGAACCGCAAAATGATTTCTACGACATTACAACCACGTCGCAACCTAAACCACCAGCGGCTTATAACTCAGCACAATTCCAACAGACAACACCATCAGCACCAAAACCTGAACCAATAAGACCTACACAATATAGACCTTTATTACCTCAATCGTTTAGCCCTGTAAGGAATGAACCACAGAACAGACCGCAGCCGGCAAGGCTCCAGCCTTTATCCAGTAGACCACAACGACCCTCTTTCAGACCTGAACCAAAACCATTTGTAGAAGAAGAAGATGATTATGTTCAACCAGTAAGACAATATTCGAGACCACCAGTTAAGAGTGTACCACAACAAAAGTATACCCCATCAAATTCGAGGGAAAAGAAGCCTGTGGCTCAAATCATTCGTAAATTTAGAGATGAAAATGAAGATGGAAGCATCACATGGGGATTTGAGAATGACGATGGAACGTTTAAAGAGGAAACGATTGGCGTCGACTGTATCACTCGTGGCAAGTACGGATATGTCGACCCCGATGGTCTGAAACGCGAATACAACTATGAGACTGGAATTGCGTGCGATAAGACCAAAGAGGATAAAGAACAAAAAGGTTTCATTGATTATCAAGAAAATAAGGCTGTATTACCAAACGGTATTACCATAGATCTCAATGCCATGGGTAAAAAGTCTAAAAGGCCATTTAGATCAGCGGGTAATAACTAA

Protein sequence:

>DPOGS203211-PA
LVTLQYVWWSRRAQGESGRPHIAAQTAFPLFVVRTRRDSVVPQDIMRTLLICMLCVVTVYAQYDSPAPPRIQIPGAILGAPVGRQSGFRQGRLQAIPITGPIARIRRPGLARPSFKSVDAAPRPSLQSLEEPSKPVTEEPEDDADVNTPTSFVSSIFSTAEPQNDFYDITTTSQPKPPAAYNSAQFQQTTPSAPKPEPIRPTQYRPLLPQSFSPVRNEPQNRPQPARLQPLSSRPQRPSFRPEPKPFVEEEDDYVQPVRQYSRPPVKSVPQQKYTPSNSREKKPVAQIIRKFRDENEDGSITWGFENDDGTFKEETIGVDCITRGKYGYVDPDGLKREYNYETGIACDKTKEDKEQKGFIDYQENKAVLPNGITIDLNAMGKKSKRPFRSAGNN-