Monarch geneset OGS2.0

DPOGS212058
TranscriptDPOGS212058-TA1665 bp
ProteinDPOGS212058-PA554 aa
Genomic positionDPSCF300317 - 149299-153751
RNAseq coverage1064x (Rank: top 12%)
Annotation
HeliconiusHMEL0093570.071.94% 
BombyxBGIBMGA009636-TA0.062.69% 
DrosophilaCG5726-PA2e-1224.17% 
EBI UniRef50UniRef50_Q7JRH53e-1024.17%CG5726 n=13 Tax=Drosophila RepID=Q7JRH5_DROME
NCBI RefSeqXP_002091963.12e-1223.43%GE13927 [Drosophila yakuba]
NCBI nr blastpgi|1954875683e-1123.43%GE13927 [Drosophila yakuba]
NCBI nr blastxgi|1892392934e-1725.98%PREDICTED: hypothetical protein [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL25632 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212058-TA
ATGTCATCGGGTCGCGGTAGGGGTTACAGCGGGAGACATCATCTAAGAGAACCTCGTCCTGGTGATACGGAAGACGAAATCAATGCTAGGACACTAGCGGAACTCAGCCCTCGAGCTACAGATACAGAGGTCATGGAGGAGAAGGACGACGACAACGAGGACTCGATAGCGGAGCTGCTGCACGTGGACGAACAGTTCGCGCCGCTACTCATATTGTTGGAACAATTCTCTCCCGGTGAAGACGGCATACAATTCAACCGGAAGTTACGTCACTTCGAGACCACCGTCACCAGCATGTGTCCGGACGAGGCCAGGTTACAGCAGGCGTTCGCGTCTTTCCGTGCGGCGGCCCTGTGCAGTGCTGGCACGTGCCGTCGCCTCGCGGCAGTCGGAGCGTCCTTCACGAGGCAGCAGCGGCAGCAGTTACTTAGGACGACGTTACTGAACGTCATCATGCAGGGGACCTTCACTAAGCTGGAGGCATTGAAGAGATCCAATCCCATTTACTTAATAAACGCAGCTAATCTCATGGGAGATTATTTTGCAGAGGCTCGCCTTTGTAACGGTAACAAAGTGCACATATTGGCCAGCCCCTTACTGCAATACATGCGAGCTCTGCTAGCCAGCGACGATCTTAGAGCGCACAGAAGTCTAGCCACGCAGCTGATGCAGAACGGTCGCGAACTGTTGTCAGTCTTGTCCCAGGAGCTAGACGAGCTCTCTATATCAATCCGTCTTCGCCTACTCTCCCCTCCACCCGTTAGCATAATCTGGCTGCTCTTATCCGCGGACCTCTGCCTCAACAAGTTTCTGCCGCTTCCGAACACGCTGCAACAGTTCTACGCTGCCCATCTAGATCTAACAACGGACGAGAACTTCAGCGAAGTCAGCTACGGCTCGTGGAAGAAGAATAAGGAAGACCACAGAACGGAAACGGAAACTGACAAGGTCAGCCAGAATATGTCGCAGATAAGCATCAACCAAGACGAGGACACCAAGCCGAAACTACGCCCCATTTTGGGTGCCGGTGCTGGGTTGTTACGTCAAGAACAACTGACGCAGGACAGCTTCGAAACGTGGACTAAGAGTAATCTGACGAAGAGATACGATCTGAACTCGTGGAGACAGACGGACGAACAGAAGGAGGAAACGGAAACGGTGTTCAATCCGACGGTGATGCCTCCCAACATGCAGAACTACGCGGTACCAGACTATCCCCCGCCGAACTTCAACTATCAAGAGAACTACGCCAACTACCTCCAGCAGGGGGACGGAGCTTACATACAGAACATCAACAACGCACAGAACTACACAGCACAGGAATACGTGCCGCAGTACAACGAGCGGCCCGACAGGGTCGAGAGGATCAACAACGAGGTCCAGAAGACGGCGAGCAGGCGCCCCATAGAGAACTGGCGGAGGGAGAAACCGGATAAGGAAGACGCCAAGAACAGGAACTGGAGCAGACAGCTCTCCAAGGATAATGTTAACGACGACAGCGACGGAGAGAGACTGCGCTCGGCCTCCAGGAGTTCCAGGGATTCCAGGTCAGGGAGGGCGAGGGACGGCGACAAGACCCCGCCCCGGAAACACGTCACAGAAGTGCCCAAGAACCACAAGTACTGGGACCACGACGACCGCTGCGACAAGGACTACAACAGTTAA

Protein sequence:

>DPOGS212058-PA
MSSGRGRGYSGRHHLREPRPGDTEDEINARTLAELSPRATDTEVMEEKDDDNEDSIAELLHVDEQFAPLLILLEQFSPGEDGIQFNRKLRHFETTVTSMCPDEARLQQAFASFRAAALCSAGTCRRLAAVGASFTRQQRQQLLRTTLLNVIMQGTFTKLEALKRSNPIYLINAANLMGDYFAEARLCNGNKVHILASPLLQYMRALLASDDLRAHRSLATQLMQNGRELLSVLSQELDELSISIRLRLLSPPPVSIIWLLLSADLCLNKFLPLPNTLQQFYAAHLDLTTDENFSEVSYGSWKKNKEDHRTETETDKVSQNMSQISINQDEDTKPKLRPILGAGAGLLRQEQLTQDSFETWTKSNLTKRYDLNSWRQTDEQKEETETVFNPTVMPPNMQNYAVPDYPPPNFNYQENYANYLQQGDGAYIQNINNAQNYTAQEYVPQYNERPDRVERINNEVQKTASRRPIENWRREKPDKEDAKNRNWSRQLSKDNVNDDSDGERLRSASRSSRDSRSGRARDGDKTPPRKHVTEVPKNHKYWDHDDRCDKDYNS-