Monarch geneset OGS2.0

DPOGS208581
TranscriptDPOGS208581-TA1185 bp
ProteinDPOGS208581-PA394 aa
Genomic positionDPSCF300064 + 1809130-1810885
RNAseq coverage400x (Rank: top 30%)
Annotation
HeliconiusHMEL0049367e-17071.32% 
BombyxBGIBMGA010593-TA3e-14569.05% 
DrosophilamRpL38-PA5e-10042.89% 
EBI UniRef50UniRef50_UPI00017917A37e-11247.37%UPI00017917A3 related cluster n=1 Tax=unknown RepID=UPI00017917A3
NCBI RefSeqXP_967732.14e-11854.12%PREDICTED: similar to mitochondrial ribosomal protein, L38, putative [Tribolium castaneum]
NCBI nr blastpgi|3323739884e-11953.03%unknown [Dendroctonus ponderosae]
NCBI nr blastxgi|910797823e-12854.12%PREDICTED: similar to mitochondrial ribosomal protein, L38, putative [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[114-297] IPR0089141.7e-42Phosphatidylethanolamine-binding protein PEBP
Orthology groupMCL13970 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208581-TA
ATGAACTCTATACGTTGCCTTTGGCGCTTAAACAACATACAACTCCAGCAAATTAGGTTAGGTCATAGAATCAGAGGAAAAGCACCAATATACTGTAGAACTATTAAAGAAAGAATAGATGAGCTCAATTATAAGGATGAGTTATATACGACGAGAATTGATATCGGATTTCCGTCTGAAAAGAAAAATTCATTGTCAGCCCGTGAAGATCGTATTGCTAAAGTTCAGAAAGCTAAAAGTGATAAAAACTTAGAACAACTAGCAAGAAATTTACAGTTGGAAATTAATATGGAGCAATCACGAAAGGACTGGCTGCAGTCCTTGGGGCCCTTGCATAAGAAACAGATAGCAGACCACTATGCAATATTTGAACATTTATATGGAGAGGGTTTCTTTGTACCACATTTAGATTTAGAAGTGTTTTATGATCTAGGAGATGGCAATTGTTTACCTGTATATTATGGTAATGTGGTAAAACCTGCCGAGGCTTCACAAAGTCCTATAGTTAGCTACGAATCTGATGGCAATTCACTATGGACTCTTGTTTTGACAAATCTTGATGGGCATTTGAAAGATAATGAAAAGGAGTATGTTCATTGGATGGTTTCAAATATTCCTGGTAACTGTATAGAGAAAGGAGATGTCATTTTTGATTATTTACGTCCCTTTCCTGTAAAAGGTACTGGTTACCATAGATATGTTTTTGTACTATACAAGCAAGATGTGCAAATGAAATATGATTTGCCAAAAGTTTCATCAGCATCCTTGGAAGACAGAACATTTCAAACAAGAGAATGGTATAAGAAATATCAAGACAACATAACACCAATTGGTCTTGCATTCTTTCAAAGCGATTGGGACAAAACAGTAAAAGATTTCTTCCACACAACCTTAAATATGAAAGAACCAGTGTATGAGTATGATTTCCCTGATCCATACATACGTCCACAAGAATGGTTCCCTCGACGAAAACCATTTAATATATACATGGACAAATACAGGGATCCAAAACAAATAAATAAAGAATATTTGTTACGTAAACTTAAAAACGAGCATCCGTTCAGAAGCCCAGCACCTCCACTCCAATTCCCAAATGCTCATCCCTTCCCAAAAACGATGCCATCATGGTTGAAACTACACGAACAGAAGATAAGACTCAAATGGGGAAGAATAAACGATGTTTGA

Protein sequence:

>DPOGS208581-PA
MNSIRCLWRLNNIQLQQIRLGHRIRGKAPIYCRTIKERIDELNYKDELYTTRIDIGFPSEKKNSLSAREDRIAKVQKAKSDKNLEQLARNLQLEINMEQSRKDWLQSLGPLHKKQIADHYAIFEHLYGEGFFVPHLDLEVFYDLGDGNCLPVYYGNVVKPAEASQSPIVSYESDGNSLWTLVLTNLDGHLKDNEKEYVHWMVSNIPGNCIEKGDVIFDYLRPFPVKGTGYHRYVFVLYKQDVQMKYDLPKVSSASLEDRTFQTREWYKKYQDNITPIGLAFFQSDWDKTVKDFFHTTLNMKEPVYEYDFPDPYIRPQEWFPRRKPFNIYMDKYRDPKQINKEYLLRKLKNEHPFRSPAPPLQFPNAHPFPKTMPSWLKLHEQKIRLKWGRINDV-