Monarch geneset OGS2.0

DPOGS209781
Transcript: DPOGS209781-TA, 1299 bp
Protein: DPOGS209781-PA, 432 aa
Genomic position: DPSCF300117 - 1128529-1137241
RNAseq coverage: 1642x (Rank: top 8%)
Annotation
Heliconius: HMEL012159, 0.0, 91.28%
Bombyx: BGIBMGA008006-TA, 0.0, 87.19%
Drosophila: CG30497-PA, 1e-132, 66.57%
EBI UniRef50: UniRef50_E2BC86, 1e-138, 67.30%, Protein FAM46C n=1 Tax=Harpegnathos saltator RepID=E2BC86_HARSA
NCBI RefSeq: NP_001135818.1, 2e-147, 67.18%, hypothetical protein LOC100119622 [Nasonia vitripennis]
NCBI nr blastp: gi|307169090, 5e-147, 67.01%, Protein FAM46A [Camponotus floridanus]
NCBI nr blastx: gi|383861588, 2e-140, 66.84%, PREDICTED: protein FAM46A-like [Megachile rotundata]
Group
KEGG pathway 
InterPro domain: [45-375] IPR012937, 6.9e-141, Domain of unknown function DUF1693
Orthology group: MCL10902, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209781-TA
ATGGGCGTTATCAAAGTGACTCCGGACACCGCGTTCCTCATCTCAGCCTGTGATATGTTCGATGATATGAAGAGCGAATCGAGCTACACCTCTTCGGAAGAAGGCTGCGGTGGTGGTGAACGTCATGCCGTGCTGAGTTACGAGCAAGTGCGGAGGCTGAATGACGTCATGGACGAGGTGGTCCCCATACACGGGAGGGGCAACTTCCCCACACTGCACGTACGCCTTCGAGAGTTGGTAGCTGGAGTGAGGGCTCGGCTTGAGACCGCTCAGAGCGAGGGCGGCGCCGGCATCACAGTTAGAGATGTCCGTCTGAACGGAGGCGCAGCGAGCAACGTGCTAGCGGACCGCCCGCAGCCATACTCGGACATTGACCTCATATTCACAGCGGACCTGCCCACGGCTCGACACTGTGACCGCGTCAAGGCCGCAGTCCTCAGTCACCTCTCGACCTTAATGCCGGCCGCCACACCCCGCCGGCGGGCTTCGCGGGCTTCACCCGCCGGCCTGAAGGAGGCCTACGTCTCTAAGATGGTCCGCGTCAACTCGGACGGCGATCGTTGGTCTCTCATCTCTCTCGGCAACTCCCGCGGACACAAATCAGTGGAGCTCAAGTTCGTGGACACGATGCGCAGGCAGTTCGAGTTCTCGGTAGACTCGTTCCAAATTGCTTTGGATTCTCTTCTGGCGTTCCACGAATGCGCCCAGCTGCCCATCGGCGAGAACTTCTACCCGACCGTGGTCGGCGAGTCCGTGTATGGAGATTACTCCGAGGCGCTACTCCACTTGTCGGAGAAATTAATAGCGACGCGACAGCCTGAAGAGATCCGCGGCGGCGGTCTGCTGAAGTACTGCGCTCTGTTGGCCAAAGGCTACCGACCGGCCAGACCCGACAAGATCAAGATCCTCGAGCGGTACATGTGCTCGAGATTTTTCATAGATTTTCCCGAACTCGGACAACAGCGGGCCAAGCTCGAAGCGTACTTACGGAACCACTTCGTGGGGCGTGACGAGGAGGCTTTAAAGCACAGGTACCTTACATTGCTGCACGGCGTGGTGAGGGAGTCGACGGTGTGTCTGATGGGCCACGAGCGGCGGCAGACGCTTGCCTTGATCGAGGCGCTGGCTTGCCGCGAGCTGTGCGCTCGTGCGCCGGTCCTGATGCCGGTGCCGATGCCGGTGCCGCCGCCCGGGTACTACGTGTGCGTGTGTGCCGCTTGCGCCGCGTGTGCCGCTTGCGCCGCGTGCGCCGCCCCCGCCTGCTGCGACTGCTGCCGCCCTGCTCTCTGCCCGGCGTGA

Protein sequence:

>DPOGS209781-PA
MGVIKVTPDTAFLISACDMFDDMKSESSYTSSEEGCGGGERHAVLSYEQVRRLNDVMDEVVPIHGRGNFPTLHVRLRELVAGVRARLETAQSEGGAGITVRDVRLNGGAASNVLADRPQPYSDIDLIFTADLPTARHCDRVKAAVLSHLSTLMPAATPRRRASRASPAGLKEAYVSKMVRVNSDGDRWSLISLGNSRGHKSVELKFVDTMRRQFEFSVDSFQIALDSLLAFHECAQLPIGENFYPTVVGESVYGDYSEALLHLSEKLIATRQPEEIRGGGLLKYCALLAKGYRPARPDKIKILERYMCSRFFIDFPELGQQRAKLEAYLRNHFVGRDEEALKHRYLTLLHGVVRESTVCLMGHERRQTLALIEALACRELCARAPVLMPVPMPVPPPGYYVCVCAACAACAACAACAAPACCDCCRPALCPA-
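
The transcript and protein records above can be cross-checked against each other. The following is a minimal Python sketch, assuming the transcript is a complete coding sequence read in frame from its first base and translated with the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative and do not come from the geneset or any particular tool. The stop codon is written as `-` to match the protein record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
# This table choice is an assumption; the record does not state which code was used.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six codons and last three codons of DPOGS209781-TA.
print(translate("ATGGGCGTTATCAAAGTG"))
print(translate("CCGGCGTGA"))

# 1299 bp is 433 codons; dropping the stop codon leaves 432 residues.
print(1299 // 3 - 1)
```

Applied to the full DPOGS209781-TA sequence, the same function should give a string that can be compared directly with DPOGS209781-PA, including the trailing `-`.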