Monarch geneset OGS2.0

DPOGS208966
TranscriptDPOGS208966-TA1095 bp
ProteinDPOGS208966-PA364 aa
Genomic positionDPSCF300009 + 950220-951546
RNAseq coverage173x (Rank: top 50%)
Annotation
HeliconiusHMEL0038890.084.34% 
BombyxBGIBMGA002439-TA1e-16780.77% 
DrosophilaCG12128-PA8e-11058.63% 
EBI UniRef50UniRef50_A7UR976e-11061.14%AGAP007570-PA n=4 Tax=Culicidae RepID=A7UR97_ANOGA
NCBI RefSeqXP_970427.12e-11966.56%PREDICTED: similar to CG12128 CG12128-PA [Tribolium castaneum]
NCBI nr blastpgi|910918924e-11866.56%PREDICTED: similar to CG12128 CG12128-PA [Tribolium castaneum]
NCBI nr blastxgi|910918923e-11462.05%PREDICTED: similar to CG12128 CG12128-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[4-336] IPR0037504.2e-169Protein of unknown function DUF171
[133-215] IPR0160275.8e-25Nucleic acid-binding, OB-fold-like
Orthology groupMCL14339 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208966-TA
ATGTTACTAGAAAAAGAGATTGAGGTAAAGAATAAGGAGGAAATGGAAAAAGAGAAGAAATATTCAGAGATTTCTACAGTAAGCATAGCTGTCCCAGGGTCGATATTGGAGAATGCTCAATCAGCTGAACTACGTACGTATTTGGCGGGACAAATTGCGAGGGCGGCGTGTGTGTTTTGTGTTGATGAAATAATAGTGTTTGATGACATTGGTGATAAAGTAGACACCAAAAAATCAAAGCTGGAAGATAATAGTGGCACCAAAATTGCACGTAAGAGTTGTGTGCAGTTGGCAAGAATTTTACAGTACTTAGAATGTCCTCAATATTTGAGAAAACATTTTTTTCCATTGCATAAGGATTTAGAGTTTGCTGGTCTTCTCAATCCTTTGGATGCTCCTCATCATTTACGGATGTCAAACGATTTTCAATTCAGGGAAGGAATAACAATGAATAAGAATGTTAAACCTGGTAAAGGCTCACAAGTTAATGTTGGTTTGTTACAAGATGTATCCACTGATAAGTTGCTTAATCCTGGCATAAGAGTTACTGTTAAATTACTTCCACTGACAGAGGGTAAGAAAAAATTGAAAGGGAAAATTGTCAGTCTGTCCACACCAAGAGCCGAAACTGGAGTGTATTGGGGTTACACTGTTAGAATCGCAAATAACCTTAGTCAAGTGTTTACCCAGTGTCCATACAAGGATGGATATGATTTGACTATAGGAACCTCAGATAAAGGAACACCAATAGATGATTTGCCCAACAAAGAAGTTAGATACAACCATGCTCTAATAGTTTTTGGTGGTTTACATGGCATAGAAGCAGCATTGGAAAGTGACGAGCAATTACAGGTGGACGAAGCCAGTTTACTGTTTAACCATTATGTAAATGTGCTTCCCAATCAGGGGTCAAGAACAATCCGTACAGAGGAAGCCATTCTCATAGCGATGTCATGTTTGCAATCAAAGTTAAAGCCTAACAATGAACCAATGGTTTTTGAGAAAACTGGAATAGCAGTCAGCTCTTCATTCCCGAAAAATAAATCAAATGACTCACCCAATGACAAAAACATAGACTTAAGTAAATTTGATTAA

Protein sequence:

>DPOGS208966-PA
MLLEKEIEVKNKEEMEKEKKYSEISTVSIAVPGSILENAQSAELRTYLAGQIARAACVFCVDEIIVFDDIGDKVDTKKSKLEDNSGTKIARKSCVQLARILQYLECPQYLRKHFFPLHKDLEFAGLLNPLDAPHHLRMSNDFQFREGITMNKNVKPGKGSQVNVGLLQDVSTDKLLNPGIRVTVKLLPLTEGKKKLKGKIVSLSTPRAETGVYWGYTVRIANNLSQVFTQCPYKDGYDLTIGTSDKGTPIDDLPNKEVRYNHALIVFGGLHGIEAALESDEQLQVDEASLLFNHYVNVLPNQGSRTIRTEEAILIAMSCLQSKLKPNNEPMVFEKTGIAVSSSFPKNKSNDSPNDKNIDLSKFD-