Monarch geneset OGS2.0

DPOGS202747
TranscriptDPOGS202747-TA1086 bp
ProteinDPOGS202747-PA361 aa
Genomic positionDPSCF300464 - 1523-2608
RNAseq coverage0x (Rank: top 96%)
Annotation
HeliconiusHMEL0115237e-16374.52% 
BombyxBGIBMGA001748-TA2e-15267.87% 
DrosophilaCG6073-PB2e-7541.62% 
EBI UniRef50UniRef50_E0VIR64e-7445.30%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VIR6_PEDHC
NCBI RefSeqXP_396642.25e-8343.99%PREDICTED: similar to CG6073-PA [Apis mellifera]
NCBI nr blastpgi|3504073606e-8343.44%PREDICTED: FAM203 family protein GA19338-like [Bombus impatiens]
NCBI nr blastxgi|1565497343e-8445.63%PREDICTED: FAM203 family protein GA19338-like [Nasonia vitripennis]
Group
Gene OntologyGO:00054887.6e-16binding
KEGG pathway 
InterPro domain[102-274] IPR0072053.3e-47Uncharacterised domain UPF0507
[281-335] IPR0072065.3e-21Uncharacterised domain UPF0507, C-terminal
[6-355] IPR0160247.6e-16Armadillo-type fold
[8-313] IPR0119891.9e-12Armadillo-like helical
Orthology groupMCL12051 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202747-TA
ATGGATCAGGATCCACTGAATGAGCTGCTGGAATTCCTCAAACCAGAGTCTAGGTTAGACCTAAAACACATATCTCTCGACCATTTACTAGGTCTGTCCGGTACTGAAGACGGGATCCAAGTTCTACTAAATAATGAGAAAATAATACTATGTATCATCGAGTTGACAGACGATAAAGTCGCGGAAATAAGTAAAAACGCCTTACTTGTGCTCGTTAATGTCACGGCGAACGAAAAAGGTGCGCTAGATTTGTTAAAATACAGACCCAGTAGGAAAAAAAACATAATCGAACTTTTGATCGGTTACATACTAAATCCGGACAAAAAAGAAGCTGACGCTGCCTGCATGATACTATCCAACCTGACGAGATCGGAAAACGCTGTCGAAGTGTGCACGGACACGTTCCTACCGCACTTAAACGATTTATTAAACGTGTTTGTCAACACTAGTTACAACAAAACCGGATCTAACCTGAACTACCTCGCGCCTATATTTAGTAATCTGAGCTGTTCCCCGCGAGTCAGGAAGTGGCTTACAGACGAAAACCCCCATGTGCCACTAATTAAATTACTACCGTTCTGTAATTACCAAGCCTCCAGCATCCGGAGAGGCGGTGCTATAGGCACTGTTAGGAACATATCGTTTGACACAAACTATCACGAATTTCTACTATCAAATGACATAGATCTGTTGACTTACCTGCTGTCCCCGTTAATGGGCAGCGAAGACTATCCAGACGATGAAATGGAACAGCTACCGATAGCCCTCCAGTATTTGGGCAAAGAAAAACACAGGGACCCCGATATTGATATACGGAAAATGATTATTGAAACGTTGAACAAACTATGCGCCAAACGTAACATTAGGGAGGTACTGCGGGATAATGGTGTTTATTACGTCTTAAGAGAATATCACAAATGGGAGAAAGATCCGAACACTTTGCTGGCTTGCGAGAATGTGGTAGACATTTTAATACAGAAGGAAGAAGAAGTTGGTGCGGAAGATTTATCTAAAGTGGACATTCCAGAGGAATTGAAAGGCAAATTTGAGGAAATGGACAAAGAATATGTTGATAATGTACAATAA

Protein sequence:

>DPOGS202747-PA
MDQDPLNELLEFLKPESRLDLKHISLDHLLGLSGTEDGIQVLLNNEKIILCIIELTDDKVAEISKNALLVLVNVTANEKGALDLLKYRPSRKKNIIELLIGYILNPDKKEADAACMILSNLTRSENAVEVCTDTFLPHLNDLLNVFVNTSYNKTGSNLNYLAPIFSNLSCSPRVRKWLTDENPHVPLIKLLPFCNYQASSIRRGGAIGTVRNISFDTNYHEFLLSNDIDLLTYLLSPLMGSEDYPDDEMEQLPIALQYLGKEKHRDPDIDIRKMIIETLNKLCAKRNIREVLRDNGVYYVLREYHKWEKDPNTLLACENVVDILIQKEEEVGAEDLSKVDIPEELKGKFEEMDKEYVDNVQ-