Monarch geneset OGS2.0

DPOGS205082
TranscriptDPOGS205082-TA1929 bp
ProteinDPOGS205082-PA642 aa
Genomic positionDPSCF300074 + 152492-157842
RNAseq coverage93x (Rank: top 62%)
Annotation
HeliconiusHMEL0057425e-14057.05% 
BombyxBGIBMGA006877-TA8e-17367.42% 
DrosophilaCG5439-PA5e-2336.36% 
EBI UniRef50UniRef50_E0VG856e-7233.55%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VG85_PEDHC
NCBI RefSeqXP_001606759.11e-7432.13%PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3504005791e-7530.65%PREDICTED: hypothetical protein LOC100740036 [Bombus impatiens]
NCBI nr blastxgi|910913883e-8334.17%PREDICTED: similar to CG5439 CG5439-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055151.4e-22protein binding
GO:00071541.4e-22cell communication
GO:00350911.4e-22phosphatidylinositol binding
KEGG pathway 
InterPro domain[499-600] IPR0016831.4e-22Phox homologous domain
[132-193] IPR0040122.8e-17RUN
Orthology groupMCL13686 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205082-TA
ATGAACAATAAAATTTTGAATACCCTAGCAAACGTAATAGATAACAAGGACGATATAAAGAGAAGGCTTGAACTTGGCCTAGCTACTTTGCGTGAGCTACAAGTCTGTGTTGATAATTGTCAACACCGTTTTGGTGGTAAATCTGAATTGGCAACAGAAGATGACATAAGAATAGTTAACTTATGTGAAAAATGGGAAAAATTATTGAGCCATGGACTTAAAACAAGTTTGTCAAATTCCACAATACAGAATTTTGTTACAGCTGGATTAAATTTCACCTTTAATATAGTAAATGTTGGGAATTCTTTATGGAGCTACAGTTGTCTTCATCTTACAAAACATGAGAAGGAAAGGTTCAAAATACTTTCACACATTAACACTCCTTTGGGGTACTTTCGAGCTTTCTTACGAGCCTCACTAAATGAGAGGTCCCTGGAAAGGTATTTGCAGAGCTGGATCTCACATGGCCTTCTAGCAGAATATTATGATGAAGGTTCATTTGTTAGGAGTCCAGAAGCACAATTACTTCCCGGTATAGCCAAAGGCTTATCAACCATACTGTTTGCTTTATCAATCGACAGAGCAGAGATGAATGAATCACAACAACCCAGTAATATAAACAAAGCCGAGCTATTAATACCAGTACCCACACCTGTTAGGACGTCTGGCAATTCAAAACGTAAGCCATTTAAACAGGTTATTTCTTTTGACAAAGTTGATGATAAAAAGGAAGTTACACAGAAGAGATCACTGGATACAATACAGAGTGCTACAGAATGTTCCTGGAGTAGTGCACCCGCTACGTGTCTTAACTCACCAGATCCTAAAATAGTACCAAAAGCATCAAACAGTGAACCAACTAGTTCGGATTACAGGAACAGTCTAAGATACTTCTTTCCGGATAGCGTTAAGGCAATGGAAAATCCACTGCAAATACTATCGAAGTTATCTGAAAGCGCCAAAGAAATATTTTCCAGTTCACAGAACAGTATAGATAAGAATGCTAAAGACGACGTGTCCGATTTGAGTGTGAATTGTCTCAAGTTATCGGAGAGTGACAGTGAAGAGGTGGCAGGAAGTATAGATGGAAGTACATCATGTTTGGAGCTTTGCTTCACAGAAGACGAGACTCACGATAACAACAGCGTCAGTTCCGCTTTAGATAGCAGAGAAAAGGACTTTATGAACTTGCAAATAAAGTTCAATCAGTATGAAGCGTCGAGTAAAGAGAAAATACATAAACTAGCAAAAGTCATTATAGATCTAAGCAAAGAAAATGATAGATTAAAAGATCAAATCAGAAACTATATGTCGGCTGTGGAGATGGGTAGAGCGATGAAGGACAATGAAAATACAGAACAAGAAATAGATATGTATGAGAGAAAATTGGTTCAGGTAGCAGAAATGCATGCAGAACTTATGGAGTTCAATCAACATTTACAAAGACGTTTACAAGATTTGGAGACCAGCGGTTTGGAAGTGCTCGATATGCCCGAGTCAAACGTCAAGGCTTACATACCAAGTGCATTCCTGGTTGGCAAAAAAACTCAGACGTATCATGTTTATCAGGTTTTCCTAAAGCTTGGCAGCGAAGAATGGAACGTGTACCATAGATATGCCAAGTTCCATGAACTGCACACGCAACTTAAAAAGTGCCATCCCGATATAGCCAGCTACAATTTTCCCCCCAAGAAGACGTTAAGGAAACGCGACACGCGCGTGGTGGAGTCTCGCCGTGTAGCCCTGCAGTCCTATTTACGTCATGTTTTGCTGTCGCTACCCGAACTAAGGAACTGCACCAGTCGCGCCGCTCTCACTACACTACTGCCTTTCTTTGGAACTTCGTCAACAACGAAAGAGGATGGTCTGAACATATTGCCATCAAGATCACAGTCCACAAACAACGTGTCAACAATCGATGGGCTTTGA

Protein sequence:

>DPOGS205082-PA
MNNKILNTLANVIDNKDDIKRRLELGLATLRELQVCVDNCQHRFGGKSELATEDDIRIVNLCEKWEKLLSHGLKTSLSNSTIQNFVTAGLNFTFNIVNVGNSLWSYSCLHLTKHEKERFKILSHINTPLGYFRAFLRASLNERSLERYLQSWISHGLLAEYYDEGSFVRSPEAQLLPGIAKGLSTILFALSIDRAEMNESQQPSNINKAELLIPVPTPVRTSGNSKRKPFKQVISFDKVDDKKEVTQKRSLDTIQSATECSWSSAPATCLNSPDPKIVPKASNSEPTSSDYRNSLRYFFPDSVKAMENPLQILSKLSESAKEIFSSSQNSIDKNAKDDVSDLSVNCLKLSESDSEEVAGSIDGSTSCLELCFTEDETHDNNSVSSALDSREKDFMNLQIKFNQYEASSKEKIHKLAKVIIDLSKENDRLKDQIRNYMSAVEMGRAMKDNENTEQEIDMYERKLVQVAEMHAELMEFNQHLQRRLQDLETSGLEVLDMPESNVKAYIPSAFLVGKKTQTYHVYQVFLKLGSEEWNVYHRYAKFHELHTQLKKCHPDIASYNFPPKKTLRKRDTRVVESRRVALQSYLRHVLLSLPELRNCTSRAALTTLLPFFGTSSTTKEDGLNILPSRSQSTNNVSTIDGL-