Monarch geneset OGS2.0

DPOGS204343
TranscriptDPOGS204343-TA1179 bp
ProteinDPOGS204343-PA392 aa
Genomic positionDPSCF300142 + 112513-116030
RNAseq coverage391x (Rank: top 31%)
Annotation
HeliconiusHMEL0023206e-11859.90% 
BombyxBGIBMGA007247-TA7e-14265.85% 
DrosophilanudE-PC7e-4537.91% 
EBI UniRef50UniRef50_D6WVA23e-4345.74%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WVA2_TRICA
NCBI RefSeqXP_393385.25e-4640.43%PREDICTED: similar to nudE nuclear distribution gene E homolog like 1 (A. nidulans) isoform B [Apis mellifera]
NCBI nr blastpgi|2700113071e-4245.74%hypothetical protein TcasGA2_TC005309 [Tribolium castaneum]
NCBI nr blastxgi|2700113071e-4545.78%hypothetical protein TcasGA2_TC005309 [Tribolium castaneum]
Group
KEGG pathwaysmm:Smp_1695607e-16 
 K06560 (MRC)maps-> Phagosome
InterPro domain[130-322] IPR0069641.1e-19NUDE protein, C-terminal
Orthology groupMCL13028 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204343-TA
ATGGAATCTCCAAATCAGGCAGAATTGGATACGGTGGAATATTGGAAGGAACAAGCAAAACATTACGAACAGAAGGCAACGGATATACAGCAAGAGTTGGACGAGTATACAGAAAATTCAGCTCAGCTTGAAAAAGAATTGGACGCGTCCTTGGTTCGAGTGGAAAAACGAAACAGAGATTTGGAACATCAAAATCTGAGGCTCAAAAACGATATCGATATGCTCAAATCTAAATTAGAAAGAAGTCAACATGAAACAAATGCACTTGAAAATGAATTACAAACACTCAAAATGGAAAAAGAGAAACAGGCTACATATATAAGAGAATTAGAGCAAAAGAATGATGATTTGGAAAGAGGACAACGAATTATATCAGAATCGGTTTCATGTATAGAAGCATTATTAAATCAGGCTTATGAACGCAATGCTGTCTTAGAAAGTGAAGTTGATGAAATTGAAAACTTGAGGGTAAAATTGCAAAGAGCTACAGATGAAGCCAGAGATCTCAAACAAGAGTTAATAGTTATAGAGAAAAATCCAATTTCTAAGAAGGAAGAGAGCAGTATAAATGAAAATGTGTGTAATGGACATACAACGAGGAGTCAAGTAGAAATAGAAACACAAACTTCCCTACTTTCACCGACAAAACGTGAACTGAATGGTAATGCTATGACGCCATCATCTAGAGTATCGGCTATTAACATTGTTGGAGATCTGCTCAGAAAAGTTGGGCTTGAAAGATTTCTTTGCCGTGATTGTGGTAAGGTCAAATGTTCGTGTGACGTCAGCACCGAGCAACAAAACACTGTTCTTGAAACGAACGTCCATGAACACGATCCTATAAATAATGTTGATAATTCTGTTGAGTACAGAAAAGGGACTTTCACGCGTCAATATTCAAAATCGGAACAGTCCAACACGACGGCTAACAAAACAATGCCATTAACACCAAAAAGTTCCGAACCATTCGAAAGATCCTACCATAATGAGAATGCGAAATTAAGAAGATCATTCATAGTGCGATCGAGAGAAGGAATAGAGAATTTGTTGAACTTCTCATCAGCAAGAAAAGCTTTGGAATCAAAGTTAGCATCGTGTCGTGGTACTGTTAGACCTAAAGAGTCACCAAACCAAACATCCGACGTCAATAAGGACTACAGGTGTGTACTCAAGAACTAA

Protein sequence:

>DPOGS204343-PA
MESPNQAELDTVEYWKEQAKHYEQKATDIQQELDEYTENSAQLEKELDASLVRVEKRNRDLEHQNLRLKNDIDMLKSKLERSQHETNALENELQTLKMEKEKQATYIRELEQKNDDLERGQRIISESVSCIEALLNQAYERNAVLESEVDEIENLRVKLQRATDEARDLKQELIVIEKNPISKKEESSINENVCNGHTTRSQVEIETQTSLLSPTKRELNGNAMTPSSRVSAINIVGDLLRKVGLERFLCRDCGKVKCSCDVSTEQQNTVLETNVHEHDPINNVDNSVEYRKGTFTRQYSKSEQSNTTANKTMPLTPKSSEPFERSYHNENAKLRRSFIVRSREGIENLLNFSSARKALESKLASCRGTVRPKESPNQTSDVNKDYRCVLKN-