Monarch geneset OGS2.0

DPOGS202193
Transcript: DPOGS202193-TA, 1026 bp
Protein: DPOGS202193-PA, 341 aa
Genomic position: DPSCF300149 - 365678-367048
RNAseq coverage: 410x (Rank: top 30%)
Annotation
Heliconius: HMEL009188  0.0  85.34%
Bombyx: BGIBMGA013491-TA  2e-157  73.70%
Drosophila: CG7671-PA  2e-54  30.81%
EBI UniRef50: UniRef50_D6WMU7  3e-84  42.82%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WMU7_TRICA
NCBI RefSeq: XP_968010.1  6e-85  42.82%  PREDICTED: similar to nucleoporin Nup43 [Tribolium castaneum]
NCBI nr blastp: gi|91092542  1e-83  42.82%  PREDICTED: similar to nucleoporin Nup43 [Tribolium castaneum]
NCBI nr blastx: gi|91092542  2e-84  42.82%  PREDICTED: similar to nucleoporin Nup43 [Tribolium castaneum]
Group
Gene Ontology: GO:0005515  1.2e-29  protein binding
KEGG pathway:
InterPro domain: [11-340]  IPR011046  1.2e-29  WD40 repeat-like-containing domain
                 [12-289]  IPR015943  7e-29  WD40/YVTN repeat-like-containing domain
Orthology group: MCL12715  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202193-TA
ATGCCTATAGATGTCCAAGGTACTTTTGTATCTCAGAAAATAAATAAAGTGCGATGGATTCCCGAAGATTACATGGAAACGAAACATTTTTTCACTGGAAGTTGGGACGACGATGAAAATTCTATAAAAGTTTGGAGCTTTGAGACTGCCAATGAAGATGAAGATGTAGAGTATCCCCGGCAGTTATCGGAATATAAAGTTGATGGTGATGTGACTGAAATTAAGTTCACAAACAAAAATGTGATAGCAGTTTCTATATCCAATGGTGATGTGAAAATGCTCGAGATTAGTGCTTATGATAAAGAATCTCCTTTGAAAGAAGTTTATACTTGGAACAACTTGCACAATTACGGATTGGAAAAATGTTCCTGTACATCCTTGGATACCTTAGAAGGAGATATAGCTAGTATAGGAGAGGATGGTAATGTTAACATATTAAATGGAAGACGAGGGGACATTTCACAAACTATTAAAGGAGCTGATAGCTGCTCTCTACACTCCGTCTGCTTTATAAAATTAAATGAGGTCATAACGGGAAACATCAGGGGCCACATGAAGATCTGGGACATAAGATCATCAACCAACAAACCGTCGGCAGCATTCCTACTAGCGGGTGATGAGTTAGCTGCAACCTGTATCATCCATCACCCAACACAACCCCATATTGTCTTAGCTGGTAGCGAGTCTGGAGCATTAGCTTTGTGGGATTTGAGAATGAATCAGTTCCCTACATCTCTGCTTAATGCACACGGGGGTGGCGTCACAGAAATGCAATTCCACCCAGAAAATCCTAATAAACTGCTAACTACTTCAGTTTCTGGTGAAATTTGGGAGTGGAATATGGACATGTTGACAAAGAAAATGTCCGATGATTACATGCCGGTGGACGATAAAACAAACATGAATGTAAACTCCTTGATGCCGACACTACATAAAGCCATCAATACATTGCATTGCGACCGAGGAAGAACTTTGTGTGGTGCAGATAACGAAGCTATATATTTAATAAAGAATCTGAGATATTAG

Protein sequence:

>DPOGS202193-PA
MPIDVQGTFVSQKINKVRWIPEDYMETKHFFTGSWDDDENSIKVWSFETANEDEDVEYPRQLSEYKVDGDVTEIKFTNKNVIAVSISNGDVKMLEISAYDKESPLKEVYTWNNLHNYGLEKCSCTSLDTLEGDIASIGEDGNVNILNGRRGDISQTIKGADSCSLHSVCFIKLNEVITGNIRGHMKIWDIRSSTNKPSAAFLLAGDELAATCIIHHPTQPHIVLAGSESGALALWDLRMNQFPTSLLNAHGGGVTEMQFHPENPNKLLTTSVSGEIWEWNMDMLTKKMSDDYMPVDDKTNMNVNSLMPTLHKAINTLHCDRGRTLCGADNEAIYLIKNLRY-