Monarch geneset OGS2.0

DPOGS202843
TranscriptDPOGS202843-TA1062 bp
ProteinDPOGS202843-PA353 aa
Genomic positionDPSCF300018 + 1000511-1003067
RNAseq coverage358x (Rank: top 33%)
Annotation
HeliconiusHMEL0092990.093.48% 
BombyxBGIBMGA010484-TA2e-17792.24% 
DrosophilaCG14614-PB0.092.13% 
EBI UniRef50UniRef50_Q9VR530.092.13%CG14614 n=15 Tax=Eukaryota RepID=Q9VR53_DROME
NCBI RefSeqXP_395370.10.094.38%PREDICTED: similar to CG14614-PA [Apis mellifera]
NCBI nr blastpgi|3838610730.094.67%PREDICTED: DDB1- and CUL4-associated factor 7 [Megachile rotundata]
NCBI nr blastxgi|3454981720.093.86%PREDICTED: LOW QUALITY PROTEIN: DDB1- and CUL4-associated factor 7-like [Nasonia vitripennis]
Group
Gene OntologyGO:00055151.9e-36protein binding
KEGG pathway 
InterPro domain[30-346] IPR0159431.9e-36WD40/YVTN repeat-like-containing domain
[15-345] IPR0110462.1e-31WD40 repeat-like-containing domain
[267-305] IPR0197811.7e-06WD40 repeat, subgroup
[265-305] IPR0016803.8e-06WD40 repeat
Orthology groupMCL13962 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202843-TA
ATGTCGATGCCACATAGTAGCTCGAGCACTACGAGTTCAAGCACAAAACGTAAAGAAATATACAAATATCAAGCACAATGGCCTCTGTACTCCATGAATTGGTCAGTTCGTCCGGACAAAAGGTTCCGTTTGGCTCTGGGCAGTTTTGTGGAAGAGTATAATAATAAGGTACAAATCATATCATTGGATGAGGAGACGAGTGAGTTCAGTGCTAAGAGCACATTCGACCATCCTTATCCAACCACAAAAATTATGTGGATTCCGGACAGCAAGGGGGTGTATCCTGATCTTCTGGCTACTAGTGGGGACTACCTCCGCATCTGGCGTGCCGGTGAACCGTACACATTGTTTGAGTGTGTTTTGAATAATAACAAGAACTCAGACTTTTGTGCCCCTCTCACATCTTTTGATTGGAATGAAGTCGACCCAAATTTGATTGGTACAAGTAGTATCGACACCACTTGCACCATCTGGGGCTTGGAGACAGGACAGGTGATGGGGAGAGTCAATGAGGTCTCTGGACATGTGAAGACTCAACTGATTGCTCATGATAAGGAGGTGTACGACATAGCGTTCAGTCGCGCGGGAGGAGGCCGCGACATGTTCGCCTCGGTGGGAGCGGACGGCTCTGTTCGCATGTTTGACCTGAGACACCTCGAACATTCCACCATTATATATGAGGATCCACAACACACGCCGCTGCTCCGGCTGGCGTGGAACAAGCAGGACCCCAACTACTTGGCCACTATAGCGATGGACGCGTGTGAAGTCATCATACTTGACGTGAGGGTTCCTTGCACGCCGGTCGCCAGGCTCAACAACCACAGAGCCTGTGTTAACGGCATCGCTTGGGCTCCGCACAGTTCGTGCCACATCTGCACGGCGGGCGACGACCACCAGGCGCTGATCTGGGACATCCAGCAGATGCCGCGCGCCATCGAGGACCCCATCCTGGCGTACACCGCCGCCGAGGGTGAGGTCAACCAGATCCAGTGGGGCGCCACCCAGCCCGACTGGATCGCCATATGCTATAACAGACACACCGAAATATTACGCGTCTGA

Protein sequence:

>DPOGS202843-PA
MSMPHSSSSTTSSSTKRKEIYKYQAQWPLYSMNWSVRPDKRFRLALGSFVEEYNNKVQIISLDEETSEFSAKSTFDHPYPTTKIMWIPDSKGVYPDLLATSGDYLRIWRAGEPYTLFECVLNNNKNSDFCAPLTSFDWNEVDPNLIGTSSIDTTCTIWGLETGQVMGRVNEVSGHVKTQLIAHDKEVYDIAFSRAGGGRDMFASVGADGSVRMFDLRHLEHSTIIYEDPQHTPLLRLAWNKQDPNYLATIAMDACEVIILDVRVPCTPVARLNNHRACVNGIAWAPHSSCHICTAGDDHQALIWDIQQMPRAIEDPILAYTAAEGEVNQIQWGATQPDWIAICYNRHTEILRV-