Monarch geneset OGS2.0

DPOGS206112
TranscriptDPOGS206112-TA1008 bp
ProteinDPOGS206112-PA335 aa
Genomic positionDPSCF300028 + 578872-581815
RNAseq coverage79x (Rank: top 64%)
Annotation
HeliconiusHMEL0140653e-5543.96% 
BombyxBGIBMGA006833-TA7e-17482.39% 
Drosophila% 
EBI UniRef50UniRef50_D6WZS01e-7244.78%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WZS0_TRICA
NCBI RefSeqXP_001814555.13e-7244.78%PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastpgi|2700132064e-7244.78%hypothetical protein TcasGA2_TC011780 [Tribolium castaneum]
NCBI nr blastxgi|2700132062e-7244.78%hypothetical protein TcasGA2_TC011780 [Tribolium castaneum]
Group
Gene OntologyGO:00055153.5e-25protein binding
KEGG pathway 
InterPro domain[34-297] IPR0159433.5e-25WD40/YVTN repeat-like-containing domain
[29-250] IPR0110461.9e-23WD40 repeat-like-containing domain
[214-250] IPR0197811.5e-06WD40 repeat, subgroup
Orthology groupMCL18320 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206112-TA
ATGGATCCACAGAGGAAAGTTACTGCTGGACCTATGGCCCCGTCCAGCCTAAAAACAGCACGTAAAAGCTTTGCTGTAACCAAGACATATGTGAAAGATATGAGCCCTCAAAAGGAAATTATAACCGTCCACACAAGCACACCTAACATTGAACATAGATCGTTTCACAGTGAACAAAATCAAATTTATCAAGGAGAGGTGAATGTGCTGTCTATAATAGATACAAATAAAGAAATAATGTGTTGTAAATATACTGAGGACGTTAAAGACATTGCTGCTGGTTTTATCGACGGCACTATCAGACTGTTCGATTGTAACAATGGAGACTGCAAGCATATCCTTGTGGATGATGAGTGTAGAGCTTACCCCGGTCCGGTGACTTCTATAAAACACAGGCCGGTTAGCAAAGCCCATCCTATCACAAATATGCTTCTGTCATCTTATGTTAACGGATGTATAAAATGCTGGAAATATAAATATGAACAATGTTTATACACAATAAGAGAGAAACGACAGACATTAGGATTGTGTTATCATCCACGTTACAGCAAATTCGTAACATATGGAGATGACGCTAAGCTAAATATGTACGACGAGGAAGCACAGACGCAAGAGAGGGCATTCTACTCAAGTCAGCGAAAAAATATAAGAGATGGTCATACTTCAAGGATATTTTCATGTGTTTTTAATCCTAAATCTCACCACGAACTTATATCTGGTGGATGGGACGACACTATCATGTGCTGGGACGACAGGCAACCTTATTCAACGAGGTATTTCTCCGGAGTACACATGTGTGGGGAAGGTTTGGACTTTGATAAACCAGGGAAGCAAACGGAATGTTCTATACGTAACAATCCAGGTGCTATATACGCATTCGATTTTGGCACGATAAGGAGAAAACAAACCAAAATCCCAGACACATACAAAAAAATAAGCGAAATACAAAATTGTCCACGTGTCGCATTCGTTACTGGAAAGAGATTACAAACTGTAGACTTTGGTTGA

Protein sequence:

>DPOGS206112-PA
MDPQRKVTAGPMAPSSLKTARKSFAVTKTYVKDMSPQKEIITVHTSTPNIEHRSFHSEQNQIYQGEVNVLSIIDTNKEIMCCKYTEDVKDIAAGFIDGTIRLFDCNNGDCKHILVDDECRAYPGPVTSIKHRPVSKAHPITNMLLSSYVNGCIKCWKYKYEQCLYTIREKRQTLGLCYHPRYSKFVTYGDDAKLNMYDEEAQTQERAFYSSQRKNIRDGHTSRIFSCVFNPKSHHELISGGWDDTIMCWDDRQPYSTRYFSGVHMCGEGLDFDKPGKQTECSIRNNPGAIYAFDFGTIRRKQTKIPDTYKKISEIQNCPRVAFVTGKRLQTVDFG-