Monarch geneset OGS2.0

DPOGS200343
TranscriptDPOGS200343-TA1350 bp
ProteinDPOGS200343-PA449 aa
Genomic positionDPSCF300026 + 517899-519248
RNAseq coverage233x (Rank: top 44%)
Annotation
HeliconiusHMEL0020370.079.78% 
BombyxBGIBMGA005636-TA0.076.40% 
DrosophilaU3-55K-PA1e-11545.87% 
EBI UniRef50UniRef50_D0AB940.080.00%Similar toCG33505 n=74 Tax=Heliconius RepID=D0AB94_9NEOP
NCBI RefSeqXP_001361669.28e-11644.33%GA10166 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|2613359560.080.00%similar toCG33505 [Heliconius melpomene]
NCBI nr blastxgi|2613359560.080.18%similar toCG33505 [Heliconius melpomene]
Group
Gene OntologyGO:00055151e-53protein binding
KEGG pathway 
InterPro domain[83-432] IPR0110461e-53WD40 repeat-like-containing domain
[127-430] IPR0159437.8e-53WD40/YVTN repeat-like-containing domain
[217-254] IPR0197811.6e-07WD40 repeat, subgroup
[215-254] IPR0016802.1e-07WD40 repeat
[241-255] IPR0204722e-06G-protein beta WD-40 repeat
Orthology groupMCL13561 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200343-TA
ATGTCAACTTCTTTCTTTTTAAAAGGGAAATCCAGAAAATTACATAAAAGAAAAGGAGAAAAAATCACAAAGAAAAGAATAAAAAAACCAAATGATCCTAACCTAGAAAATGGTGGCGAATCTTCGGATAGCGATTTAGATCTAAAGAAATTTTCTGATGTAGAAGATTCAGAAAGTGATCACGAAACAGCTGAACAGAAGAAACTGCGGATAGCCAAAACATACCTTGAAGAAATAGAAAAAGAAGAAGCTAAGCGTGCAGAGCTTAAAGAATTGGACACTGCTATTGACAAGCGATTACTGAAAGACTATTTAGAACAGAAAGGTAAACTACGAATAGAAGTAGCTGATAATTATATCTGTCCCACTGAAGACGATATAAGAATTATCCGTGCGAAAGAACATCGCCTTTCCCTGACGTGTGTCTGTATCAGTAATGATGCCCAGTTTGCTATCACGGGTTGCAAAAGCGGAAGTATTATTAAATGGGGAATAAAAGAGAAAAGAAAACTTGGTACACTGACATTTAAAACACATTCCAATTTTCTTAAAGGTAGTATTTTATCAGCAGCTTTATCGTCAGACTCTAAATTTCTAGCAACATCCGACTCATCACCTGATATACAAATCTGGGATCCACAGACATTAAAACATATTCATACATTCAAAGGTCACAAGGACTCTGTTATAGGACTTGTTTTCAGAAAGAGCACACATACATTATACTCTGCTAGTAAGGACAGATCTGTTAAAGTGTGGTCTTTAGATGAAATGGCATATGTTGAAACACTATTTGGACATCAGTCTCCAATTACATCGATTGATGCTTTAACTAGAGAAAGGGCAATAACATCGGGGGGAAGAGATACATCTGTACGTATATGGAAAATTGTTGAAGAATCGCAACTCATTTTTAATGGGCCAGAAGGTAGTCTTGATGTTGTTAAACTATTAGATGAGGAACATTTTGTGTCCGGCAGTGATAATGGATCACTTTGTTTATGGAGTGCTTTAAAGAAAAAGCCCTTATGTACTGTACCAGAAGCACATGGTCAAGAGAATAATGTACCTCGTTGGATTGTCAGCTTAGCCACACTTTTGAATTCTGACGTATTTGCATCTGGCTCCTATGACAATAATGTTAGATTGTGGAAAGTTAGTAATTCATATAGAAATATTGTGCCTCTATTTAGTTTAAGTATAAGTGGTTTCATTAATTCTATGCAATTTACTAGTGATGGCAACCAGTTATATGTTGCTGTCGGACAGGAACACAAGATTGGCAGATGGTTTAAAGATGGCAGTGCAAAGAATGGTTTAGTCATTGTTAACTTTTTATTGAAGTCATGA

Protein sequence:

>DPOGS200343-PA
MSTSFFLKGKSRKLHKRKGEKITKKRIKKPNDPNLENGGESSDSDLDLKKFSDVEDSESDHETAEQKKLRIAKTYLEEIEKEEAKRAELKELDTAIDKRLLKDYLEQKGKLRIEVADNYICPTEDDIRIIRAKEHRLSLTCVCISNDAQFAITGCKSGSIIKWGIKEKRKLGTLTFKTHSNFLKGSILSAALSSDSKFLATSDSSPDIQIWDPQTLKHIHTFKGHKDSVIGLVFRKSTHTLYSASKDRSVKVWSLDEMAYVETLFGHQSPITSIDALTRERAITSGGRDTSVRIWKIVEESQLIFNGPEGSLDVVKLLDEEHFVSGSDNGSLCLWSALKKKPLCTVPEAHGQENNVPRWIVSLATLLNSDVFASGSYDNNVRLWKVSNSYRNIVPLFSLSISGFINSMQFTSDGNQLYVAVGQEHKIGRWFKDGSAKNGLVIVNFLLKS-