Monarch geneset OGS2.0

DPOGS204352
TranscriptDPOGS204352-TA1548 bp
ProteinDPOGS204352-PA515 aa
Genomic positionDPSCF300142 + 366504-368051
RNAseq coverage41x (Rank: top 72%)
Annotation
HeliconiusHMEL0031937e-17454.74% 
BombyxBGIBMGA007055-TA2e-17452.68% 
DrosophilaCG6053-PB1e-9637.72% 
EBI UniRef50UniRef50_D6WDC84e-10140.28%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WDC8_TRICA
NCBI RefSeqXP_971718.18e-10240.28%PREDICTED: similar to AGAP011539-PA [Tribolium castaneum]
NCBI nr blastpgi|910929242e-10040.28%PREDICTED: similar to AGAP011539-PA [Tribolium castaneum]
NCBI nr blastxgi|910929241e-9839.73%PREDICTED: similar to AGAP011539-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055151.6e-30protein binding
KEGG pathwaytca:6603892e-101 
 K11143 (DNAI2)maps-> Huntington's disease
InterPro domain[98-429] IPR0110461.6e-30WD40 repeat-like-containing domain
[136-428] IPR0159432.8e-25WD40/YVTN repeat-like-containing domain
Orthology groupMCL25310 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204352-TA
ATGAATACTATACGTGTTACTCTACAAGACCGTGGAGTTAACCATGCAGAAGGCGGGTGGCCAAAAGATGTTAATGTTAATGACGAGGAAGCGACAGCCCGATATCGTAGACGTTTTGAACGAGATGATGCATATGTGGGTGCTGTACTTTCTTCAAATCCTTATTTCGAACACTTAATTCATCAAAATAATGCTATTGAAATGTACAATATGTATTTCAAGGAAATGAAGCCCCAGAAACCAGTGGAAACATATTCAGTGAAAATAAAAAATGCTTATAAAGATTTATTTCAGCGTCCCGTTGCTTCAATTGCTTGGACCTTCGAGGCTAATTCTAAACTTGTTGTCTCCCATTGTTATAAAAAAATGCTTTTAGGTAAACCACTTAACACAAGACTAGAGGGCAATGTTTGGGATTTAGAAAATGCGAATGAACCAGCGGAACAATTTTTGTCTCCAACAGCATGTTGGCAAATAGTTTGTTCACCTTCTCATCCCAAAGTAATGATGGGTGGCCTCGAAGATGGCAGAGTATGTATTTTTGATTTGCGTGAAAAAATAGAACCAGTTCGGTTCAGCATGATGCATTTAGCCCATAGAGATCCCGTTAGTGCACTATTATTCTTACATAGCAGACTTAATACTGAATTCTTTAGTGGTTCTAGCGATGGAAAATGTATGTGGTGGGATATAAGAAATATATCTGAGCCTGTCGATTCACTTATTATGTCGATAAATCCGACTTCTCAAGATTTTGTATCTATGGCTGATGCAGAAGGTATAAGTTGTTTGCAATATGATAAAACTTTCCCAACCAAATTTTTGTGTGGTACTGACACAGGTTTAGTTATTAATGTGAATCGTAAAGGCAAAACTCATCAAGAAATTATGAGTGCCATTTTCAATGCTCATTATGGTCCAGTAAAAGCCCTTTATCGTAGTCCTTGCACGACAAAAGTTTTTATTACATGTGGCGACTGGACCGTTAATATTTGGAGTGATGATGTGCATTGCTCCCCTATAATATGTGGTAAAGCACATAGAATGCAAATATCTGACGTTAGTTGGTCTCCTCAAAAAATGTCAGGATACATGTCTATAAGTTATGATGGAAAATTTAGGTACTGGGATCTATTAAGACGACACTATGGTGCTATTGTCACAAAACCGGTTTCAAAATTTCCGCTTTTAAGACTGAAACCAAACAAACAGGGTAAATTTGTTGCCGTTGGAGACACACAAGGAATTGTGAATCTTTTATCGTTATCTGATAGTCTTGTTATTTCGGATAATAAAGATAAGACTTTAATGAACCAAACATTCGAACGTGAAGGGCGAAGGGAACATATAATAGAAACAAGAATAAAAGAAATAAGATTAAAATTGAGAGAAGTTGAAACGGGAGTAGATTCTGATATTGATTTAATGGATGAAAATGTTATAAAAACCGCAGAAGATGAATTTAAGAGAGTGGTTACTGAGGAATTGAAAAGATCTGGTACAACACACATTTCTAGCGGAAAACGTTATCCAATGCGAAACCGTTAA

Protein sequence:

>DPOGS204352-PA
MNTIRVTLQDRGVNHAEGGWPKDVNVNDEEATARYRRRFERDDAYVGAVLSSNPYFEHLIHQNNAIEMYNMYFKEMKPQKPVETYSVKIKNAYKDLFQRPVASIAWTFEANSKLVVSHCYKKMLLGKPLNTRLEGNVWDLENANEPAEQFLSPTACWQIVCSPSHPKVMMGGLEDGRVCIFDLREKIEPVRFSMMHLAHRDPVSALLFLHSRLNTEFFSGSSDGKCMWWDIRNISEPVDSLIMSINPTSQDFVSMADAEGISCLQYDKTFPTKFLCGTDTGLVINVNRKGKTHQEIMSAIFNAHYGPVKALYRSPCTTKVFITCGDWTVNIWSDDVHCSPIICGKAHRMQISDVSWSPQKMSGYMSISYDGKFRYWDLLRRHYGAIVTKPVSKFPLLRLKPNKQGKFVAVGDTQGIVNLLSLSDSLVISDNKDKTLMNQTFEREGRREHIIETRIKEIRLKLREVETGVDSDIDLMDENVIKTAEDEFKRVVTEELKRSGTTHISSGKRYPMRNR-