Monarch geneset OGS2.0

DPOGS204324
TranscriptDPOGS204324-TA1341 bp
ProteinDPOGS204324-PA446 aa
Genomic positionDPSCF300142 - 153011-154351
RNAseq coverage13x (Rank: top 82%)
Annotation
HeliconiusHMEL0023230.069.02% 
BombyxBGIBMGA007225-TA9e-16160.19% 
DrosophilaCG6053-PB1e-6732.65% 
EBI UniRef50UniRef50_D6WDC86e-7134.05%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WDC8_TRICA
NCBI RefSeqXP_001605399.15e-7435.18%PREDICTED: similar to ENSANGP00000015395 [Nasonia vitripennis]
NCBI nr blastpgi|3454789641e-7235.18%PREDICTED: dynein intermediate chain 3, ciliary-like [Nasonia vitripennis]
NCBI nr blastxgi|3454789641e-7035.18%PREDICTED: dynein intermediate chain 3, ciliary-like [Nasonia vitripennis]
Group
Gene OntologyGO:00055157.2e-33protein binding
KEGG pathwaytca:6603893e-71 
 K11143 (DNAI2)maps-> Huntington's disease
InterPro domain[18-358] IPR0110467.2e-33WD40 repeat-like-containing domain
[234-355] IPR0159432.8e-30WD40/YVTN repeat-like-containing domain
Orthology groupMCL25323 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204324-TA
ATGTATCAAACTTACTTCCACAACATGAAACAACAAGATCCAGTGGAGAGGCATAATGTTCAAATTGCAAATACATTTCGTGACAAATTTCATCGACCTGTATCTTGTATTGTTTGGACCCAAGAAAAGCATTCAAAGTTGGTTGCAAGTTATTCTTACAAAACGCGGCTAGTTGAACCCGAATCAAGTAATGAAAACGTTTGCTTTGTATGGGATATCAATAAACAAACAGAGCCAATCTACGAATTTTTGCCAAAGCATAGTTGTTGGCAAGTAGCCTGTTCACCAGTAGACCCTGATTTAATAATTGCAGGGTTGGACAATGGTACAGTAAATGTTTTCGATATTCGAGCTGGGATCAATTGTGTGACAAGCAGCTCTATTTATAATTCTCATTTTGCTCCTATAACATCGTTGTTGTATACTCACTCAAGAACAAACACAGAATTATTCACAGGATCTCCCGATGGTCAATGTCTCTGGTGGGATGTGAGAAATCTCTCTAATCCTCTAGACCAACTTCCAATGTCAATAAAATTATCAGCCGATGAGACACCTAATTTAAGTAATGCAGAAGGTGTTAGTAATTTAGAATTTGATCGTGGACTTCCAACCAAATTTTTATGTGGTACAGAATCAGGTTTAGTTATAAACGCCAATCGTATGGGAAGAAGTCATTCAGAAATATTAACCTCATATTGGGAAGCACACTCGGGCCCCGTAAGGGCGGTACATAGAAGTCCATGCACTTTAAGAATGTTTATTACTTGCGGAGATTGGAGTGTACGGATTTGGAGTGAAGAGGTACGCACTGCTCCAATAATAGTAACCCCACCTTATCGTTATGAAGTCACAGATGTTGTTTGGGCACCATTACGGTATTCTTGTTATATGGCTATAAGTGACGATGGTGTATTTTATTTCTGGGATCTTTTAAGAAAACAGAAACAACCTGTCGCAACTCTCAATATCTCTAAATTTGGTCTAACTAAATTGAGCCCTCATTGGAAAGGTGAACTTACTGCTGTTGGAGATAATGATGGATCGGTTTTTCTTCTTAATTTATCTGATAATATGGTAATACCTGGTGCTAACGATAAACAGTTAATGCACCAAACTTACGATCGCGAAACCAGACGTGAACATATTATAGACAATCGTGTAAAAGAGTTGCGATTAAAGGCTCGTGTTGAAGAAGAAATGCCTGTCCAAATAGAACCCGATGAATCTTCTTATGAAGATGATTTTGAGAGACAAACGGGAGAATATTTTGATTTAGTCAAAAAAGAAATGACATTAGTGGGTGGTGTATTTCCAGAGAATTGTACATTAACTGAATGA

Protein sequence:

>DPOGS204324-PA
MYQTYFHNMKQQDPVERHNVQIANTFRDKFHRPVSCIVWTQEKHSKLVASYSYKTRLVEPESSNENVCFVWDINKQTEPIYEFLPKHSCWQVACSPVDPDLIIAGLDNGTVNVFDIRAGINCVTSSSIYNSHFAPITSLLYTHSRTNTELFTGSPDGQCLWWDVRNLSNPLDQLPMSIKLSADETPNLSNAEGVSNLEFDRGLPTKFLCGTESGLVINANRMGRSHSEILTSYWEAHSGPVRAVHRSPCTLRMFITCGDWSVRIWSEEVRTAPIIVTPPYRYEVTDVVWAPLRYSCYMAISDDGVFYFWDLLRKQKQPVATLNISKFGLTKLSPHWKGELTAVGDNDGSVFLLNLSDNMVIPGANDKQLMHQTYDRETRREHIIDNRVKELRLKARVEEEMPVQIEPDESSYEDDFERQTGEYFDLVKKEMTLVGGVFPENCTLTE-