Monarch geneset OGS2.0

DPOGS208718
TranscriptDPOGS208718-TA1026 bp
ProteinDPOGS208718-PA341 aa
Genomic positionDPSCF300043 - 145-1827
RNAseq coverage419x (Rank: top 29%)
Annotation
HeliconiusHMEL0152740.093.43% 
BombyxBGIBMGA003365-TA3e-17889.55% 
Drosophilal(1)G0095-PA3e-4033.90% 
EBI UniRef50UniRef50_D6WNP81e-8852.79%Putative uncharacterized protein n=3 Tax=Neoptera RepID=D6WNP8_TRICA
NCBI RefSeqXP_970789.12e-8952.79%PREDICTED: similar to Integrator complex subunit 4 (Int4) [Tribolium castaneum]
NCBI nr blastpgi|910820394e-8852.79%PREDICTED: similar to Integrator complex subunit 4 (Int4) [Tribolium castaneum]
NCBI nr blastxgi|910820392e-8552.79%PREDICTED: similar to Integrator complex subunit 4 (Int4) [Tribolium castaneum]
Group
Gene OntologyGO:00054882.7e-17binding
KEGG pathway 
InterPro domain[44-313] IPR0160242.7e-17Armadillo-type fold
[49-276] IPR0119892.1e-13Armadillo-like helical
Orthology groupMCL12026 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208718-TA
ATGGCTGCAGTTCTCAAAAAGAGGGCCTTAGCGGAATACAATAAATCATTCCAGGATGGTCCATCGTCGAAAAAGTTACATCTAGCCAAAAAACCGCTTATAGGTAGTTCAGCTGCAGCTTTCGTGGGACTGCTAGAGAAATGTAAATCTAGTGATGAGGCATTACAGTTACTATTACGCATCTCAGATTGCCTACAGTTCCAAGAATCAGATGTCGAGGAAGCCATCAAGAAGCTATCAGAACATTTTCAATCTGAAGAGGAGTCAGTGGTCAGGGTTAAAATACTCTGGCTCTTCTGTGATATAGGTCTAGAATGCCCCGGAGCTAATTTGAATAACTTGATTGATGAGACCATACATTTAATTAAAAATGAAACATCACATAAGGTGATAGCACAAGGTATAGCGACATTACTTAAACTTGGTACCAAGCTTAGTGACGATAAGATTCTCATGATGAGGCTTGTCGGTGTAGCCAGAGACAATCTCAAGGATACCAGCCACCAAGTGAAATGTAAATGTCTGCAGTTGATAAGTGAACTCTACCCGATATATCCGGAGTCTGATAGAACAGTAGAAATGACGGCAGAGGCTGATACTATTGTCAAGTTGCTCGGGGACTACAGTAATGCCGAGGATGCGAGAGTGAGATGTGAGGCATTTCAGTCACTCTTGACGTTGCATGAGAGAGGTCAAACATTAAGTGCTAGTCTATATGAGCCAGTGTGTGCTGCACTCTCAGATGACTATGAGATAGTAAGGGAAGTAGCACTCAAACTGGTGTGGCTACTAGGAAATAAGTACCCAGAAAATTCAGTAACATTACAAGACGGAGAGACAACAATTCGTCTCGTAGATGATGCGTTTATTCGTATTTGTTCGGCTGTTAATGATCTGTGTATGCAAGTACGAGCTCTGTCCTGTTCTCTGCTGGGCACAACAAGAGCTGTCTCCGATCGCTTCCTCTTGCAAACGTTGGATAAACAACTCATGAGTAACATGAAGGTTGGTACACTGTTCTGTTAA

Protein sequence:

>DPOGS208718-PA
MAAVLKKRALAEYNKSFQDGPSSKKLHLAKKPLIGSSAAAFVGLLEKCKSSDEALQLLLRISDCLQFQESDVEEAIKKLSEHFQSEEESVVRVKILWLFCDIGLECPGANLNNLIDETIHLIKNETSHKVIAQGIATLLKLGTKLSDDKILMMRLVGVARDNLKDTSHQVKCKCLQLISELYPIYPESDRTVEMTAEADTIVKLLGDYSNAEDARVRCEAFQSLLTLHERGQTLSASLYEPVCAALSDDYEIVREVALKLVWLLGNKYPENSVTLQDGETTIRLVDDAFIRICSAVNDLCMQVRALSCSLLGTTRAVSDRFLLQTLDKQLMSNMKVGTLFC-