Monarch geneset OGS2.0

DPOGS213949
TranscriptDPOGS213949-TA1140 bp
ProteinDPOGS213949-PA379 aa
Genomic positionDPSCF300226 + 176-6043
RNAseq coverage373x (Rank: top 32%)
Annotation
HeliconiusHMEL0152748e-5895.50% 
BombyxBGIBMGA003366-TA1e-7957.00% 
Drosophilal(1)G0095-PA7e-4169.37% 
EBI UniRef50UniRef50_D6WNP81e-4171.17%Putative uncharacterized protein n=3 Tax=Neoptera RepID=D6WNP8_TRICA
NCBI RefSeqXP_970789.13e-4271.17%PREDICTED: similar to Integrator complex subunit 4 (Int4) [Tribolium castaneum]
NCBI nr blastpgi|910820395e-4171.17%PREDICTED: similar to Integrator complex subunit 4 (Int4) [Tribolium castaneum]
NCBI nr blastxgi|910820392e-3871.17%PREDICTED: similar to Integrator complex subunit 4 (Int4) [Tribolium castaneum]
Group
Gene OntologyGO:00054884.9e-12binding
KEGG pathway 
InterPro domain[2-107] IPR0160244.9e-12Armadillo-type fold
[2-99] IPR0119895.7e-07Armadillo-like helical
Orthology groupMCL30167 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213949-TA
ATGGAGGTGCGCACTGCTGCTGTAGATGCTGTTTGTCAGTTGTCGCTGGTGAACCCGGTGTTCGCGACGACTTCCCTGGACTTTTTGGTTGATATGTTCAATGATGAGATAGAGGACGTCCGTCTGCGGGCCATAGACAGTCTCACACGCATCTCACACCACATCATACTCAGAGAGGACCAGTTGGAGATAATTCTGGGTGCTCTGGAGGATTACTCTATGGACGTCAGGGAAGGTCTACACAGGATGCTGGGGTCGTGTACCGTAGCTTCCAAGACCTGCCTCGAGATGTGTATAGACAAAATATTGGAGAATCTCAAACGTTATCCTCAGACTGTCGGAGATGGAGCCGCTGGTGGCTGGTGCTGCGAATTTCACGTCCCTGTTCGTCCGCGGAGTGGTGTCGTTGAACGAGGCCCTGACCGCGCCCTCCGGCCCGCCGCAGCACGCGCTGCAGCATCTCACCACGGATTGCCTCAGATTCGTGTCTCCAGACTCCAGCACCAGTTTAGCGGGGTGACTGAACAGGAGACGGCTTGCGTGTATCAGCTGGCGTTGCGCGTGTGCGCGGCGAGGCTCACGGCAGCGGTGGAGGGGGGAGGAGGGGGAGCGGTGCCGGTGGGGGGAGCAGCCGCTGCGGGAGGGGGAGGGGGGCCCGGACAAGCGGCCGCTCTGTCTTCTCTGGCAGCGCTGACACATCACGCTGAAGTGCTGGACAGGTTGTTGGCATCCGCTGCGAGCATTATAGAACCGGCTACGGACAACGACACAGTGTTGCGTTTCTGTGCTGGGCTCATAACCGGTGTGGCGCTGGAGGCTGAGGTGGTGAGGGTCAGCGACCCCGCTCAGCTGAGGGTCAAGGTCGCTTACCCTGACACGAGGGTACACGCCCTTGTGCCACCCAGAGATCACCTGCGACCTCTGGACCATACCAATACGAATGACGATGGAACCCAGAACGTCCGTTTGCTGACGAAGGTGTTGATATCTCACGGTGTGTGGACGGAGCCGTGCGGGGTCGATATATCTGTGTGCCTGGCCGTGGAGGACGGAGGCAGCCGGGAAGCTGCACCTCTAGTGGAGCTATGCCGACCCGTGCGCGTCACGGTGGCACCGAAACCAATCAAACGCGGCATATAG

Protein sequence:

>DPOGS213949-PA
MEVRTAAVDAVCQLSLVNPVFATTSLDFLVDMFNDEIEDVRLRAIDSLTRISHHIILREDQLEIILGALEDYSMDVREGLHRMLGSCTVASKTCLEMCIDKILENLKRYPQTVGDGAAGGWCCEFHVPVRPRSGVVERGPDRALRPAAARAAASHHGLPQIRVSRLQHQFSGVTEQETACVYQLALRVCAARLTAAVEGGGGGAVPVGGAAAAGGGGGPGQAAALSSLAALTHHAEVLDRLLASAASIIEPATDNDTVLRFCAGLITGVALEAEVVRVSDPAQLRVKVAYPDTRVHALVPPRDHLRPLDHTNTNDDGTQNVRLLTKVLISHGVWTEPCGVDISVCLAVEDGGSREAAPLVELCRPVRVTVAPKPIKRGI-