Monarch geneset OGS2.0

DPOGS213630
Transcript        DPOGS213630-TA  1458 bp
Protein           DPOGS213630-PA  485 aa
Genomic position  DPSCF300033 + 1104137-1106691
RNAseq coverage   253x (Rank: top 41%)
Annotation
Heliconius      HMEL006649        1e-175  63.97%
Bombyx          BGIBMGA011692-TA  8e-94   51.93%
Drosophila      CG33722-PC        3e-65   34.35%
EBI UniRef50    UniRef50_B0WHV7   1e-63   32.88%  Tether containing UBX domain for GLUT4 n=2 Tax=Culicinae RepID=B0WHV7_CULQU
NCBI RefSeq     XP_967570.1       1e-68   33.33%  PREDICTED: similar to CG33722 CG33722-PC [Tribolium castaneum]
NCBI nr blastp  gi|91078460       2e-67   33.33%  PREDICTED: similar to CG33722 CG33722-PC [Tribolium castaneum]
NCBI nr blastx  gi|91078460       3e-69   33.33%  PREDICTED: similar to CG33722 CG33722-PC [Tribolium castaneum]
Group
Gene Ontology    GO:0005515  7.8e-06  protein binding
KEGG pathway     xtr:448295  2e-13
                 K14011 (UBXN6, UBXD1) maps-> Protein processing in endoplasmic reticulum
InterPro domain  [2-57] IPR021569  7.8e-16  GLUT4 regulating protein TUG
                 [345-408] IPR001012  7.8e-06  UBX
Orthology group  MCL14402  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213630-TA
ATGAAGGTTCACTGTACGGCGGACACAAATATCTTACAGGTTTTAGAAGATGTTTGTGCAAAACACGGATTTGAAGCTACTGAGTATGACCTAAAACACCATAACCATGTCCTTGATTTAACAACAACCATTCGGTTTTGCAACTTGCCGAATAAGGCTTTACTTGAAATGGTTGAAGCTGAAAGGAAAAGACTGGAATCAAATGTCACAGTTGGATTAATGTTAAATGATGGTGAAAGAATAATGGGAGATTTCTCGCCAAACACGTCTCTTTATGACTTAATAACATCACTAGCACCAAACGAATTGCCTTCATTTAAGAATCCAACAATTTTATATATGAGACAGGAAGTAATAGGGCTGCCTGCTTTAAAAGAAAAGACTTTACGGCAGCTTGGACTGCTAACAGGCAGGGCGATACTTCGTCTATTAGACAAAACTGAACAAGCTACGCAAGCAAATGTCTCATCAGTTTATAGACGAATTCCTGAGAAAGTGGAATTGATGAGTAATGAAAAGAAACTACAGGAAAATAAAATTAATGACAAAGATGCAGGTCCATCAGGAGTACACATACATGAAGATACACATACAACATTTGATCCCATTAAATTGATTCAAAGAGAAAAAGAAACCAAGGCAGGTCCTCCGGCCCAATCACTTGAAAGCTCCAATACTGAAAGCATGGACACAAGCGAACCACAACATTGTGGAAGACCAATAAAAGAAGAAAAACCAGAACCTCCCAAGCCAGTTATGACACAAGAAAACCTTGAAAGACGTCTCAGAATTGAAGAAGAAGTTACTTTTGTAGGATCTCAAAAAGCTATAGCATTTATGCAGCCTGATATCGAGGAAGATGAAATATCAGACTTGCCGGATGACTTTTATGAGCTATCAATTGAAGAAGTGCGAAAGATGTATCATGAGTTACAGCAACGTCGTATCGAACTAGAAAATACCCCAATGCTTACTACAACAAAACGAGATGAAATTGCACAACAGACATCACTTCAAAAGCTTAACACATATAAAAATGTTGTCGTGAGAATTCAATTTCCTGACAATATTATCCTTCAGGGCGTGTTTACACCAACAAACACAGTGCAAGATGTTCAAAACTTTGTTAGAGAGCACCTACATCATTCTGACAAACCATTTCACATATTTACAACTCCATTAAAGGAAATGTTGGATCCTAAAATGACATTGCTTGAAGCTAAATTTGTGCCTTGTGTTCACATGCACTTTAAGTGGATTGAAGGGGGCGCCGTGGAGCCGTACTTAAAAGAGGAAATATACTTAAAAAAGACTACAAGTGATGCAGCAAGTATACTGGCATCGAAATATCGCGCGCCTAATAGAAGGAAGTTGGAGGAGTCAACAAACAATCCTCAAAACGGTAATCAACCCTCGTCATCAAAACAAAGCAAGATGCCAAAATGGTTCAAGAAATAG

Protein sequence:

>DPOGS213630-PA
MKVHCTADTNILQVLEDVCAKHGFEATEYDLKHHNHVLDLTTTIRFCNLPNKALLEMVEAERKRLESNVTVGLMLNDGERIMGDFSPNTSLYDLITSLAPNELPSFKNPTILYMRQEVIGLPALKEKTLRQLGLLTGRAILRLLDKTEQATQANVSSVYRRIPEKVELMSNEKKLQENKINDKDAGPSGVHIHEDTHTTFDPIKLIQREKETKAGPPAQSLESSNTESMDTSEPQHCGRPIKEEKPEPPKPVMTQENLERRLRIEEEVTFVGSQKAIAFMQPDIEEDEISDLPDDFYELSIEEVRKMYHELQQRRIELENTPMLTTTKRDEIAQQTSLQKLNTYKNVVVRIQFPDNIILQGVFTPTNTVQDVQNFVREHLHHSDKPFHIFTTPLKEMLDPKMTLLEAKFVPCVHMHFKWIEGGAVEPYLKEEIYLKKTTSDAASILASKYRAPNRRKLEESTNNPQNGNQPSSSKQSKMPKWFKK-
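
Consistency check (illustrative):

The lengths and coordinates reported in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the OGS2.0 pipeline. It assumes that the transcript is coding sequence only (it starts with ATG and ends with the stop codon TAG), that the trailing "-" in the protein sequence marks the stop codon, and that the genomic coordinates are 1-based and inclusive. The helper names cds_length and genomic_span are hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 1458
PROTEIN_AA = 485
GENOMIC_START, GENOMIC_END = 1104137, 1106691


def cds_length(protein_aa: int) -> int:
    """Coding length in bp for protein_aa residues plus one stop codon."""
    return 3 * (protein_aa + 1)


def genomic_span(start: int, end: int) -> int:
    """Span in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


expected_cds = cds_length(PROTEIN_AA)
span = genomic_span(GENOMIC_START, GENOMIC_END)
# Genomic bases not present in the transcript (presumably intronic).
non_transcript_bp = span - TRANSCRIPT_BP

print("CDS length matches transcript:", expected_cds == TRANSCRIPT_BP)
print("Genomic span (bp):", span)
print("Genomic bp absent from transcript:", non_transcript_bp)
```

Under these assumptions the 485 residues plus a stop codon account for all 1458 bp of the transcript, and the gene occupies 2555 bp on the scaffold, leaving 1097 bp that are not in the transcript.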