Monarch geneset OGS2.0

DPOGS205676
Transcript: DPOGS205676-TA, 1953 bp
Protein: DPOGS205676-PA, 650 aa
Genomic position: DPSCF300023 + 937960-942011
RNAseq coverage: 296x (Rank: top 38%)
Annotation
  Heliconius       HMEL007339        0.0  89.92%
  Bombyx           BGIBMGA001024-TA  0.0  89.86%
  Drosophila       CG4050-PA         0.0  64.01%
  EBI UniRef50     UniRef50_E2BZ92   0.0  68.41%  Transmembrane and TPR repeat-containing protein CG4050 n=13 Tax=Coelomata RepID=E2BZ92_HARSA
  NCBI RefSeq      XP_974172.1       0.0  69.31%  PREDICTED: similar to GA17918-PA [Tribolium castaneum]
  NCBI nr blastp   gi|270005381      0.0  69.31%  hypothetical protein TcasGA2_TC007431 [Tribolium castaneum]
  NCBI nr blastx   gi|270005381      0.0  69.42%  hypothetical protein TcasGA2_TC007431 [Tribolium castaneum]
Group:
Gene Ontology
  GO:0005488  9.9e-44  binding
  GO:0005515  3.9e-08  protein binding
KEGG pathway:
InterPro domain
  [211-438]  IPR011990  9.9e-44  Tetratricopeptide-like helical
  [53-128]   IPR013618  1.9e-24  Domain of unknown function DUF1736
  [389-422]  IPR001440  3.9e-08  Tetratricopeptide TPR-1
  [389-422]  IPR019734  4.4e-07  Tetratricopeptide repeat
Orthology group: MCL13137, Single-copy universal gene

Nucleotide sequence:

>DPOGS205676-TA
ATGGATGGAAGTTCTACGGAGCGACGTGGCAGCCCTGGTCGTTCTTGGCTGGAGTATTGGGGTAAGGGAGGGTGGGGATGTGCCTGTGGGGAAGCAGCCCGCAGGGTTGCCACCGTCTGCTGTGCTACATTGGCGCTTCTGGCTGCCAGACTACACGTTATGGGAGCACAGCTGCCAGTTTTCACACGATTCGACAACCCTGCTGGAGCCTCGCCCCCACCTGCGAGACATCTAACCTTCGCCTATCTTCCCGCGTTGAACGCCTGGCTGCTGACTCTGCCGGAGGCGTTGTGCTGTGACTGGACCATGGGTACTGTGGCTTTGTTACGATCGTGGAGCGACCCACGGAATATGGCCACGGCGGGTCTCGCAGTCATGCTTGTCGCTGGTACCATACATGCCTTGAGGACCAGATCTTCGGCATTATCGATGGGTCTTGCGCTACTTGTTTTGCCGTTTCTCCCCGCGTCAAACCTCTTCTTCCCCGTGGGTTTCGTTGTGGCTGAGCGTGTTTTATACATGCCGTCTATGGGCTGGTGTCTGTTAGTGGCACACGGTTGGAGGCTTGTGGCCAGGAAACGAGCGAAACTGGCCGCAGCTTCACTCGTGTTCCTCCTGCTGGCTTTTAGTGCCAAAACATACGTCAGAAACTGGGACTGGAAGACTGAATATACAATATTTGCATCGGGACTGAAGGTGAATCGTAATAACGCCAAGCTATACAACAACGTGGGTCACGCTTTAGAAGCTGAAGGGAAATACGGAGAAGCTTTAGAATTCTTCAAAATTGCTGTGAACGTCCAACCAGACGACGTTGGAGCCCATATCAACGTTGGAAGAACTTTCAATCATTTAGGCAAATATCAGGAGGCAGAAGCCGCTTACGTGAAAGCCAAATCTCTCCTACCGAAAGCCAAACCCGGGGAATCTTACCAAGCTAGAATAGCCCCCAATCACTTGAACGTCTTCCTTAATTTGGCTAATTTGATATCCAAAAACGCGACACGATTAGAAGAAGCTGACATGTTGTATCGGCAAGCGATCAGTATGAGAGCTGATTACACACAGGCCTATATAAACAGGGGTGATATTTTAATTAAATTAAACAGGACCAAGGAGGCCCAGGAGGTCTACGAACGGGCGCTGTTGTATGACAGTGGGAACCCTGACATTTATTACAATCTGGGGGTAGTGCTACTGGAACAAGGCAAGGCGTCCCAGGCGCTGGCGTATTTGGACAAAGCTCTGGAACTCGAGCCGGAACATGAACAGGCATTACTGAACTCTGCCATACTTCTGCAAGAGCTGGGAGCTGCAGACTTGAGACACCTTGCCAGACAAAGATTACTCAAATTGTTGGACAAAGATGCCACTAACGAGCGCGTCCACTTTAACCTCGGCATGGTGTGTATGGATGAGGGAGACGCGGAGTGCGCTGAACGCTGGTTCAGGGCCGCGGTTCATCTTAAACCGGACTTCCGCTCCGCTCTCTTCAACCTGGCTTTACTACTAGCTGACAGACGAAGACCCCTGGAGGCCGCGCCTTTCCTAAAACAATTGGTCAGACATCACCCCGATCATGTGAAAGCCCTAGTACTGTTGGGAGACATTTACATCAATTCGGTCAAGGATTTGGATGCTGCTGAAAGTTGCTATCGACGCATCCTCGAACTAGAACCAGACAACGTGCAAGCTCTCCACAATTTATGCGTTGTTGCTGTAGAAAGAGGGAAGTTAGCCGTTGCTGAAGAGTGTCTTACAAGAGCCGCGGTTTTGGCGCCACACGAACATTACATACAGCGCCATCTAGCGGTAGTACGCGCGAGACTGGCAGCTGTCTCGCTCACACACCCCAACACCCGACCATCCGACGCATCCGACGCACAAGTGCGAGCGAGATGGAACTACATCCCCCAACAACCCCCCGACCCACACGAGTCGGATCCCTCATAG

Protein sequence:

>DPOGS205676-PA
MDGSSTERRGSPGRSWLEYWGKGGWGCACGEAARRVATVCCATLALLAARLHVMGAQLPVFTRFDNPAGASPPPARHLTFAYLPALNAWLLTLPEALCCDWTMGTVALLRSWSDPRNMATAGLAVMLVAGTIHALRTRSSALSMGLALLVLPFLPASNLFFPVGFVVAERVLYMPSMGWCLLVAHGWRLVARKRAKLAAASLVFLLLAFSAKTYVRNWDWKTEYTIFASGLKVNRNNAKLYNNVGHALEAEGKYGEALEFFKIAVNVQPDDVGAHINVGRTFNHLGKYQEAEAAYVKAKSLLPKAKPGESYQARIAPNHLNVFLNLANLISKNATRLEEADMLYRQAISMRADYTQAYINRGDILIKLNRTKEAQEVYERALLYDSGNPDIYYNLGVVLLEQGKASQALAYLDKALELEPEHEQALLNSAILLQELGAADLRHLARQRLLKLLDKDATNERVHFNLGMVCMDEGDAECAERWFRAAVHLKPDFRSALFNLALLLADRRRPLEAAPFLKQLVRHHPDHVKALVLLGDIYINSVKDLDAAESCYRRILELEPDNVQALHNLCVVAVERGKLAVAEECLTRAAVLAPHEHYIQRHLAVVRARLAAVSLTHPNTRPSDASDAQVRARWNYIPQQPPDPHESDPS-
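
The transcript and protein lengths in this record can be cross-checked against the two sequences. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1), and the helper names `translate` and `expected_cds_length` are hypothetical, not part of any database tooling. It writes the stop codon as `*`, whereas the protein record above ends in `-`, which appears to mark the same stop.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_cds_length(protein_length: int) -> int:
    """Length in bp of a CDS encoding protein_length residues plus one stop codon."""
    return 3 * (protein_length + 1)


if __name__ == "__main__":
    # First 39 nt and last 21 nt of DPOGS205676-TA, copied from the record above.
    print(translate("ATGGATGGAAGTTCTACGGAGCGACGTGGCAGCCCTGGT"))
    print(translate("CACGAGTCGGATCCCTCATAG"))
    # 650 aa plus a stop codon should account for the full 1953 bp transcript.
    print(expected_cds_length(650))
```

Under these assumptions, the 650 aa protein plus one stop codon accounts for exactly 1953 bp, and the first and last codons of the transcript translate to the start (`MDGSSTERRGSPG`) and end (`HESDPS`) of the protein sequence shown above.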