Monarch geneset OGS2.0

DPOGS215407
TranscriptDPOGS215407-TA1065 bp
ProteinDPOGS215407-PA354 aa
Genomic positionDPSCF300088 + 432762-436369
RNAseq coverage385x (Rank: top 31%)
Annotation
HeliconiusHMEL0097213e-12761.50% 
BombyxBGIBMGA012406-TA6e-9769.85% 
Drosophilattm50-PA9e-8247.04% 
EBI UniRef50UniRef50_Q9W4V81e-7947.04%Mitochondrial import inner membrane translocase subunit TIM50-C n=33 Tax=Pancrustacea RepID=TI50C_DROME
NCBI RefSeqXP_002100221.19e-8149.69%GE16923 [Drosophila yakuba]
NCBI nr blastpgi|2897424475e-8248.02%mitochondrial import inner membrane translocase subunit TIM50-C precursor [Glossina morsitans morsitans]
NCBI nr blastxgi|185432996e-8348.17%tiny tim 50 [Drosophila melanogaster]
Group
Gene OntologyGO:00055151.6e-22protein binding
KEGG pathway 
InterPro domain[172-340] IPR0232143.9e-26HAD-like domain
[187-285] IPR0042741.6e-22NLI interacting factor
Orthology groupMCL12359 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215407-TA
ATGTCTCTGAGAAAAGTACTTTTCTTAACGCGTGCGGTCTGTTGTAATCCTGTTAAGCTACAAGTCCCAAAATTATTAAGGCAAAACATGTGTCTACTTTCTGTTAATTGTGCGAGAACCTATTCTACTGAGAATATAGGTGATAAGAATAAGAATGAGAAAGTTCCTGAGAAAGTAGACATTCTGGGACGATTTTTCCCCCAGACTCCAGGCAATGTTCAAGATGCACAGGAAGTTAAAAAAGAACAGGAGAAGTTTGAACAGGAACAGAAAGAAAAAAACCAAGAGAATGAGAACAGCTGGAGAAGGATGAAGATTGGATTTGCAGTTTTTGGTGGTGCGATGACAGTGATGGGTGGTTGTATGGTAATTGAGATGGGAGCTCCTCGACGCTCCGATGACGGGACACCCCTGGAGGACGAGTTTTCCCACTTGCCCCTACCCCTGCAGTATCTCAGACGAACATGGAAGGAACTTACTTTTTATGAAAAGATGATAAAAGAGCCGTCTCGAGAGAAACTATTACCGGACACGTTGCCGCCTCCATACCAGCCAACCTACACCCTGGTGCTGGAGTTCACCGACGTCCTCGTGCATCCGGACTGGACCTATCAGACCGGATGGAGATTTAAGAAGCGGCCGGGAGTAGACCAGTTCCTCCAAACGGTCGCCAATTCCGACTACGAGGTGGTGATCTTTACTTCGGAGAACGCGTTCATGATCTATCCCGTGTTGGAGAAGTTGGACCCCGAGAACAAATTCATCTCATACAAGCTGTTCAGAGATTCCACACACTTCATAGACGGCGTACACGTCAAGAACTTGGAGGGGTTGAATAGAGATCTGTCCAAGGTTGGTGTCCGTTTACTGAAGGTAACACCGATCGCCATGTCTAACGTGACGGACGTGCGCGAGGTGTTGCGCTACTACGGACAGTTCGACGACCCCATCGCGGCCTTCAGAGAGAACCAAAGACGGCTCATGGAGCAGATGGCGGACAGGGAGAAGGAGACACAGGAGCAGCCGCTAGCACGCTCCTGGCTGAGACCTTTCACACGCCGCTAG

Protein sequence:

>DPOGS215407-PA
MSLRKVLFLTRAVCCNPVKLQVPKLLRQNMCLLSVNCARTYSTENIGDKNKNEKVPEKVDILGRFFPQTPGNVQDAQEVKKEQEKFEQEQKEKNQENENSWRRMKIGFAVFGGAMTVMGGCMVIEMGAPRRSDDGTPLEDEFSHLPLPLQYLRRTWKELTFYEKMIKEPSREKLLPDTLPPPYQPTYTLVLEFTDVLVHPDWTYQTGWRFKKRPGVDQFLQTVANSDYEVVIFTSENAFMIYPVLEKLDPENKFISYKLFRDSTHFIDGVHVKNLEGLNRDLSKVGVRLLKVTPIAMSNVTDVREVLRYYGQFDDPIAAFRENQRRLMEQMADREKETQEQPLARSWLRPFTRR-