Monarch geneset OGS2.0

DPOGS215768
Transcript: DPOGS215768-TA (1257 bp)
Protein: DPOGS215768-PA (418 aa)
Genomic position: DPSCF300041 + 1564139-1566100
RNAseq coverage: 1398x (Rank: top 9%)
Annotation
Heliconius: HMEL014107  0.0  82.10%
Bombyx: BGIBMGA003645-TA  0.0  75.18%
Drosophila: CG8446-PB  1e-105  51.41%
EBI UniRef50: UniRef50_D6W6Q3  2e-119  54.13%  Putative uncharacterized protein n=2 Tax=Endopterygota RepID=D6W6Q3_TRICA
NCBI RefSeq: XP_975811.1  4e-120  54.13%  PREDICTED: similar to GA21085-PA isoform 2 [Tribolium castaneum]
NCBI nr blastp: gi|383857102  8e-121  59.01%  PREDICTED: lipoyltransferase 1, mitochondrial-like [Megachile rotundata]
NCBI nr blastx: gi|383857102  1e-117  59.01%  PREDICTED: lipoyltransferase 1, mitochondrial-like [Megachile rotundata]
Group
Gene Ontology: GO:0006464  1.7e-19  protein modification process
               GO:0003824  1.7e-19  catalytic activity
KEGG pathway: dpo:Dpse_GA21085  7e-107
              K10105 (LIPT1) maps-> Lipoic acid metabolism
InterPro domain: [95-205] IPR004143  1.7e-19  Biotin/lipoate A/B protein ligase
Orthology group: MCL13426  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215768-TA
ATGGCTCAATCAGTGCTAAGGAAAATCGCCATGAGCAACATGTTAGTGATGGTCGGTTTGACAAGACGAACCAGCTCTCGAAGTTTGACAACAGGTAAAAAGTTGGCTACAACCAAACCCAACAGTATCTTACCACCAGAGAGGGAAGTCACAAAGTCAGTATTTATGTCTCAGTCTACAGACATATACACAAACCTGGCTTTAGAAGATTGGATGTACCGCAATATGGATTTCAGTAACCACCATGTGATGATGGTGTGGCGTAATGAGCCATGTGTTGTCATCGGTAGACATCAGAATCCGTGGCTTGAAGCTAATGTTCCATTTCTGAATGATAAAGATATACCTCTGGCTCGACGCCACAGTGGAGGTGGAACTGTGTACCATGATCGTGGAAACCTCAACATCTCCTTTTTCACTCCAAGAGACAGACATGACAGAAAATATAACTTAGAAATCATAAAGAGAGCACTCTACAGAGATTTTGGAATTAAGGCCCTTATTAATGAACGTCATGACATTCTTGTCAGAGACAAGTACAAGATCTCTGGAACAGCTGCGAAACTGGGTCGTACATCAGCATATCATCATTGCACACTTCTAGTCAATGCTAACAAAGCAGATCTCAGTAAGGCTCTAGCAAAGAGAGAGCATGGCATACAGACAAAGGCGACTGCGTCGACGCCTTCCCCCGTGGTCAATTTGGTGGACGTGAACAATAGAGTTACTGTGGACAGTCTACAAATTGCTGTGGGTTATGAGTTCCTCAGAACTCCAGCACTTCATTTGGAAGACGGCGGTGAAAACCAGATCTCAAAACAAAGAGGATTCCAGTTTATAAATCCAACTGAAGACTGGTTCCCAGGACTTTCAGAATTGAAGCATGAACTGCAAACATGGGAATGGTCATTTGGCAGAACTCCAGAGTTCACCGTGAGCAGATCTTTCCCTGTGCCGGAAGATTTACTTGCTCCTAGTAAGGTTTACTCGGCTACTCAAGAGCTAACTATTACTATGACGGTCGAGAAAGGCCTGATCAATGACGTCACACTGAATATACCGCCGGGTTTGGTTGAATCTGGCTTCCACGGGGAAGCCTCTGTCATTACACACCTTAAAGGCAAGAGGTTCACATCAGAGGCCCTCACTGCCCTTCAGGAGGCGATGCTTCGTCACTCCGATGGTGAAGTCAGGAAATTAGATGACAAAGAGCAGTTTGTAGCTAAATGTTTTGACCAAGTCGTGAACACTATCTAG

Protein sequence:

>DPOGS215768-PA
MAQSVLRKIAMSNMLVMVGLTRRTSSRSLTTGKKLATTKPNSILPPEREVTKSVFMSQSTDIYTNLALEDWMYRNMDFSNHHVMMVWRNEPCVVIGRHQNPWLEANVPFLNDKDIPLARRHSGGGTVYHDRGNLNISFFTPRDRHDRKYNLEIIKRALYRDFGIKALINERHDILVRDKYKISGTAAKLGRTSAYHHCTLLVNANKADLSKALAKREHGIQTKATASTPSPVVNLVDVNNRVTVDSLQIAVGYEFLRTPALHLEDGGENQISKQRGFQFINPTEDWFPGLSELKHELQTWEWSFGRTPEFTVSRSFPVPEDLLAPSKVYSATQELTITMTVEKGLINDVTLNIPPGLVESGFHGEASVITHLKGKRFTSEALTALQEAMLRHSDGEVRKLDDKEQFVAKCFDQVVNTI-
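
Note on checking the two sequences against each other: the transcript length (1257 bp) and the protein length (418 aa) are consistent with a coding sequence of 418 codons plus one stop codon, since 3 x (418 + 1) = 1257. The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1); the record itself does not state which table was used. The names CODON_TABLE and translate are illustrative only and are not part of any database tooling. The sketch writes the stop codon as "*", whereas the protein record above writes it as "-".

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), an assumption here.
# Codons are enumerated in TCAG order for each of the three positions.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Length bookkeeping from the record: 418 residues plus one stop codon.
assert 3 * (418 + 1) == 1257

# First seven codons and last three codons of DPOGS215768-TA.
print(translate("ATGGCTCAATCAGTGCTAAGG"))
print(translate("ACTATCTAG"))
```

Passing the full DPOGS215768-TA sequence to translate and replacing the trailing "*" with "-" gives a string that can be compared directly with DPOGS215768-PA.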