Monarch geneset OGS2.0

DPOGS212431
Transcript: DPOGS212431-TA (1455 bp)
Protein: DPOGS212431-PA (484 aa)
Genomic position: DPSCF300258 + 103353-108415
RNAseq coverage: 346x (Rank: top 34%)
Annotation
Heliconius: HMEL012358 (E-value 0.0, 74.49%)
Bombyx: BGIBMGA002889-TA (E-value 0.0, 71.81%)
Drosophila: CG17683-PA (E-value 2e-149, 54.12%)
EBI UniRef50: UniRef50_Q8SYS7 (E-value 3e-147, 54.12%) Probable cytosolic Fe-S cluster assembly factor CG17683 n=34 Tax=Arthropoda RepID=NARF_DROME
NCBI RefSeq: XP_001605725.1 (E-value 3e-169, 61.48%) PREDICTED: similar to ENSANGP00000006535 [Nasonia vitripennis]
NCBI nr blastp: gi|156545553 (E-value 5e-168, 61.48%) PREDICTED: probable cytosolic Fe-S cluster assembly factor AAEL012261-like [Nasonia vitripennis]
NCBI nr blastx: gi|156545553 (E-value 4e-161, 61.48%) PREDICTED: probable cytosolic Fe-S cluster assembly factor AAEL012261-like [Nasonia vitripennis]
Group
KEGG pathway: afm:AFUA_4G11960 (E-value 3e-55)
    K00532 (E1.12.7.2) maps-> Methane metabolism; Glyoxylate and dicarboxylate metabolism
InterPro domain: [68-472] IPR009016 (E-value 1e-113) Iron hydrogenase
    [112-414] IPR004108 (E-value 1.1e-82) Iron hydrogenase, large subunit, C-terminal
Orthology group: MCL14389, Single-copy universal gene

Nucleotide sequence:

>DPOGS212431-TA
ATGGCTTCGCGGTTTAGTGGGGCGCTACAATTGACCGACCTTGATGATTTTATTACCCCATCGCAGGAATGTATAAAACCAGTGAAAATAGAGAAAAAGAAAACTCACACGGGATCCAAAATAAAAATCGGCGAAGACGGATACTTTGACCTTTCATCTGGAAAGGAACAAAAGCTTCAGAAAGTGGAGATCACTCTTGCTGATTGTCTTGCTTGTAGCGGCTGTATCACGTCAGCGGAGAGTGTACTCATTACAAAACAGAGCCAAGAAGAGTTACTTAGAGTATTCTCTGAGCGAAAATACACAGACAGCCGAGGCGTTATACAAGATGTCAGTCTCATTGTGATATCAATTTCTCCGCAACCTCTACTCTCACTAGCTGTGAGATATAAGCTAGAGCCAGAAGAAGCTACTAGAAAATTAGCCGGTTACTTCAGGAGTCTGGGCGCAGACCTGGTGCTGGACATGACAGTGGCTGAAGACCTGTCCCTGATGGAAGCCCAGCAGGAGTTTGTCCAGCGGTATAGAGATCAAGCAGACTCAGATGTTAAGACACTGCCAATGTTAGCCAGTGCTTGTCCAGGCTGGGTGTGCTACGCTGAGAAAGCGCACGGCAGCTACATCCTCCCTTACATCTCCACCACCAAATCCTCGCAACAAGTCATGGGGTCGCTGGTGAAGCAGTTCCTCGCTACCAAGAGACAGCTCGCGCCGGCTGCCCTCTACCACGTGACTCTGATGCCCTGCTATGACAAGAAGTTGGAGGCTTCCAGGGAGGACTTCTACAACGAGATATTGAACTGTCATGATGTGGACTGTGTCATAACACCCATCGAGTTGGAGCAAATGCTGACCAACCAGGACAAGGATCTGTCAGACTTCCCGGACAGTTCTCTGGACTGGTGCTGGGATGTGGCGATGACGCCGGGTGTGAGGCGCCACGGGGGCCGGGGGGCGGGGTCCTCGGGCTCCGGGGGACTCGCGGACGAGGTGTTCATGTACGCGGCCAGGGAGCTCTTCGGGGAGGAGGACGTGCCGCTCGTCTACAAGAACCTCAGGAATCCCGACTTCCGGGAAATAACTTTGGAGAAGGATGGCCGGGAGGTCCTGAGGTTCGCCATCGCCAACGGCTTCCGGAACATACAGAACCTGGTGCAGAAACTGAAGAGGGGCAAGTCTCCCTACCACTACGTGGAGGTCATGGCCTGCCCTTCAGGTTGTCTGAACGGCGGCGCCCAGGTGCGACCAACCGAGGGTGAGAGCGGTCGCGCGCTGGTGGGGAGGCTGCAGGAGCTGATGGAGACTCTCCCGCCCGCGGAGCCCTCCGGGACCGCGGTTAGACACCTCTGGAGCGCCTGGCTCGGGGCTGCGGGCCCGGAGCGAGCGAGACACGCGCTACACACCACCTACCACGCTGTGCAGAGTAACGACATCGCACTCACCACCAAGTGGTGA

Protein sequence:

>DPOGS212431-PA
MASRFSGALQLTDLDDFITPSQECIKPVKIEKKKTHTGSKIKIGEDGYFDLSSGKEQKLQKVEITLADCLACSGCITSAESVLITKQSQEELLRVFSERKYTDSRGVIQDVSLIVISISPQPLLSLAVRYKLEPEEATRKLAGYFRSLGADLVLDMTVAEDLSLMEAQQEFVQRYRDQADSDVKTLPMLASACPGWVCYAEKAHGSYILPYISTTKSSQQVMGSLVKQFLATKRQLAPAALYHVTLMPCYDKKLEASREDFYNEILNCHDVDCVITPIELEQMLTNQDKDLSDFPDSSLDWCWDVAMTPGVRRHGGRGAGSSGSGGLADEVFMYAARELFGEEDVPLVYKNLRNPDFREITLEKDGREVLRFAIANGFRNIQNLVQKLKRGKSPYHYVEVMACPSGCLNGGAQVRPTEGESGRALVGRLQELMETLPPAEPSGTAVRHLWSAWLGAAGPERARHALHTTYHAVQSNDIALTTKW-