Monarch geneset OGS2.0

DPOGS200096
TranscriptDPOGS200096-TA1779 bp
ProteinDPOGS200096-PA592 aa
Genomic positionDPSCF300044 + 69263-78153
RNAseq coverage177x (Rank: top 50%)
Annotation
HeliconiusHMEL0098096e-14571.66% 
BombyxBGIBMGA000661-TA0.072.10% 
DrosophilaCG9302-PA2e-13250.84% 
EBI UniRef50UniRef50_UPI0001CBB7AD6e-14447.99%UPI0001CBB7AD related cluster n=1 Tax=unknown RepID=UPI0001CBB7AD
NCBI RefSeqXP_970942.14e-16851.97%PREDICTED: similar to AGAP010217-PA [Tribolium castaneum]
NCBI nr blastpgi|910944858e-16751.97%PREDICTED: similar to AGAP010217-PA [Tribolium castaneum]
NCBI nr blastxgi|910944857e-17651.97%PREDICTED: similar to AGAP010217-PA [Tribolium castaneum]
Group
Gene OntologyGO:00454545.4e-28cell redox homeostasis
GO:00150354.8e-06protein disulfide oxidoreductase activity
GO:00090554.8e-06electron carrier activity
GO:00066624.8e-06glycerol ether metabolic process
KEGG pathway 
InterPro domain[228-351] IPR0123361.5e-34Thioredoxin-like fold
[231-333] IPR0137665.4e-28Thioredoxin domain
[249-257] IPR0057464.8e-06Thioredoxin
Orthology groupMCL16865 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200096-TA
ATGGTTCTGTATATCAATAATGTAAAAGCTACTCAGTCTGTTTTGGATGTATTCAAAGAATCTGCTGGTAATATGAAGGGGCAAGCAACTTTGGTTGCTATTGACTGCAGTAACAGTGATGGTAAGAAGTTATGTAAAAAGTTGAAAGTTCCATCTGACAAGTCCTATATACTCAAACATTACAAGGATGGGGAATTCCATAAGGATTATGATCGTGGCATTTCTGTAAGTGCTATGGTGAACTTCCTGAGAGATCCAACTGGGGATCTGCCATGGGAGGAGGATCCAAATGCTACAGACATCATACATCTTATTGATGCAGAGGCAAGTGCATTAAACAAGTTCCTTAAGAAAGGCATTGCTACATACAAAAAGGCAATGATCATGTTCTATGCTCCGTGGTGTGGTTATTGTAAATCTTTGAAGCCTGATTATGTTGCTGCAGCAGCTGACTTAAAGGGAGAAGCATTCCTAGCTGCGATAGATGTGTCTAAACCTGGTAACTCTAAGATAAGACAAGTGTATAACATAACTGGTTTCCCGACTTTGTTGTTCTTTGAGAAAGGTCAATATCGATTCCCTTATAATGGAGACAACAAACATAAAGCAATTGTAAACTTCATGAGGGACCCGACGTCACAGATGGTTAAAAAGGAACCAGTAGATGAAAGTTGGTCTACTGATTCAGATGTTATACATTTAACAGAGAGCACATTTGACAGTGTCCTATCAAAAGCTGAACACGCCCTCGTGGTCTTTTACGCACCGTGGTGTGGTCATTGTAAGAGAATTAAACCTGAGTTCGAGAAAGCTGCCACTAAGATTAAAAGAGAAAAAATAAACGGTGTGCTTGCTGCTGTGGACGCCACCCAGGAATCCAGTTTAGCTTCTCGCTTCGGGGTGAAAGGTTACCCCACATTGAAATACTTCAGTAAGGGGGAGTATAAGTACGACGCCGGTCATGCTCGCCAAGAGGAACAGATTATCGAATTTATTAAGTCGCCCCAGGAGCCGCCGCCGCCGCCGCCGCCCGAGGTCCCCTGGTCCGAGCAGGAGTCGTCCGTGCGCCACCTCGACACCGCGACCTTTAAGAACACGCTGCGGAAGATCAAGCACGCGCTCGTCATGTTCTACGCCCCATGGTGCGGTCACTGCAAAAGTACAAAACCAGAGTTTGTGAAGGCCGCAGATAAATTCGCTGATGAACTGATAATAGCGTTTGGTGCTGTTGACTGTACTGTACACAAAGATGTGTGTGCCAACTATGACGTCAAAGGTTATCCCACCATCAAGTACTTTAGTCATTTCGACAAAGTAGTTCAGGATTATACCGGAGGAAGAAAGGAAGCAGATTTCGTATCATTTATCAACAATCAGTTGGACAGACAACAGTTATCACAGAAGGCTAAGAGCAATCAGGAAGCTGGTTTTGGTACAAACGTGCAACTAGCTGATGATAGTGACTTCACTGACATCATTGCTAATGATAAACCCACGTTTGTCATGTTCTATGCTACTTGGTGTGGCCATTGTTCGACCGTGAAACCAGCTTTTAGTCGACTAGCCACGTCTTTAAAGGAAGGGAACGGCAGAGCCGTAGCCATAGCTGTGGATGCAGCCGAAAATCCAAAGGTCGCTGACCTTGCTTCCATACAAACACTACCAACATTCAAAATATTCAAAGCCGGCCAATATCTAGCGACTTACGAAGGTGATAGATCATTTGAAGACATGATGAATTTCGTACAATCTTATATCAAAATGAAGGATGAATTGTGA

Protein sequence:

>DPOGS200096-PA
MVLYINNVKATQSVLDVFKESAGNMKGQATLVAIDCSNSDGKKLCKKLKVPSDKSYILKHYKDGEFHKDYDRGISVSAMVNFLRDPTGDLPWEEDPNATDIIHLIDAEASALNKFLKKGIATYKKAMIMFYAPWCGYCKSLKPDYVAAAADLKGEAFLAAIDVSKPGNSKIRQVYNITGFPTLLFFEKGQYRFPYNGDNKHKAIVNFMRDPTSQMVKKEPVDESWSTDSDVIHLTESTFDSVLSKAEHALVVFYAPWCGHCKRIKPEFEKAATKIKREKINGVLAAVDATQESSLASRFGVKGYPTLKYFSKGEYKYDAGHARQEEQIIEFIKSPQEPPPPPPPEVPWSEQESSVRHLDTATFKNTLRKIKHALVMFYAPWCGHCKSTKPEFVKAADKFADELIIAFGAVDCTVHKDVCANYDVKGYPTIKYFSHFDKVVQDYTGGRKEADFVSFINNQLDRQQLSQKAKSNQEAGFGTNVQLADDSDFTDIIANDKPTFVMFYATWCGHCSTVKPAFSRLATSLKEGNGRAVAIAVDAAENPKVADLASIQTLPTFKIFKAGQYLATYEGDRSFEDMMNFVQSYIKMKDEL-