Monarch geneset OGS2.0

DPOGS201897
TranscriptDPOGS201897-TA1779 bp
ProteinDPOGS201897-PA592 aa
Genomic positionDPSCF300191 + 453461-457805
RNAseq coverage6x (Rank: top 87%)
Annotation
HeliconiusHMEL0147454e-16164.80% 
BombyxBGIBMGA006070-TA3e-11267.56% 
DrosophilaCG18130-PA1e-3945.16% 
EBI UniRef50UniRef50_Q7Q0634e-6031.80%AGAP012275-PA n=1 Tax=Anopheles gambiae RepID=Q7Q063_ANOGA
NCBI RefSeqXP_001652618.15e-6332.49%hypothetical protein AaeL_AAEL007253 [Aedes aegypti]
NCBI nr blastpgi|1571155791e-6132.49%hypothetical protein AaeL_AAEL007253 [Aedes aegypti]
NCBI nr blastxgi|1892375102e-6833.64%PREDICTED: similar to CG14221 CG14221-PA [Tribolium castaneum]
Group
Gene OntologyGO:00150351.1e-60protein disulfide oxidoreductase activity
GO:00090551.1e-60electron carrier activity
GO:00066621.1e-60glycerol ether metabolic process
GO:00454541.1e-60cell redox homeostasis
KEGG pathway 
InterPro domain[67-223] IPR0057461.1e-60Thioredoxin
[67-178] IPR0123361.7e-15Thioredoxin-like fold
[69-163] IPR0137663.1e-08Thioredoxin domain
Orthology groupMCL24821 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201897-TA
ATGTATAATCCAATCTGCCTCATCCATAACCCTATCAATGGACGTCACCGCCATCCCGGGCGCACCTTCCCCCCCCCCTTCCCCTCTGCTGGTCGTTCACTTACCATCATGTCCGTAACGAGCGCGGTCGCGAACGCCGCCGCCGCCGCCGCCGCGGCCGGCACAGGCAAGAAGGCCGCTCAGGTACAGTTACAGGCGGAGCTGAACAACGACGATGAATGGAACAAGTTTCTTCTACGAGACGGACTCCTCGTGATCGACGTCTACACGGAGTGGTGCGGCCCGTGCATAGGAATGGTGGGGAATCTGAAGAAAATCAAAGTTGAGATCGGAGGAGATAATTTACATCTGGCGGTGGCGAAGGCGGACACCATTGGATGTCTGTCTAGATTCAGGAACCGCAGCGAACCAACTTGGATGTTTATTTCTGGTGGTCAATTAATTAATGTGGTGTTCGGCGCGGACGCTCCTCGCCTCGCTCGCACGATCGTGGAAGAGCTGAAGAATGAAGAGCTGGTGAAGAAAGGGGAGAGAGAGAGACCGACACGAGCTCCACACGAACTCACTCCACCGGAACAGGAGGTCGCCTTGGCCCAAGCAAAGCTTCTCCAGCTACGCAAAGAAAAAGAGGCGGCGGCTGCGGCAGCGGAACGACTTGAAAGAAGAGAAGCGCGAGCAGTCGCCCTAGAGGTACACTTCAATGACGTGTGTCCCGCGCTTATGATGCCCCACTCACAGAAAAATATACGAAAAGTCTCGGACGCGCTGGAGCCTTACGGAGTAATTGTCGCTGACAAATGCCCATTAGTGCTGGGAAAAGATGGAGCGAAAGTTCTTGGCGTGGAAGATCCTGAATTTGCAAAACCAGAAACCGCGATGGCTTTACTCGAAAGACCAGCACTTGTACTGCTGTTGAAGAAACTACCTGACAAGGAAGGTAGTGTCATCGAACTGGTCCGTCGCGCGATTTATAACGAAGGTATAGAATCAGATGAGGACGACACGAAGAAATCTCTGGCAGAGGAACTGAGGGCTAACGGTATTCCCGGCGTGTTCGTACCGACCGACCGTCATCAGAGAGCTTCCGCACTGGACCTATTCTTCCCGAAGATGGTGTCGGCGGTGGCGGCTCCGCTGACGGCCCCGGAGCCTCCGCACGTGGCGATGATACTGGGAGCGTGGCAGAGACGAGCCGTGCTCAATATCATCGCCAGCAAGCTGCCCTCCAGGCTCCTGAGATATGGCTTCTTTAAAGACGCCGACGTCGAGCAACCGACACTACTGTGCAAGACCATCGACCAGTATGAGGAACGACCGGAGAAAGACTTTTCGGAGACTATCGTGCTGATGATATCGGTGGGTGTGACGGATCCTGGCGCTGAGGGGGCGCCGGTGACGGAGGGAGTTCCACACGAGCTCCTCTCACTAGGACCTCTGTGGGTCAGCGAGGATGCGGTGCTGGGGAAAGAGGAATGCGCGAGGTTCTTCCCCCCGGGGTACAGCGAGCCGGAGAAGAAACCCGGCCCCAAACCCAAGAAGAAGAAGAAGAAGCGTCACGACACTAGAGAAGAAACCGCGGACAACGTGGATGCGCCCCCCGGGACTGCGCCGGACACGGAGGCGGGAGATGGATCCGTGGAGGGAGACCCTGACCCGGAGGGGGAGGAGGAGGGGGAAGAGAGGGAGGAGGAGGGGCAGGGAGACGGAGACGGGGAAGCGGAGGAGGGGGAGGAGTTACTCCTGGACAAGGGAACCTCGCCGCCACCAGTCAACGATTAG

Protein sequence:

>DPOGS201897-PA
MYNPICLIHNPINGRHRHPGRTFPPPFPSAGRSLTIMSVTSAVANAAAAAAAAGTGKKAAQVQLQAELNNDDEWNKFLLRDGLLVIDVYTEWCGPCIGMVGNLKKIKVEIGGDNLHLAVAKADTIGCLSRFRNRSEPTWMFISGGQLINVVFGADAPRLARTIVEELKNEELVKKGERERPTRAPHELTPPEQEVALAQAKLLQLRKEKEAAAAAAERLERREARAVALEVHFNDVCPALMMPHSQKNIRKVSDALEPYGVIVADKCPLVLGKDGAKVLGVEDPEFAKPETAMALLERPALVLLLKKLPDKEGSVIELVRRAIYNEGIESDEDDTKKSLAEELRANGIPGVFVPTDRHQRASALDLFFPKMVSAVAAPLTAPEPPHVAMILGAWQRRAVLNIIASKLPSRLLRYGFFKDADVEQPTLLCKTIDQYEERPEKDFSETIVLMISVGVTDPGAEGAPVTEGVPHELLSLGPLWVSEDAVLGKEECARFFPPGYSEPEKKPGPKPKKKKKKRHDTREETADNVDAPPGTAPDTEAGDGSVEGDPDPEGEEEGEEREEEGQGDGDGEAEEGEELLLDKGTSPPPVND-