Monarch geneset OGS2.0

DPOGS203835
TranscriptDPOGS203835-TA1479 bp
ProteinDPOGS203835-PA492 aa
Genomic positionDPSCF300010 + 2593909-2600046
RNAseq coverage90x (Rank: top 63%)
Annotation
HeliconiusHMEL0069507e-5252.25% 
BombyxBGIBMGA003741-TA1e-6634.16% 
DrosophilaCG41378-PB1e-2033.33% 
EBI UniRef50UniRef50_Q1HPH44e-3138.46%Legumaturain n=2 Tax=Obtectomera RepID=Q1HPH4_BOMMO
NCBI RefSeqNP_001040501.17e-3238.46%legumaturain [Bombyx mori]
NCBI nr blastpgi|1140530351e-3038.46%legumaturain [Bombyx mori]
NCBI nr blastxgi|1140530351e-3136.45%legumaturain [Bombyx mori]
Group
KEGG pathwaytet:TTHERM_003738209e-25 
 K08059 (IFI30, GILT)maps-> Antigen processing and presentation
InterPro domain[305-483] IPR0049112.3e-38Gamma interferon inducible lysosomal thiol reductase GILT
[24-214] IPR0123364.7e-08Thioredoxin-like fold
Orthology groupMCL21020 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203835-TA
ATGGATAAGAATCTTTTTAGTCAACAACTTTGTTTGGACTATCGTGATTTCAAGTTAAATTATGTGTCGCCAATAAAACGCGCGAATACTTTAATGTCTACACTATTCAAGGATTTAGGGCAAAAAGAAAAAGTACGTATAGATCTGTATTATGAAGTCTTTTGTCCCCATTGCATAAACTTTGACGAAAATCAGTTTGGTCCCCTCGTAGAGAGTATGGGTGATTACCTTGATGTACATACATATCCTTATGGCAACGCTAAGACAATAGAGGAAAAAGGAAACATTTCCTTCCTATGTCAACACGGACCCCCTGAGTGCTATGGTAACAAACTCCACGCCTGTGCACTCGATAATTTGGAGCACTCCAAGGCACTTCTTTTTAATATTTGCCTGATGAACAGCACCGAGGGTGGAGGTTCTGATGACAAAACAGCTGATGAGTGCGGGAAAAACATGAACGTTGATTCAAAACCAATAAAACTTTGTGCTAAAGGTAATAGGGGTACAGAACTTTTAAAGTATTATGGAGAGGAAAGTAAGAAAGCGAATTTTCATCATGTGCCATATGTTCTCATAAACGGCAAGAAGTTTAATAATAGTGACTATTTTAAGAAAACAGTTTGCGAAGCCTTCAAAAATTCACCGCCGCCATGTCAAAGTATCGAGACAAACGATTTAAATGAAAGAGATTTCGCTATGAGATTGTTGAGCAATGACTACTATTTAACACAATTCAAGTTCATTCCAATCTACTCAGAGAAGGTAGATATAAAAATATATTACGAATGTCTTTGCCCTGATTGTATAAAATTTGACGTAGAGCAATTCAGTCCAGTCCTGGAAACTATGAATCAGTATTTAGAGATACACACTTACCCTTATGGAAACGCCAAGTTCATTCCAATCTACTCAGAGAAGGTAGATATAAAAATATATTACGAATGTCTTTGCCCTGATTGTATAAAATTTGACGTAGAGCAATTCAGTCCAGTCCTGGAAACTATGAATCAGTATTTAGAGATACACACTTACCCTTATGGAAACGCCAAGACTATAAAAAAAAATGGGAAAATCGAATTTATATGTCAACACGGACCCGCTGAGTGCTACGGTAACAAACTGCATGTGTGTGCTATCGACAGTTTGCAACACATTACTGCATTACGGTTTAACCTTTGTTTGATGAACAGCACAGAGGGTCGTGGTTCAGATGATAAAATGGCTGACAAGTGTGGACAATTAATGGGTTGTGATTCGGAAGCAATAAAAGCATGTGCCAGAAGCAATAGAGGTACAGAGCTATTGCTATATTATGGGGAGCAGAGTAAGATGGCGAATTTCAATTATGTTCCATACGTACTAATAAACGGTAAGGTGTATAACACTGATGGAAATAAGGACTTCAAAGATGCTGTTTGTGCTGCATTCGATAATCCACCGCCACCTTGTACAAATAAATATCTGGTAGACAAATAA

Protein sequence:

>DPOGS203835-PA
MDKNLFSQQLCLDYRDFKLNYVSPIKRANTLMSTLFKDLGQKEKVRIDLYYEVFCPHCINFDENQFGPLVESMGDYLDVHTYPYGNAKTIEEKGNISFLCQHGPPECYGNKLHACALDNLEHSKALLFNICLMNSTEGGGSDDKTADECGKNMNVDSKPIKLCAKGNRGTELLKYYGEESKKANFHHVPYVLINGKKFNNSDYFKKTVCEAFKNSPPPCQSIETNDLNERDFAMRLLSNDYYLTQFKFIPIYSEKVDIKIYYECLCPDCIKFDVEQFSPVLETMNQYLEIHTYPYGNAKFIPIYSEKVDIKIYYECLCPDCIKFDVEQFSPVLETMNQYLEIHTYPYGNAKTIKKNGKIEFICQHGPAECYGNKLHVCAIDSLQHITALRFNLCLMNSTEGRGSDDKMADKCGQLMGCDSEAIKACARSNRGTELLLYYGEQSKMANFNYVPYVLINGKVYNTDGNKDFKDAVCAAFDNPPPPCTNKYLVDK-