Monarch geneset OGS2.0

DPOGS212082
TranscriptDPOGS212082-TA1275 bp
ProteinDPOGS212082-PA424 aa
Genomic positionDPSCF300038 - 1235113-1245086
RNAseq coverage10x (Rank: top 84%)
Annotation
HeliconiusHMEL0125772e-6972.32% 
BombyxBGIBMGA006714-TA2e-14266.50% 
Drosophiladpr20-PA1e-5336.84% 
EBI UniRef50UniRef50_E2ABD74e-5440.88%Obscurin n=2 Tax=Formicidae RepID=E2ABD7_CAMFO
NCBI RefSeqXP_001601574.14e-5642.47%PREDICTED: similar to CG12191-PA [Nasonia vitripennis]
NCBI nr blastpgi|3454871981e-5442.47%PREDICTED: hypothetical protein LOC100117285 [Nasonia vitripennis]
NCBI nr blastxgi|3454871981e-5339.79%PREDICTED: hypothetical protein LOC100117285 [Nasonia vitripennis]
Group
KEGG pathway 
InterPro domain[187-281] IPR0137831e-10Immunoglobulin-like fold
[186-270] IPR0131061.5e-08Immunoglobulin V-set
[297-374] IPR0130981.6e-06Immunoglobulin I-set
[185-282] IPR0035992.8e-06Immunoglobulin subtype
Orthology groupMCL25267 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212082-TA
ATGTGCGCGCCGGCAGCCGCGGCGCTCTTCTTAGCCGCGCTCGCTGCACTTCCATGTAAAGGAACAGATGCAGACACCTCGACAATACGAAACGAAAATAGTGAGAAAAATCCACAGCCAATGCTGAATAAAGTATATCAGATGCTACACACCAGCACTGAACTTCATAACGAAACTAAACTGTCCTTTAATGCCTCACCAGTTTTACACAGTTTCGAAACAAATATAGAAAAATCACGACTAACATGCTGCCAGAATGACACACACAGTTTTTCAAGTGACGATGGTGCTACAGACAACCAGACAGATGAGGACAAGAGAATTAGAAATCCGCGACTTGTAAATGAAATGCCAGTGGAAAATGTCACGTTGCTTCACAATCCCACAGCAGGTACATCAATCGCAAGTCCAGTAGTCATGGAGAGAAAGAGGAAGTCAGGCGTATCTGGTTCAAGTGCCCCTTTACTGAATTACATATTCGATACTTATTCCAACACACATTATCATCGGAATGAAAAAACCAGCAACGGTAATAGTATATACGTAGCAGCCGGTCCCGAAATCGAAGCACTTGTGGGATCCACAGCCCACATGGACTGTAAAGTAGATGCCCTCCACGACAAGCTGGTGTCATGGGTACGTCGAAAAAATGACGAAGAACCAATGGAACTGCTTACAACTGGAACACAGCAGTACACTGCCGATAACAGGTACTCAGCTCGTTTCATCCCTCCTGATATTTGGCGGCTTGAAATAAAAGAAGTTAGGCCGACCGATGCGGCTTTTTATGACTGTCAATTATCCGCCCATCCACCCAGAACAGCTAGGGTTACTTTACGAGTTCCAGAGGTGTCAATTCAGATAGTAGATGGAGCTGGTGCTCCAGTCTCCGAACAAGTCTGCGAACTTGGCAGCACGGTGGCATTGCGATGCGAAGTGAGAGGCCTGCGTATGGAGGGTGGACCTTCATTACTTTGGTATAGGAAAGATGATTTGCTCAACGATGACACAACTAGAGGAGGAATTAGTGTAAGGACGGAATTTGGTCCTAACGGCGCTAGCTCGGTCCTGCGCGTAGCGCGCGTCAGAGGTGACGATGGGGGGCAGTACAGTTGCAGCATCGCACGGACACCCCCTCCACCACCTGCGCCTGCGCATGTTATATTACACATTATCAAAGGAGAGAGTTTAGCGGAGCTACATCAAGGCGTGGGACGTCAGGCCATATCACGATACCTAGTCATTGGTATAGTGGCACAACTTCTTTTGCAATAA

Protein sequence:

>DPOGS212082-PA
MCAPAAAALFLAALAALPCKGTDADTSTIRNENSEKNPQPMLNKVYQMLHTSTELHNETKLSFNASPVLHSFETNIEKSRLTCCQNDTHSFSSDDGATDNQTDEDKRIRNPRLVNEMPVENVTLLHNPTAGTSIASPVVMERKRKSGVSGSSAPLLNYIFDTYSNTHYHRNEKTSNGNSIYVAAGPEIEALVGSTAHMDCKVDALHDKLVSWVRRKNDEEPMELLTTGTQQYTADNRYSARFIPPDIWRLEIKEVRPTDAAFYDCQLSAHPPRTARVTLRVPEVSIQIVDGAGAPVSEQVCELGSTVALRCEVRGLRMEGGPSLLWYRKDDLLNDDTTRGGISVRTEFGPNGASSVLRVARVRGDDGGQYSCSIARTPPPPPAPAHVILHIIKGESLAELHQGVGRQAISRYLVIGIVAQLLLQ-