Monarch geneset OGS2.0

DPOGS210000
Transcript: DPOGS210000-TA, 1263 bp
Protein: DPOGS210000-PA, 420 aa
Genomic position: DPSCF300247 + 135572-161615
RNAseq coverage: 23x (Rank: top 78%)
Annotation
Heliconius: HMEL013350  9e-78  49.45%
Bombyx: BGIBMGA006372-TA  2e-92  52.40%
Drosophila: CG34391-PC  6e-151  79.15%
EBI UniRef50: UniRef50_A8JNC7  8e-149  79.15%  CG34391, isoform C n=26 Tax=Neoptera RepID=A8JNC7_DROME
NCBI RefSeq: XP_969598.2  5e-158  78.84%  PREDICTED: similar to AGAP004915-PA [Tribolium castaneum]
NCBI nr blastp: gi|270014616  2e-169  83.92%  hypothetical protein TcasGA2_TC004659 [Tribolium castaneum]
NCBI nr blastx: gi|270014616  1e-165  83.92%  hypothetical protein TcasGA2_TC004659 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain: [249-320]  IPR013783  4.9e-20  Immunoglobulin-like fold
                 [141-213]  IPR013098  1.2e-13  Immunoglobulin I-set
                 [147-214]  IPR003598  7e-13  Immunoglobulin subtype 2
                 [36-131]  IPR003599  4.5e-07  Immunoglobulin subtype
Orthology group: MCL17163  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210000-TA
ATGACCGGACGACGCGGACCCTTCAGGAGTTTCGCTATCAGCATCATTCAAATTATCACCATCATATGTCAAGTGTTGACTGAAGAGCCACGTTTTGCGGAACCCATACCTAACGTGACGGTAGCGCTCGGGAGAGATGCCAGCTTACCTTGCGTGGTTGAACACCTCGGCACCTACAAGGTGGCATGGATTCACATCGATCGTCAAATGATCCTGACCATCCACCGTCACGTGATCACCCGCCTCGCCAGATTCAGCGTCTCACACGACAACGCGATGACCTGGTTGCTCCACGTTAGCCAAGTACAGCAAGAAGACCGAGGGTATTACATGTGCCAAGTGAATACGAATCCAATGATCAGTCAAGTTGGATACTTACAAGTTGTTGTACCCCCGAATATATTAGATGAAGAAAGCACACAGTCAGCAGTGGCAGTTAGAGAAAACCAGAATATTAGCCTTATTTGTAAGGCAGATGGCTTCCCGACACCGAAAATTATGTGGCGAAGAGAAGATGGCCAGCCTATATCCGTTGACAGGAGAAAGAAAGTAACAGTCTACGAAGGAGACACGCTAAGTCTACAACGCATCAGTCGCACAGAGATGGGAGCGTACCTCTGCATCGCAACCAACGCGGTGCCACCCTCCGTCTCCAAGAGGATCATTGTGGATGTTGAATTTTCTCCCATGATCTGGGTACCCAACCAGCTAGTCGGCGCGCCTGCCGGCACTGACGTTACCGTGGATTGCCATACAGAAGCTCATCCACGAGCGATCTCATATTGGGTATACGATAGTGTTATGGTTCTACCAACCAAGAAATACGCCATCAACACAGAGGAAAACTCATACAGGGCCCACATGAAGCTGACTGTCAGAAATCTCCAAAATGGCGACTTTGGCAATTACAGATGCATTTCCAAAAATTCTCTCGGAGAAACCGAAGGGTCTATCAGATTGTATGAAATCCCGATGCCTTCGACGTCGCCTAAAGCTACAGAAATGAAGAGCAACGCCAATAAAGAAATCGTGCGTCGCATGAACGTGACGCGTGCGGGTTCTCACGAGTCGGTGACCGAGCGCCCAAGTGTGGTGCGCGCTCAGCTTGACCGAGCACCGGACCGCGGGCATGTCTACCGCGCGCCACATCCTCACCAGGCATCAGGTACCCGGAGTCTGCTATGTTGGCGTCAATCTTTCTTGGCTGTAATGATACTGGCTAATATGGACATCATTTCCGAATTCTTAATGTTATGTTTTTAA

Protein sequence:

>DPOGS210000-PA
MTGRRGPFRSFAISIIQIITIICQVLTEEPRFAEPIPNVTVALGRDASLPCVVEHLGTYKVAWIHIDRQMILTIHRHVITRLARFSVSHDNAMTWLLHVSQVQQEDRGYYMCQVNTNPMISQVGYLQVVVPPNILDEESTQSAVAVRENQNISLICKADGFPTPKIMWRREDGQPISVDRRKKVTVYEGDTLSLQRISRTEMGAYLCIATNAVPPSVSKRIIVDVEFSPMIWVPNQLVGAPAGTDVTVDCHTEAHPRAISYWVYDSVMVLPTKKYAINTEENSYRAHMKLTVRNLQNGDFGNYRCISKNSLGETEGSIRLYEIPMPSTSPKATEMKSNANKEIVRRMNVTRAGSHESVTERPSVVRAQLDRAPDRGHVYRAPHPHQASGTRSLLCWRQSFLAVMILANMDIISEFLMLCF-
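
The transcript and protein lengths listed for this gene can be cross-checked against each other: a coding sequence that ends in a stop codon should be three nucleotides per residue plus three for the stop. The Python sketch below is illustrative only and is not part of the geneset distribution. The function names `fasta_records` and `cds_matches_protein` are hypothetical, and it assumes the record contains a complete coding sequence with a single terminal stop codon (shown as a trailing "-" in the protein record above).

```python
def fasta_records(text):
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def cds_matches_protein(cds_len, protein_len):
    """Return True if cds_len nucleotides encode protein_len residues plus one stop codon."""
    return cds_len == 3 * (protein_len + 1)


# Lengths as listed for DPOGS210000-TA and DPOGS210000-PA.
print(cds_matches_protein(1263, 420))  # True
```

When working from the sequences themselves rather than the listed lengths, strip the trailing stop marker from the protein first (for example `seq.rstrip("-*")`) before taking its length.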