Monarch geneset OGS2.0

DPOGS200397
Transcript: DPOGS200397-TA, 1149 bp
Protein: DPOGS200397-PA, 382 aa
Genomic position: DPSCF300121, 170713-179491
RNAseq coverage: 13x (Rank: top 82%)
Annotation
Heliconius        HMEL007500         1e-139   89.34%
Bombyx            BGIBMGA014023-TA   8e-137   82.33%
Drosophila        dpr20-PA           1e-70    54.36%
EBI UniRef50      UniRef50_Q9W0L9    2e-68    54.36%   Dpr20 n=18 Tax=Neoptera RepID=Q9W0L9_DROME
NCBI RefSeq       XP_002067868.1     3e-70    47.39%   GK12675 [Drosophila willistoni]
NCBI nr blastp    gi|195440042       5e-69    47.39%   GK12675 [Drosophila willistoni]
NCBI nr blastx    gi|195586795       9e-67    51.88%   GD13504 [Drosophila simulans]
Group:
KEGG pathway:
InterPro domain:
[193-316]   IPR013783   4.9e-11   Immunoglobulin-like fold
[115-198]   IPR013098   1.7e-09   Immunoglobulin I-set
[115-212]   IPR003599   8.2e-09   Immunoglobulin subtype
[121-202]   IPR003598   5.4e-07   Immunoglobulin subtype 2
Orthology group: MCL17435, Insect specific

Nucleotide sequence:

>DPOGS200397-TA
ATGTGGAGTGTGGGAGTATTTTTATTTACTCAGATAGCCGTCTGTCAATTTACAACAGAAATATCAACGTCAACAACAGAAGAAGATCAAACTGTCACTCCATACTACGCTCTCAGTACTGGTGTCGTACCAGCTCCTGTGATGCTTCGCCTTGGAAACTCAACGATCGATAGCACAAAATCATTCAAAGTACCTCCACCTTCACAAGAAACGGATACAGGGACCACAACATCACCACCAAATGCAAACGCAGAAATGGTACCACATGTAGTACCAGTCAAAAACACTTATTTCGATCATGACATAAGATATGGTCCAACATTTGAGGATTCGAATGCAAATATGACGAAGATAACGATACAGCTGGGTGAAGACGCTCATCTGAATTGCAGGATTAGCCTCCTGCAAGATAAAACGGTTTCCTGGGTACGGCGTAGAGGTAGAGATGAGATACCCGAACTTCTAACAGTTGGTGCAGTGACATATGCAGCAGACATGAGAGTATCAGTTGGTAAACGCTATCCAGGGAACTGGAGATTGCTGATAAGGGAAGTGAAACCCGATGACGAAGGGGTTTACGAATGCCAGATATCAACACATCCACCACGAGTTAGCAGAACCTACTTGCATGTTAATACCCCGCAAGTTTGGGTGGTAGACGAAGCCGGAGGTCCGTTGCTAGAGAAGTACTATGAAGCGGAATCTACGTTAGCACTTATGTGTCGAGCAAGATATGTTGAAACACCATCAGTTCTCACTTGGCTTCATGAGGGCAGAGCTCTCAATTCTGACACGACTAGAGGAGGAATTAGTGTAAAAACAGAGCAAGTACCGGGCGGGGCGGACAGTGTGCTGCGTCTAGCTCGCGTCAATAGCAGTGACGCTGGCAACTACACGTGCGCTGTGCGAGGAGCTCGACCCCACACAGTAGCTGTGCATGTGCTTAATGAAGAAAGCCTAGCTGAGCTACACGCCGGAGTGACATCACTTAACCCGTCACTGCAAACTGTTACGTTATCCGTTTTAACTTTAATAATTATGCAATCTACGTTTATGTCTATTCAGATCTCTTTCACGATAATAGCATATAGACTTAGAGCCCATGGCAATACTATTGAATATTATCCTGTTATTCTGTTACCAACGTGA

Protein sequence:

>DPOGS200397-PA
MWSVGVFLFTQIAVCQFTTEISTSTTEEDQTVTPYYALSTGVVPAPVMLRLGNSTIDSTKSFKVPPPSQETDTGTTTSPPNANAEMVPHVVPVKNTYFDHDIRYGPTFEDSNANMTKITIQLGEDAHLNCRISLLQDKTVSWVRRRGRDEIPELLTVGAVTYAADMRVSVGKRYPGNWRLLIREVKPDDEGVYECQISTHPPRVSRTYLHVNTPQVWVVDEAGGPLLEKYYEAESTLALMCRARYVETPSVLTWLHEGRALNSDTTRGGISVKTEQVPGGADSVLRLARVNSSDAGNYTCAVRGARPHTVAVHVLNEESLAELHAGVTSLNPSLQTVTLSVLTLIIMQSTFMSIQISFTIIAYRLRAHGNTIEYYPVILLPT-