Monarch geneset OGS2.0

DPOGS203828
TranscriptDPOGS203828-TA1440 bp
ProteinDPOGS203828-PA479 aa
Genomic positionDPSCF300010 + 2430457-2446932
RNAseq coverage83x (Rank: top 64%)
Annotation
HeliconiusHMEL0133500.093.79% 
BombyxBGIBMGA003732-TA2e-18093.01% 
DrosophilaCG31708-PA2e-15366.02% 
EBI UniRef50UniRef50_Q6NNU32e-15065.78%RE04226p n=28 Tax=Pancrustacea RepID=Q6NNU3_DROME
NCBI RefSeqXP_973967.21e-15576.27%PREDICTED: similar to CG31708 CG31708-PB [Tribolium castaneum]
NCBI nr blastpgi|2700075653e-15570.35%hypothetical protein TcasGA2_TC014162 [Tribolium castaneum]
NCBI nr blastxgi|2700075653e-16371.18%hypothetical protein TcasGA2_TC014162 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[309-383] IPR0137836.5e-23Immunoglobulin-like fold
[201-285] IPR0130981.1e-13Immunoglobulin I-set
[207-274] IPR0035984.3e-11Immunoglobulin subtype 2
[97-192] IPR0035994.4e-09Immunoglobulin subtype
Orthology groupMCL13353 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203828-TA
ATGACACGTGAAAGGCGTCGAGAGTGCCGGGTTGCTGCCTCTACAAAAGGCTTACAGATGCGCGTTCCAGAAGATTTTAATGAGGCAACGACTCTGTGTGATTTTATCGCGTATGCCTATAGAGATAATAGGGTGCGTTGTGTGCCAATGCGCGATTTACATCAGTTGAGTCAGCGCAACCGGGTAGAAAATGGAGCATCGTGCTTAGCTAATACGATCAGGCGAACAAACACGCTACACAGCTTTCATCTTCACTCAGTTGTGGAGGAACCGGAATTCACCGATGTCATCCAAAATGTGACAGTTCCGGCCGGCCGTAGTGTGCGATTGGCATGCTCCGTCAAGAATCTTGGTTCTTACAAGGTTGCATGGATGCACTTCGAACAATCAGCGATTTTGACTGTACACAACCACGTGATAACCCGCAATCCGCGCGTTAGTGTCACCCACGATAAACACCGCACGTGGTTCCTCCACATATCAGACGTAAGGGAGGAAGACAGAGGTCGATACATGTGCCAAATAAACACAGTTACCGCTAAAACACAATTTGGATACCTCCACGTCGTTGTCCCCCCGTCCATTGACGACTCGTTGAGCTCGAGCGACGTTATCGTCCGAGAGGGCGCCAATGTCACTCTGATGTGTCGCGCTAATGGCTCTCCTAAGCCCACCATCAAGTGGAAGCGAGATGACAATTCAAAAATCTCCATCAGTAAAGGACACTCCGTTTCCGAGTGGGAGGGTGAGGTTTTGGATATGGCGCGAATTTCGAGACTAGATATGGGTGCCTACTTATGCATTGCTAGTAATGGTGTGCCGCCAACCGTTTCTAAGAGAGTAAAAGTTAGCGTGGATTTCCCACCTATGCTATGGATACCGCATCAGTTAGTAGGTGCTCCCTTATATTACAACGTTACTTTAGAGTGTTTCACTGAAGCTCATCCAACTTCGTTAAATTACTGGACACGGGATGACGGACACATGATTCACGAAAGCCCTAAGTACCATATGGAGAATACTGTGGGAGTCCCGCCCTACAAAACCCATATGAAACTTCTCATCAGGCATATTGTTACTGAAGATTACGGAACTTATAAATGTGTGGCAAAAAATCCTAGAGGAGAATCGGATGGTACAATACGATTGTACACTTCTTCCCCACCAACTACGACACCGGATCCGAGGGCCGTCACTGTTCCACCACCATCGCGCCCACCACGACGTGATACGCCAGTAACTGATAAGGGAACGAAGTATCAATCGAACTTGAACGAGATCGATAAGGGGAAGCAAAAGTCTGACGAGGGTGGCGGGAAGACACATCTCAACTGGGTAGGCGGTGCATCACAAGTTACAGATAACGCTGCAAGAAGGAGAAGCCATCCAATGAAGTTACTCGTAGTCTTTGTATCAATTTATATCAATGTATTTATTTAA

Protein sequence:

>DPOGS203828-PA
MTRERRRECRVAASTKGLQMRVPEDFNEATTLCDFIAYAYRDNRVRCVPMRDLHQLSQRNRVENGASCLANTIRRTNTLHSFHLHSVVEEPEFTDVIQNVTVPAGRSVRLACSVKNLGSYKVAWMHFEQSAILTVHNHVITRNPRVSVTHDKHRTWFLHISDVREEDRGRYMCQINTVTAKTQFGYLHVVVPPSIDDSLSSSDVIVREGANVTLMCRANGSPKPTIKWKRDDNSKISISKGHSVSEWEGEVLDMARISRLDMGAYLCIASNGVPPTVSKRVKVSVDFPPMLWIPHQLVGAPLYYNVTLECFTEAHPTSLNYWTRDDGHMIHESPKYHMENTVGVPPYKTHMKLLIRHIVTEDYGTYKCVAKNPRGESDGTIRLYTSSPPTTTPDPRAVTVPPPSRPPRRDTPVTDKGTKYQSNLNEIDKGKQKSDEGGGKTHLNWVGGASQVTDNAARRRSHPMKLLVVFVSIYINVFI-