Monarch geneset OGS2.0

DPOGS203849
Transcript: DPOGS203849-TA, 1221 bp
Protein: DPOGS203849-PA, 406 aa
Genomic position: DPSCF300010 + 3039962-3054549
RNAseq coverage: 6x (Rank: top 87%)
Annotation
Source            Hit                 E-value  Identity  Description
Heliconius        HMEL003035          5e-155   77.43%
Bombyx            BGIBMGA003448-TA    4e-115   72.97%
Drosophila        CG32791-PA          4e-126   66.02%
EBI UniRef50      UniRef50_Q9W4R3     5e-124   66.02%    CG32791 n=25 Tax=Endopterygota RepID=Q9W4R3_DROME
NCBI RefSeq       XP_972399.1         2e-142   64.85%    PREDICTED: similar to CG32791 CG32791-PA [Tribolium castaneum]
NCBI nr blastp    gi|270014807        2e-141   67.05%    hypothetical protein TcasGA2_TC010789 [Tribolium castaneum]
NCBI nr blastx    gi|270014807        4e-139   65.03%    hypothetical protein TcasGA2_TC010789 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain
[249-322]  IPR013783  2.1e-20  Immunoglobulin-like fold
[229-322]  IPR013098  1e-14    Immunoglobulin I-set
[143-214]  IPR003598  8.1e-09  Immunoglobulin subtype 2
[235-325]  IPR003599  1.6e-06  Immunoglobulin subtype
Orthology group: MCL18285 Insect specific

Nucleotide sequence:

>DPOGS203849-TA
ATGGTCGCCTCGCAAGGCGCTTTCTCTGAGAAATATATAGCCTATTTGAAGCGCAGAAAGATGGGAGGTGTGGAGGTGCCAGAGTTCGGGGAGCCAATAACCAATCTGACAGTTCCCATCGGGCGAGACGCAACCTTCAAGTGTATAGTCGTCAATTTAGGCAATTATAGGGTTGGCTGGGTGAAAGCGGACACAAAAGCGATCCAGGCTATCCACGAACACGTAATCACGCACAACCACAGGGTGTCCGTATCACACGCTGACCACTCAACGTGGTATCTTCACATAAAGAACGTGCAAGAAGAGGACCGCGGCCAGTATATGTGCCAAATAAATACCGACCCTATGAAGAGTCAGATGGGCTATCTCGAAGTAGTTATACCCCCTGACTTTATACCGGAAGAAACTTCTGGAGATACGATGGTGCCTGAAGGTGGGACGGCACGTGTCTCCTGTAGAGCAAGGGGGATCCCCCCGCCGAGAGTAATGTGGAAACGCGAAGATGGCCAAGAAATAGTCGTGAGGGACGCTACTGGGGCAAAGACAAAAGTGCTTACATACCAAGGTGAGGTATTAAAGTTGACCAAGATATCGCGTTCCGAAATGGGCACATACTTATGTATAGCCGGTAATGGAGTGCCGCCCACAGTGAGTAAGCGGATGCACATAAGTGTACATTTTCATCCAGTGATCCAAGTGCCAAATCAGTTGGTTGGTGCGCCGCTCGGCACCGACGTTACCCTCGAATGCTACGTCGAATCGTCGCCAAAATCCATCAACTACTGGGTCAAAGATCCCGGTGAGCTGATAATACCATCTGAACACCACGAGATGACTGTCAGACAAAAGTCGATGTTCGAAGCTGAGATGTCCATGACTATCAAGAACATCAGACGAGAGGACCTGGGAAGTTACATATGTGTAGCGAAAAATTCTCTTGGCGACGTTGAAAGCAAAATCCGATTATACGAAATACCAGGAAACGACAGACACATATATCAATACACTGATGAAAGAACGACTTCGGACGATGAATACGGCACAGAAGTATATGACGATGACTTTGAAGACAAAGAAAAGAAAAGTAATCGAGTCCCAGATCGTGCTAACAAATGGTACTCCAACGATGGTAAGCTCATCGTGACTGCTAACAACGCGATAGGCGTTCTGTCAGATGCTTATATCATTACACTAGTAAATGTACTCAAATTCTTAGCATAA

Protein sequence:

>DPOGS203849-PA
MVASQGAFSEKYIAYLKRRKMGGVEVPEFGEPITNLTVPIGRDATFKCIVVNLGNYRVGWVKADTKAIQAIHEHVITHNHRVSVSHADHSTWYLHIKNVQEEDRGQYMCQINTDPMKSQMGYLEVVIPPDFIPEETSGDTMVPEGGTARVSCRARGIPPPRVMWKREDGQEIVVRDATGAKTKVLTYQGEVLKLTKISRSEMGTYLCIAGNGVPPTVSKRMHISVHFHPVIQVPNQLVGAPLGTDVTLECYVESSPKSINYWVKDPGELIIPSEHHEMTVRQKSMFEAEMSMTIKNIRREDLGSYICVAKNSLGDVESKIRLYEIPGNDRHIYQYTDERTTSDDEYGTEVYDDDFEDKEKKSNRVPDRANKWYSNDGKLIVTANNAIGVLSDAYIITLVNVLKFLA-
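
The lengths given in this record can be cross-checked against each other. The sketch below assumes that the transcript consists only of coding sequence running from the start codon through the stop codon, and that the trailing "-" in the protein record marks the stop rather than a residue. The function names are hypothetical and are not part of any database tooling.

```python
def expected_protein_length(cds_bp: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of cds_bp nucleotides."""
    if cds_bp <= 0 or cds_bp % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    codons = cds_bp // 3
    # Assumption: the final codon is a stop codon and encodes no residue.
    return codons - 1 if includes_stop else codons


def is_consistent(transcript: str, protein: str) -> bool:
    """Check that a transcript and its protein record agree in length."""
    # Assumption: a trailing "-" or "*" in the protein string marks the stop.
    residues = protein.rstrip("-*")
    return expected_protein_length(len(transcript)) == len(residues)


# 1221 bp = 407 codons = 406 residues plus one stop codon.
print(expected_protein_length(1221))  # 406
```

Under these assumptions the 1221 bp transcript and the 406 aa protein listed above are consistent. Passing the two FASTA sequence strings to is_consistent repeats the check on the sequences themselves.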