Monarch geneset OGS2.0

DPOGS205076
TranscriptDPOGS205076-TA1206 bp
ProteinDPOGS205076-PA401 aa
Genomic positionDPSCF300074 + 59450-65659
RNAseq coverage80x (Rank: top 64%)
Annotation
HeliconiusHMEL0121211e-7175.88% 
BombyxBGIBMGA006872-TA3e-5256.22% 
DrosophilaCG12484-PB5e-2231.11% 
EBI UniRef50UniRef50_E0VYF55e-2732.30%Fasciclin-2, putative n=1 Tax=Pediculus humanus corporis RepID=E0VYF5_PEDHC
NCBI RefSeqXP_002431149.11e-2732.30%Fasciclin-2 precursor, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420214332e-2632.30%Fasciclin-2 precursor, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420214334e-2732.30%Fasciclin-2 precursor, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00055152.1e-08protein binding
KEGG pathway 
InterPro domain[188-293] IPR0137831.2e-10Immunoglobulin-like fold
[21-87] IPR0109932.1e-08Sterile alpha motif homology
[185-291] IPR0131068.2e-08Immunoglobulin V-set
[183-285] IPR0035998.1e-07Immunoglobulin subtype
[30-81] IPR0115108.1e-07Sterile alpha motif, type 2
[313-375] IPR0131626.6e-06CD80-like, immunoglobulin C2-set
Orthology groupMCL25284 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205076-TA
ATGGATCCCGAAAAAGATGGGGTACTGAAGTTGCCCACCAAAAATCTCATCGATCCTACCGGCTTTGATGTGACTATGACGGCAGTACTGATCGGCTTAGGAGCTGAAAAATATATTGATGTTTTTAGAAAACATAACATTGGTCAATGCACTTTGATGGAATTAAATGATGAGGATTTGACAATACTAGGAGTAGACGATCCTTCCATAAGACGTAAATTAATTGAAGAAGTAAAGAATCTTCCTACCTACGACGAAACAAAGCAACCGGGACAAAGTAATAACTTAGATCTCATTGAAATAATAGACGTCATAGAAGAAAGCACCCAACACCTTTATAGGATATATTTAAGTATGATGACAAATACGCTTGCTCTTAAAAAAAACAAAGTGGAGGATTATCTCATTGAAAAAGATAAATATGCATCAAACATTTCAATATCCACACTCAGTGAGATGACTAATATTCTAAATTCCATGGATATTGCGCTGCACAGCCAAATTAAAGTACTATCACAGAGGTCCAGAGAGAGAAGAAATAAGAAAATCATAGTTGTGGAAGCTATATTGGGACAAAGCGCGGAGCTACCTTGCAACGTAACAGCAGAGAAAGAAGATGACAAGCTAAATATTTTGGCTTGGTATAGAAATGGTTCCACAACTGCCTTCTATAGCAGTCGAGACCTGCGTGGCGCGGGTAGCGTCATGTCGTCAGGCGGGAGATACCGTCTCGTGGCTTCAGAGAGTGAGGGTTTGGACAAGCTGCAGATCCTCAGCGTGAGGGCCTCTGATGCTGGGCTGTACCACTGTCACGCGGACTTCGCGACCAAGCCTCCACAGAGACTTTGGGTGATACACGAGAATGGTACACGCGTAACTACAGCTAGTATCGGCTCTAACACGAGCAGAAACATTGGTCCTTACTACGTCGGCGATACCGTGCATCTTTTTTGCGTCGCATTCGGCGGGAAGCCGCAATCGTCTCTTTCCTGGTGGGCGGACCAAAGAATGCTCAAGGACACGAGCACCCCCCTTTCGGAACAGCGCGTTAGGAGCGATCTCATGTACGGTCCACTGAGGAGGGAAGACCACGGACGAGTCCTCACTTGCTTCGCCAAAAACAACGAACGGACGCCACCACTCACCATTGACGTCACCATAGATATGTTCTCTGTACTTAATAGTATAGGAGAGGTGTACCTCTAG

Protein sequence:

>DPOGS205076-PA
MDPEKDGVLKLPTKNLIDPTGFDVTMTAVLIGLGAEKYIDVFRKHNIGQCTLMELNDEDLTILGVDDPSIRRKLIEEVKNLPTYDETKQPGQSNNLDLIEIIDVIEESTQHLYRIYLSMMTNTLALKKNKVEDYLIEKDKYASNISISTLSEMTNILNSMDIALHSQIKVLSQRSRERRNKKIIVVEAILGQSAELPCNVTAEKEDDKLNILAWYRNGSTTAFYSSRDLRGAGSVMSSGGRYRLVASESEGLDKLQILSVRASDAGLYHCHADFATKPPQRLWVIHENGTRVTTASIGSNTSRNIGPYYVGDTVHLFCVAFGGKPQSSLSWWADQRMLKDTSTPLSEQRVRSDLMYGPLRREDHGRVLTCFAKNNERTPPLTIDVTIDMFSVLNSIGEVYL-