Monarch geneset OGS2.0

DPOGS203652
TranscriptDPOGS203652-TA1179 bp
ProteinDPOGS203652-PA392 aa
Genomic positionDPSCF300010 - 2843657-2865716
RNAseq coverage16x (Rank: top 81%)
Annotation
HeliconiusHMEL0069277e-11170.69% 
BombyxBGIBMGA003449-TA3e-14584.78% 
DrosophilaCG42343-PF2e-14866.15% 
EBI UniRef50UniRef50_E2AYH21e-15065.30%Lachesin n=4 Tax=Pancrustacea RepID=E2AYH2_CAMFO
NCBI RefSeqXP_972650.24e-16078.39%PREDICTED: similar to CG32791 CG32791-PA [Tribolium castaneum]
NCBI nr blastpgi|3800149023e-15972.14%PREDICTED: lachesin-like, partial [Apis florea]
NCBI nr blastxgi|2700148043e-15473.42%hypothetical protein TcasGA2_TC010786 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[246-319] IPR0137832.5e-18Immunoglobulin-like fold
[232-319] IPR0130981.7e-11Immunoglobulin I-set
[21-123] IPR0035992.7e-08Immunoglobulin subtype
[140-211] IPR0035987.5e-08Immunoglobulin subtype 2
Orthology groupMCL16408 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203652-TA
ATGAATACCTGCTTTTGTATCGTCCCAGCGTTAGGTCTGGAGCCGGACTTCCTGTATCCGCTGGAGAACGTAACGATCGCTCAGGGAAGGGACGCGACATTTACGTGTGTGGTCAATAACCTTGGTGGTTATAGGGTGAGTGGCGACTCCGCCACCGCCAGGGTAGCATGGATCAAAGCGGATACAAAAGCGATTTTGGCGATTCACGAACATGTTATCACAAACAATGCAAGACTGAGCGTAACACACAACGACTACAACACATGGACACTAAATATTCGTGGAGTGAAAAGAGAAGACAGAGGACAATATATGTGCCAAGTAAATACAGATCCTATGAAAATGCAGACAGCATTTTTGGAGGTTGTTATTCCTCCAGATATAATTTACGAAGAAACCTCCGGAGATATGATGGTGCCGGAAGGTGGTGGGGCAAAATTAGTGTGTAAAGCCCGAGGATTTCCTCCTCCAAAAATTGTATGGAGAAGAGAAGACGGTGGTGACATTATATCAAGAGGTGGACCTCAAGGAAAAACAAAAGTGACTTCACTGGAGGGTGAGATCGTAAACCTCACTAAAGTGACGAGGTCCGAGATGGGAGCATATCTATGCATCGCAGCCAACGGAGTTCCACCTTCTGTTAGCAAAAGGATAATGCTTCATGTGCACTTTCACCCGCTAGTACAAGTACCGAATCAACTGGTTGGAGCTCCGACAGGAACGGACGTCACTTTGCAGTGTCATGTTGAGGCATCCCCAAAAGCCATCAACTATTGGACAAGGGAGAATGGTGAAATGATAATATCAAACGACAAGTATGAGATGAGCGAGATAAACAGTTCAGCTTATAGTGTTCAGATGAGGCTTGTTATACGGAATATACAGCGTAACGACCTTGGTGGATACAAGTGTATATCGAAAAACTCCATTGGGGACGCCGAGGGCAATATTAGATTATATGAGATGGAATTGCCATATCGAAAGACGCGTCTTGATGATGAAAGGGATTCAGAGCTAGAAGAAACAAATGACGTCAGGTCCAGCCTCCAAGGATCACTCCGTGAGGGTGGTCGTCGTGAAGACGTCAGTTTCCTCGGTGAAAGCGGTCCATCTAAGGCATCATCGAGTGAATCCACCTCCTACATAATGTTTTTAGTATCGGTATACCTGATATGTTAA

Protein sequence:

>DPOGS203652-PA
MNTCFCIVPALGLEPDFLYPLENVTIAQGRDATFTCVVNNLGGYRVSGDSATARVAWIKADTKAILAIHEHVITNNARLSVTHNDYNTWTLNIRGVKREDRGQYMCQVNTDPMKMQTAFLEVVIPPDIIYEETSGDMMVPEGGGAKLVCKARGFPPPKIVWRREDGGDIISRGGPQGKTKVTSLEGEIVNLTKVTRSEMGAYLCIAANGVPPSVSKRIMLHVHFHPLVQVPNQLVGAPTGTDVTLQCHVEASPKAINYWTRENGEMIISNDKYEMSEINSSAYSVQMRLVIRNIQRNDLGGYKCISKNSIGDAEGNIRLYEMELPYRKTRLDDERDSELEETNDVRSSLQGSLREGGRREDVSFLGESGPSKASSSESTSYIMFLVSVYLIC-