Monarch geneset OGS2.0

DPOGS211687
Transcript:        DPOGS211687-TA    1146 bp
Protein:           DPOGS211687-PA    381 aa
Genomic position:  DPSCF300374 - 227717-232943
RNAseq coverage:   2508x (Rank: top 5%)
Annotation
Heliconius:      HMEL005375        2e-150  88.57%
Bombyx:          BGIBMGA011387-TA  7e-104  84.80%
Drosophila:      Lac-PA            5e-124  68.56%
EBI UniRef50:    UniRef50_F4WM47   5e-131  70.00%  Lachesin n=8 Tax=Arthropoda RepID=F4WM47_ACREC
NCBI RefSeq:     XP_001814833.1    4e-138  74.25%  PREDICTED: similar to lachesin [Tribolium castaneum]
NCBI nr blastp:  gi|3024084        2e-140  76.32%  lachesin [Schistocerca americana]
NCBI nr blastx:  gi|3024084        3e-136  76.32%  lachesin [Schistocerca americana]
Group
KEGG pathway:     mmu:22138  1e-21
                  K12567 (TTN) maps-> Dilated cardiomyopathy
                                      Hypertrophic cardiomyopathy (HCM)
InterPro domain:  [109-222]  IPR013783  5.2e-23  Immunoglobulin-like fold
                  [145-208]  IPR003598  1.6e-16  Immunoglobulin subtype 2
                  [239-315]  IPR013098  6.2e-13  Immunoglobulin I-set
                  [139-220]  IPR003599  2e-12    Immunoglobulin subtype
Orthology group:  MCL15187  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211687-TA
ATGCATATTGATAGAAATGCATTAAATAAATCTCATTTTTCAACAGTTTATGGTCAGAGGACACCAACTATATCCCATATAACCCAAGAACAGATCAAAGATATCGGAAGTCAAGTGGACCTGGATTGTTCGGTGCACTATGCTCAAGAGTACCCCGTGCTGTGGTTAAAATACGACCAGCTGAAGAGTACGCAGTCACTGCCACTGTCCATGAATTCAGGCCTCATAATCCGCGACTCGAGATTTTCTCTCCGTTTCGACGAAGCGTCCACAACCTACACACTATCGATCAAGGACATTCAAGAGACGGATGCTGGCTGGTATCAGTGCCAAGTGCTGATAAACGCCAACAGTAAGATAACCGGTGAAGTTGAACTGCAAGTGAGGCGGCCACCCATCATATCTGACAACTCCACAAGATCCATAGTCGCCAGCGAGGGGGAGAGTGCTAAAATGGAATGCTATGCCGGTGGTTTTCCTGTACCAAAAATATCATGGCGTCGAGAAAATAATGCCATACTACCGACTGGGGGCTCCATATATCGCGGAAACATCCTGAACATAGCATCAGTACATAAGGAAGATCGCGGCACGTATTACTGCGTGGCCGAAAACGGTGTGGGGAAAGGTGCGAGGAGAAACATCAATTTGGAAGTCGAATTCTCACCGGTTGTGACAGTACCGAAACCCAGATTGGGACAGGCCCTTCAATACGACATGGACCTGGAATGTCACGTGGAAGCCTACCCACCTCCAGCTATAACCTGGCTTAAGGACGAGTACGCCTTATCAAACAATCAGCATTACAGAATCTCTCACTTCGCTACCGCTGACGAGTTCACCGACACCACGTTACGAGTTATAACTATCGAGAAACGGCAATACGGACAGTTTAAATGTAGGGCGCAAAACAAATTGGGCAGCGACGAAGGGGTCGTCGAGCTATTCGGACGAGTTCGCTTTAAACACGTACAAAATACTATCACTCTCGCACGAGAGTCAATCCGGCGCATTAGTGTTCCTGATATGTCGCATCATTGGAAAGTTGGACCCGTCCATAGCTGCGAGGTAGTCCGTTATACCTGTCTGTCCGCCCGCATGCGGCCAGGCGAGATACGGCGCGGCGAGTGTCGGGAAAGCGAGTGA

Protein sequence:

>DPOGS211687-PA
MHIDRNALNKSHFSTVYGQRTPTISHITQEQIKDIGSQVDLDCSVHYAQEYPVLWLKYDQLKSTQSLPLSMNSGLIIRDSRFSLRFDEASTTYTLSIKDIQETDAGWYQCQVLINANSKITGEVELQVRRPPIISDNSTRSIVASEGESAKMECYAGGFPVPKISWRRENNAILPTGGSIYRGNILNIASVHKEDRGTYYCVAENGVGKGARRNINLEVEFSPVVTVPKPRLGQALQYDMDLECHVEAYPPPAITWLKDEYALSNNQHYRISHFATADEFTDTTLRVITIEKRQYGQFKCRAQNKLGSDEGVVELFGRVRFKHVQNTITLARESIRRISVPDMSHHWKVGPVHSCEVVRYTCLSARMRPGEIRRGECRESE-
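
The transcript and protein lengths in this record are consistent with each other: 381 codons plus one stop codon give 3 x 382 = 1146 bp. The sketch below shows one way to check this and to translate the start and end of the coding sequence with the standard genetic code. The function names `translate` and `expected_cds_length` are hypothetical helpers, not part of the database. The sketch writes a stop codon as `*`, whereas the protein record above ends with `-`.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def expected_cds_length(protein_length: int) -> int:
    """CDS length in bp for a protein of the given length, plus one stop codon."""
    return 3 * (protein_length + 1)


if __name__ == "__main__":
    # First six codons and last two codons of DPOGS211687-TA, copied from above.
    print(translate("ATGCATATTGATAGAAAT"))
    print(translate("GAGTGA"))
    print(expected_cds_length(381))
```

Translating the full nucleotide sequence the same way and comparing the result with the protein sequence (after replacing the trailing `-` with `*`) would be a simple end-to-end check of the record.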