Monarch geneset OGS2.0

DPOGS212543
Transcript        DPOGS212543-TA   1440 bp
Protein           DPOGS212543-PA   479 aa
Genomic position  DPSCF300315 + 66419-80338
RNAseq coverage   149x (Rank: top 53%)
Annotation
Heliconius      HMEL014543        0.0     76.33%
Bombyx          BGIBMGA008132-TA  2e-108  55.71%
Drosophila      wrapper-PA        8e-62   33.71%
EBI UniRef50    UniRef50_D6WF34   2e-86   44.78%  Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WF34_TRICA
NCBI RefSeq     XP_966538.2       7e-95   45.23%  PREDICTED: similar to lachesin, putative [Tribolium castaneum]
NCBI nr blastp  gi|189235691      1e-93   45.23%  PREDICTED: similar to lachesin, putative [Tribolium castaneum]
NCBI nr blastx  gi|189235691      4e-92   45.23%  PREDICTED: similar to lachesin, putative [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain   [194-296]  IPR013783  6.7e-16  Immunoglobulin-like fold
                  [213-298]  IPR013098  1.1e-10  Immunoglobulin I-set
                  [281-418]  IPR008957  1.2e-09  Fibronectin type III domain
                  [22-104]   IPR013106  6.1e-06  Immunoglobulin V-set
Orthology group   MCL15883   Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212543-TA
ATGGATATAATATTTAAATTTGTATTTATTTTTGTTTTAAAATTAGCGTTGAGTGAAAAAGTTTTCAAAAGTGTACCAGTTGTGGTCAAGACTTACGAAAATGACAGCGTGTTGTTGCCTTGTTACGTTAATCATGAAGATGGCGAATTAAGAGTTCTCTGGTACAAAGATTCAACACTCCTCGGGGACAGTGCGGATCCTAACCGCCTTTTGCCGCTTCGTACAAGGATGCACGCCAACTACAGTCTCCAGGTCGACGGTTTGATAGCAGACGACACAGCAGATTACACTTGCGAGGTTATTCGTCCAGAGCCCTTGGGTCCCGTCAGACAGACACACGCCATCAAAGTTCAATATGCTCCAATAGTGAGGACTATACCTGAAGAAGGTTATTTGGAAGTCAAGAAAGGTGAATACGTTGACATTGGTTGTGAGGCGACTGGGACACCTACTCCTATAGTCAATTGGAAGAAGAATGGAGAGTCCATGGCGCTACTGGAACACAGGTCCAGGATTCGGTTCCGCGCTGAACACCGTCTTCTAGCTGGGGTGTACGAGTGTACAGCAACCAATGGCGTCGGCGACCCCATGACAGCGGCAATAACAGTTATAATACAAGACGCTCCAGTAGTAACCACATCTCGTAGTTTCGTTCATACGGCTATAGGGCTGAGAGCGGTGCTGGCATCCAAGCTAGAGTTTGCAGCACCCCCAGCTCGCACGGCCTGGTACAGAGATGGAAAACCAGTTCGGACAGACGACAGAATTATAATAATGGTCAAGGACAATGTCCATCAGTTAATATTTAGGAGCGTCCGGAAATCTGATTTCGGTAACTACACCTTCAGAGCTGAGAATAGTCTTGGTATGGCCGATGTTTCGTTCAAATTGACGGGTGTTCCAAATACCGCGTCATTTAAAGTGGATCCCTCTCTAAACAAAGCAGATGCAACAAGTTACACACTGCTGTGGGAAGTCGATAGCTACTCCAATATCATAGAGTATAATCTTTGGCTTCGTCCATACTACGGTCGTCCCGCTACCACGGAATCGGACTTCATAACGACCGAGACTCCAAACGTCTGGTCAAAGATCGTGGTACCGGGAGACTCTAATGAAGGTCCAATACACAGCGCCGCTTATTCCGTCAGAGGCTTAACTCCGTCTACCGTCTATGAGGCGGTGGTCACGTCTAGAAATAGATTTGGATGGAGTAAGCCTTCCGCTGTTCTACATTTCGCTACAGAGCCTGGAGCCGGAAAAATTTTACTCTCGACATCGGATTACACCGATTTCACTCCAATCTTAGAAGATCCTGAACCACAGCAACTTTACAATATAACACAGGCGCAAGTTTTCGAACGTTTCAACGAACTGTCAAATTCATCGAGGCGGGAAAAAATATCATTAACCTCTCGTTCGATGGCATTTTTGATATAA

Protein sequence:

>DPOGS212543-PA
MDIIFKFVFIFVLKLALSEKVFKSVPVVVKTYENDSVLLPCYVNHEDGELRVLWYKDSTLLGDSADPNRLLPLRTRMHANYSLQVDGLIADDTADYTCEVIRPEPLGPVRQTHAIKVQYAPIVRTIPEEGYLEVKKGEYVDIGCEATGTPTPIVNWKKNGESMALLEHRSRIRFRAEHRLLAGVYECTATNGVGDPMTAAITVIIQDAPVVTTSRSFVHTAIGLRAVLASKLEFAAPPARTAWYRDGKPVRTDDRIIIMVKDNVHQLIFRSVRKSDFGNYTFRAENSLGMADVSFKLTGVPNTASFKVDPSLNKADATSYTLLWEVDSYSNIIEYNLWLRPYYGRPATTESDFITTETPNVWSKIVVPGDSNEGPIHSAAYSVRGLTPSTVYEAVVTSRNRFGWSKPSAVLHFATEPGAGKILLSTSDYTDFTPILEDPEPQQLYNITQAQVFERFNELSNSSRREKISLTSRSMAFLI-
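
The transcript and protein lengths in this record are consistent with each other: 1440 bp is 479 codons plus one stop codon, and the stop appears as "-" at the end of the protein sequence. The following is a minimal Python sketch of that translation. It assumes the standard genetic code (NCBI translation table 1) and that the transcript record is the complete coding sequence. The names `translate_cds` and `stop_symbol` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (this record uses "-").
    Codons containing characters other than T, C, A, G become "X".
    Translation does not halt at a stop codon.
    """
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First six codons of DPOGS212543-TA
print(translate_cds("ATGGATATAATATTTAAA"))  # MDIIFK
```

Applied to the full DPOGS212543-TA sequence, the result can be compared against DPOGS212543-PA, and the length check `len(cds) == 3 * (479 + 1)` should hold for the 1440 bp transcript.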