Monarch geneset OGS2.0

DPOGS210902
TranscriptDPOGS210902-TA1788 bp
ProteinDPOGS210902-PA595 aa
Genomic positionDPSCF300045 - 253507-255309
RNAseq coverage332x (Rank: top 35%)
Annotation
HeliconiusHMEL0158260.086.46% 
BombyxBGIBMGA003086-TA0.082.24% 
Drosophilatrn-PA5e-10344.92% 
EBI UniRef50UniRef50_D6WNE79e-14058.73%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WNE7_TRICA
NCBI RefSeqXP_972275.16e-14856.06%PREDICTED: similar to leucine-rich repeat-containing protein 4B [Tribolium castaneum]
NCBI nr blastpgi|910837751e-14656.06%PREDICTED: similar to leucine-rich repeat-containing protein 4B [Tribolium castaneum]
NCBI nr blastxgi|910837751e-14656.06%PREDICTED: similar to leucine-rich repeat-containing protein 4B [Tribolium castaneum]
Group
KEGG pathwaydre:1001481534e-34 
 K06260 (GP5)maps-> Hematopoietic cell lineage
    ECM-receptor interaction
InterPro domain[304-352] IPR0004833.9e-08Cysteine-rich flanking region, C-terminal domain
Orthology groupMCL12681 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210902-TA
ATGCAGTTTTACACGGAACTTCAACACTTGGACCTGTCTCAGAATCATCTCGTCAGCATACCAATGAAAAACTTTGCATATCAACGAAAGTTACAAGAACTCCATCTTAACCATAACAAAATATCTTCAGTCACAAACACGACATTCCAAGGACTCAATTCATTGACCGTTCTCAACCTGAAACGTAACTTTTTGGAAGAACTTACAAATGGTGTATTTTCTACACTGCCGAGACTAGAAGAATTGAACTTAGGACAAAATAGAATATCAAAAATAGAGCCGAGAGCATTCGCTGGATTGTCTGCTTTGAGAATTCTTTATTTGGATGACAACGAGTTGAGTTCGGTCCCAACAACATCCTTTAGTCTTCTAGGCAGTCTCGCCGAGTTACACGTTGGCCTTAACGCTTTTTCTTTTTTACCTGATGATGCTTTCGCGGGTCTCAATAGGCTGGCAGTATTGGACCTTAATGGAGCTGGACTCTTTAATATAAGCGACTTTGCATTTAGGGGTCTCCCAGGATTAAGAAGCCTAAACCTTTTTGGGAACCGATTGAGTGTGGTTCCTACGCAACAGCTTTCTAGCTTGACGAGACTCGAAGAGTTATATATAGGCCAAAACGACTTTATCGTTTTAGAAAGTCACTCATTTAAAGGATTAAAAAATCTTAAACTTATAGACATAACGGGAGCGACTCAACTTAAACGAATAGAAAAAGGCGCTTTCGAAGATAATATCAACTTGGAATCTATTGTATTAACAAATAATAAAGAATTGTCCACCATAGAAGATTGTACTCTTCTAGGCTTGCCTAAATTACGACATGTATCATTGAGAGATAATGCCATAAAAGTGCTCAGTGAGAGCGTATTTGTAGGAAAAGAATTGAAGCAACTCGATTTAACAGACAATCCAATCATTTGCAACTGCAAAATTCTATGGTTACAGCAATTATTAAATGAGAAGAGCAATTTTTCTCAAGTGCAATGTGCCAGTCCAGAAAATTTAAAAGACAAATATTTAAAAACATTGACCGCCGAGGACTTGGAATGTGTTTTATACGATAGTCGACGGCAAACAATTATATGTATTGTAGGATTCGCGTGTCTCGCTGTTGTTGCAACACTGTTACTAATATTATACAGATATCGGAAGAGCATGCAGGAGAAACTCAAGGATTATAAGTGGAATAAGGGTCGTAAGAATTTAGAATACCACAAACCCATTTCCACGGAGGAGGACTGCATCGTTCGGGGCATCCACCCATCCCAGTACCCGGCGCCGCCGCACGCGCCCGGCCTCAGGCCTATCCCGCTAGAGCTTTACGCCTCCCCGAGCGTTTTCTTCATGTCAGGCGGCCAGGGCTCCGCGCGGCGGCCGCAGCGCTCGGGAGGCGGCGGCACTATGCCCGCCAACGGTTTCACGTACATCGGAGGCGATGGACGCCACCATCAACCACAAACCCTCAACAACGGTGCACCAGCGCACCACCTCAACAACGGCTCATTGCGTTCCTTGCCAGACAAAAAGAACCGCAATGGCGTCGTCTGTCACCCTGAAAACTTCCAACGTAATCTCGACACCCGATATTCGAGGAAACAGGAGAATGGTTACATACGTAACTCGGAAACCATAATAGGTTTTCCTCGGGACCGGGAGAGGGAGCATGACTACGAGCGGGATGTCCCCGACTACAGCGAGCCAGAGTACTCCATCATCCCTGAGAGCTACGGCCGACCGGAGGACTTCCCTCGCTCGTGCAGCCGCTCCAACACCTTCAACTGTTGA

Protein sequence:

>DPOGS210902-PA
MQFYTELQHLDLSQNHLVSIPMKNFAYQRKLQELHLNHNKISSVTNTTFQGLNSLTVLNLKRNFLEELTNGVFSTLPRLEELNLGQNRISKIEPRAFAGLSALRILYLDDNELSSVPTTSFSLLGSLAELHVGLNAFSFLPDDAFAGLNRLAVLDLNGAGLFNISDFAFRGLPGLRSLNLFGNRLSVVPTQQLSSLTRLEELYIGQNDFIVLESHSFKGLKNLKLIDITGATQLKRIEKGAFEDNINLESIVLTNNKELSTIEDCTLLGLPKLRHVSLRDNAIKVLSESVFVGKELKQLDLTDNPIICNCKILWLQQLLNEKSNFSQVQCASPENLKDKYLKTLTAEDLECVLYDSRRQTIICIVGFACLAVVATLLLILYRYRKSMQEKLKDYKWNKGRKNLEYHKPISTEEDCIVRGIHPSQYPAPPHAPGLRPIPLELYASPSVFFMSGGQGSARRPQRSGGGGTMPANGFTYIGGDGRHHQPQTLNNGAPAHHLNNGSLRSLPDKKNRNGVVCHPENFQRNLDTRYSRKQENGYIRNSETIIGFPRDREREHDYERDVPDYSEPEYSIIPESYGRPEDFPRSCSRSNTFNC-