Monarch geneset OGS2.0

DPOGS216144
Transcript: DPOGS216144-TA (1482 bp)
Protein: DPOGS216144-PA (493 aa)
Genomic position: DPSCF300214 - 258857-264411
RNAseq coverage: 39x (Rank: top 73%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL002673        5e-71    38.52%
Bombyx          BGIBMGA010265-TA  3e-146   60.60%
Drosophila      CG17739-PA        2e-54    30.23%
EBI UniRef50    UniRef50_D2A0T9   1e-57    30.94%    Putative uncharacterized protein GLEAN_07204 n=2 Tax=Tribolium castaneum RepID=D2A0T9_TRICA
NCBI RefSeq     XP_971627.2       8e-57    30.74%    PREDICTED: similar to f-spondin [Tribolium castaneum]
NCBI nr blastp  gi|350416405      4e-58    34.17%    PREDICTED: spondin-1-like [Bombus impatiens]
NCBI nr blastx  gi|350416405      4e-59    33.81%    PREDICTED: spondin-1-like [Bombus impatiens]
Group: (none listed)
KEGG pathway: (none listed)
InterPro domain: [21-161] IPR002861, 1.3e-14, Reeler domain
Orthology group: MCL30736 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216144-TA
ATGTATATTATCCTCGTAATACTGTTATCGGCTACATTAGCATCTGCCCAGCCTGGCAAATGTGATCAAACGCCGCCCCAAGCCACGATTTTGACGCCTAGCGAAAATAAAGGCCTTTTCAAAATGACCATCGACTCGAAGGGCGACGATTCCAATATCTATAGACCCGATCAGACTTATATACTTATTCTAACAACAAGCAACACGACCCGCCCATTCCGATGGTTCATGATAACTGTGGAAGATCCGGACATCGACAAAGGTGCTTTTGAATTCGACCTGAGATCAGAAGACGTGGGGAGCTTGAAGACGACTAACAGACAAGCGAGATACAGTGAACGATGTTACAACTCTGTCGAGAACACTGACAATTCGGATAAATATCGAGTTGAGATACATTGGGTCTCTCCAAAACAATATGAGGGCAAGGAGAAAGTACGTCTTCGGGCTATGATAGCTGAGAATGGAGAGTCATGGTATGTTGGCGAAAATCTAACGATTGAGTTGAAGAAGGATGATCAGAGAGCTTTGGACAGCCCCCCGTTTGATCCTGTGGACCCATGTAATTTATGCAGCGAAGCCAGATATGAGGTGATATTTAAGGGTCGTTGGTCTAGAATGTCTCATCCTCTTCATTATCCAAGTAGACCTGATGACAACAGCTACAGCCACATGGTTGGAGCTTCCCACGCATATAAATACCTTCTGTGGAAGCCTGGAGACAAAGCCAGTCTCGGCTTAAAGAGATTGGCCGAGGACGCAAACGTTACGGAAATCGAAAGGGAAATAATAAAGGCGATGTCGAAGGAAAATGGTACTAGGACTTTGATCAGAGGAAAACGTCGCCGTCACCCACACATGTTTGAACCAGCACATTCTTTATTCAGAGTGGATCGTGTACACCATTTATTCTCATTAGTGGTTGCTATGAAGCCGTCACCGGATTGGTTCTTAGGAGTGTACAATTTCGAACTATGTACGAAAGAAGGTTGGTTAGAGGACTACGAAATACCATTGTATCCATGGGACGCTGGCACTATGGACGGAGTTTCATACGAGTCGCCTAGATCAGTGACGCAGCCAGTCGATAATGTAGAGAGAGTGGCGGTCGGGTCATTTGATGCGGATTCCCCGTTTTATCAGTTGAACTTAAACGATCTGAAGGCTTTCGCTAAGCTTCAAGTTACGAGACTTGATGTCTACCCTCTGGTCGGAGTTGACTGTGAAGGTACAAACGAGGAGGGTCCCCAGGAAGAAGGGGAAAAACAAAATGCGGAGAACATCGCAGAACCACTTCTGCTTGAATCGAGACAATACTTAGACCAAAAGCAATGCGCCCTAGGCAAGTGGAACGAATGGTCTCCTTGTGTACCAGACTCAGGGAATTGTGGCCCCGGGACTCAGCTGAGGACGCGAAACAAACAACGACATTACAACGTAAGGGCATTACTGTTTAATTGTGTTGAAGAATGTCAATTATAA

Protein sequence:

>DPOGS216144-PA
MYIILVILLSATLASAQPGKCDQTPPQATILTPSENKGLFKMTIDSKGDDSNIYRPDQTYILILTTSNTTRPFRWFMITVEDPDIDKGAFEFDLRSEDVGSLKTTNRQARYSERCYNSVENTDNSDKYRVEIHWVSPKQYEGKEKVRLRAMIAENGESWYVGENLTIELKKDDQRALDSPPFDPVDPCNLCSEARYEVIFKGRWSRMSHPLHYPSRPDDNSYSHMVGASHAYKYLLWKPGDKASLGLKRLAEDANVTEIEREIIKAMSKENGTRTLIRGKRRRHPHMFEPAHSLFRVDRVHHLFSLVVAMKPSPDWFLGVYNFELCTKEGWLEDYEIPLYPWDAGTMDGVSYESPRSVTQPVDNVERVAVGSFDADSPFYQLNLNDLKAFAKLQVTRLDVYPLVGVDCEGTNEEGPQEEGEKQNAENIAEPLLLESRQYLDQKQCALGKWNEWSPCVPDSGNCGPGTQLRTRNKQRHYNVRALLFNCVEECQL-
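
The transcript and protein records above can be cross-checked against each other: 1482 bp is 494 codons, which is 493 amino acids plus one stop codon. The sketch below is a minimal illustration of that check, assuming the standard genetic code and assuming that the trailing "-" in the protein record marks the stop codon. The `translate` function and its `stop_symbol` parameter are hypothetical names introduced here; they are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to match the
    "-" that ends the protein record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First seven codons and last four codons of DPOGS216144-TA.
print(translate("ATGTATATTATCCTCGTAATA"))  # MYIILVI
print(translate("TGTCAATTATAA"))           # CQL-
```

Passing the full DPOGS216144-TA sequence to `translate` should, under the same assumptions, give a 494-character string: the 493 residues of DPOGS216144-PA followed by "-".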