Monarch geneset OGS2.0

DPOGS211122
TranscriptDPOGS211122-TA2733 bp
ProteinDPOGS211122-PA910 aa
Genomic positionDPSCF300007 - 428907-433188
RNAseq coverage124x (Rank: top 57%)
Annotation
HeliconiusHMEL0124190.053.43% 
BombyxBGIBMGA002998-TA3e-8057.83% 
DrosophilaTango6-PA8e-2228.69% 
EBI UniRef50UniRef50_E2A1034e-8229.82%Transmembrane and coiled-coil domain-containing protein 7 n=8 Tax=Formicidae RepID=E2A103_CAMFO
NCBI RefSeqXP_001814851.13e-7627.85%PREDICTED: similar to rCG51257 [Tribolium castaneum]
NCBI nr blastpgi|3838502855e-9331.57%PREDICTED: uncharacterized protein LOC100882243 [Megachile rotundata]
NCBI nr blastxgi|3838502855e-9831.57%PREDICTED: uncharacterized protein LOC100882243 [Megachile rotundata]
Group
Gene OntologyGO:00054881.1e-17binding
KEGG pathwaymbr:MONBRDRAFT_251052e-18 
 K01763 (SCLY)maps-> Selenoamino acid metabolism
InterPro domain[288-887] IPR0160241.1e-17Armadillo-type fold
[638-881] IPR0119891.6e-07Armadillo-like helical
Orthology groupMCL17517 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211122-TA
ATGTCCGACGTTAATTGTATATTTAAGCAAATAGAAAAGATTCTGAAAATTGACACAAACACCGAGTTCATGGTTGCGGTTTTCAATGAAATCATCAAGTGTAATAGCAGCTTGAACGAAAATGATATATTTGACGTTTTAAGGACATTTCTAAACAACATAATCAAAGAAATCGATGAACTTGGTACTATCATCAAAAACAACGATGGTGTCAGTATCAGTGTCAAAAATCAAAAAATGTTGCGTACCTGTTATCAAATAATTACCTCGTTTGGTATATCATCATGTTTATTGCCTGGTTTAGGAATCAGCCTGTCTAAGAGGTGTGCCACTGCGAAGTCACTGCCAACATTATCACTTAAAGATACAGAGAAATATGAACTACTAGTTTGGTGTACAGATTTCTTGTCCAGAAGTTATGAAGTACCAGTACTTAAAAATATTATATTAACTTTTCATTTGTCCGATTATCTGGCAGCACTTATACAATTAGCCTTTGCCCCTTTGAAGAAACCTGGAGATTATTCAAATTTCACTATGACTCAAGAGATGTACGACAAGTTATTGTTTGATAAGCAGAAATATATAAAAACTTATGAGTACTTAGTGAATAATTGCTTCCAGCCAATGTTAATGAAGGAACTTTTAGTATTACAGAATATAACAGAGCACCCACCTCCAATGTTTGCTAAGAAAGTTATTTCAAAGGAAATGAGTAAGCGTCTCACAACTTGGGGTGGGTTACTCAGCCTGATTAGATGTTTCATCGAAAGCCACGAGGTCGACGTCGGTGTCGAGTGGAAGAAAATAGAAATGATCTGTAAAATAGTTACTTGTCGGCACTTGAACTTGAGTGAAGAGGACTATTTGAGTAACATTGTCTCGCAGTTAAGGCATATATACACTATGAATAATAAACATTACTTGATAACAGCATCTTCGTGCTTGTTAAGCCTATATACAAAATACAATAAATCCACATCAGTGATAAATTTGCTCAATGAAGTCTTTGGTAGTTTTGATCATGAAGCCTTATTGGCCGATGCTTTACCGGGAACTATTATATTGGTGTCACAGCAAGTCCAACACAACATACAGATCCTTCAGGCATGTACAGCTATAACGCAATATGAGCTACCAATACAAATGTCCAAGAACTTGTATGTGCTGTATCTCTTAAGGCTCAACTGTACAAAAACTGAAATGAAGCTCAAATTAAATGACATTATACTCAAAATTATGGAGTTGTTAAATAAAAGTGAGATAAAAATTGTAATTGAACAAATTCTTTTTGGACTGAACAATCACAATTCTCATAAAATCATAGCTAAAGAATACGAGTCCGGTCTTTCAGTGAAATGTGTAACAGCGGACTTTGAATATCCCAGCGACGAGGCAGTTATATATTTTATAGAAATGTTTAATTTAATAACGAACAATGATGTCGTTTGTAACATATTCGAAGCGTGCTTATTAAAATTTATTGAATTGAATAAAGAAAACGAAACATGCGATAAAGAGGCTTTTCTGTTGGTCGAAGATGAGCCTGAGGTGCTTAACTCGGTTAGCAAAAAGTGTGCTCATATGCTTCATATCCTATCCGAAATATCAGCAACCGAGAAGGTTATAACCATTTTGAAAGACAAACCACTTCTTGTGCTAGATTTCGTTGAATCATTATTACTAAATAATATAAACCCAATAAATGATGAGTGTTGTACTATTGCTCTTGTTCTTCTCAATACTATTGTAGCTAACATTGAGAAAACCGAGGACATACAAACAAGACTTAATGGCCTGATGCCGAGACTGAAACAATTGTCAGGGGAAAATTCTTCGTATGTAAATGTTTTGTCCAAGGAAACGTTGTCTTTGATCGAAATGGAATGTCCAAAAGCTGATAAAAGTGCTTACGAAAAAGCAGTGTCCAATATTTATGACAAGTTACTGCCAGTGCGAGTTCATGGGGTCATTGAACTCACTAAACTAATCGACAGATCAGATGTGGAGACCATTTCTAAGAGACATTTCATATTTTGTCTCTTCCAGGAACAACTCAGACATCCCGATTCCTATATGTATCTAGCATCAGTGAACGGTATAGCCTCGCTAGCTATGCACTGCACCGCTGAAGCGTTGTCTATACTTTGTAGAGAATATTTAGAAGTTTCTCCGGATATCAGAAATAATGAGAGTGAAAACCAAAACGCAGAACTCAGAATGAAAATAGGAGATGTTATAGTTAAAGTTACGAGGAGACTAGGCGAAATGGCGGTAGTTCATAAAACGATTTTACTTAACACGATGCTGTGCGCCTGCAAGGATGATGACCCGCTGATAAGAACGTCGGCTCTATCGAATCTAGCAGAAATAGCATTGGTTTTGAATTACAAAATCGGTTCTATTTTATACGAATTATTGCTATGTGTTTGGGACGTTATAAATGGGGATCCGGCGTTGGAATGTCGGAGAGCAGCTGTGATGGTTTTGGCCAACCTAATCAAGGGTCTTGGCAAAGATACCTTAGTAGAACTAAACGATACTCTACTACCCATCTATAAGACTCTCCTCAAACTCTACAAAGATGATGACGAGGATTCGTTAGTTAGGCTCCACTCGCAAATCGCTCTAGAGGAACTAAATGACATCGTCAAAGGATTCCTCACACAATGTCTTCCGTACGAAAAAGAAATTTCACTAACAACGACCCCCAATAATATAATTTTCAAATAA

Protein sequence:

>DPOGS211122-PA
MSDVNCIFKQIEKILKIDTNTEFMVAVFNEIIKCNSSLNENDIFDVLRTFLNNIIKEIDELGTIIKNNDGVSISVKNQKMLRTCYQIITSFGISSCLLPGLGISLSKRCATAKSLPTLSLKDTEKYELLVWCTDFLSRSYEVPVLKNIILTFHLSDYLAALIQLAFAPLKKPGDYSNFTMTQEMYDKLLFDKQKYIKTYEYLVNNCFQPMLMKELLVLQNITEHPPPMFAKKVISKEMSKRLTTWGGLLSLIRCFIESHEVDVGVEWKKIEMICKIVTCRHLNLSEEDYLSNIVSQLRHIYTMNNKHYLITASSCLLSLYTKYNKSTSVINLLNEVFGSFDHEALLADALPGTIILVSQQVQHNIQILQACTAITQYELPIQMSKNLYVLYLLRLNCTKTEMKLKLNDIILKIMELLNKSEIKIVIEQILFGLNNHNSHKIIAKEYESGLSVKCVTADFEYPSDEAVIYFIEMFNLITNNDVVCNIFEACLLKFIELNKENETCDKEAFLLVEDEPEVLNSVSKKCAHMLHILSEISATEKVITILKDKPLLVLDFVESLLLNNINPINDECCTIALVLLNTIVANIEKTEDIQTRLNGLMPRLKQLSGENSSYVNVLSKETLSLIEMECPKADKSAYEKAVSNIYDKLLPVRVHGVIELTKLIDRSDVETISKRHFIFCLFQEQLRHPDSYMYLASVNGIASLAMHCTAEALSILCREYLEVSPDIRNNESENQNAELRMKIGDVIVKVTRRLGEMAVVHKTILLNTMLCACKDDDPLIRTSALSNLAEIALVLNYKIGSILYELLLCVWDVINGDPALECRRAAVMVLANLIKGLGKDTLVELNDTLLPIYKTLLKLYKDDDEDSLVRLHSQIALEELNDIVKGFLTQCLPYEKEISLTTTPNNIIFK-