Monarch geneset OGS2.0

DPOGS206804
Transcript        DPOGS206804-TA  1995 bp
Protein           DPOGS206804-PA  664 aa
Genomic position  DPSCF300001 - 4159326-4182964
RNAseq coverage   574x (Rank: top 22%)
Annotation
Heliconius      HMEL017853        1e-142  99.59%
Bombyx          BGIBMGA000628-TA  0.0     91.29%
Drosophila      Best2-PA          0.0     65.09%
EBI UniRef50    UniRef50_B4N3X8   0.0     65.37%  GK13577 n=2 Tax=Drosophila RepID=B4N3X8_DROWI
NCBI RefSeq     XP_392428.2       0.0     67.41%  PREDICTED: similar to Bestrophin 2 CG10173-PA [Apis mellifera]
NCBI nr blastp  gi|328790167      0.0     67.80%  PREDICTED: hypothetical protein LOC408898 [Apis mellifera]
NCBI nr blastx  gi|328790167      0.0     68.24%  PREDICTED: hypothetical protein LOC408898 [Apis mellifera]
Group
KEGG pathway 
InterPro domain   [2-528]  IPR000615  1.2e-239  Bestrophin
                  [1-320]  IPR021134  5.2e-103  Bestrophin/UPF0187
Orthology group   MCL11894  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206804-TA
ATGACTATATCGTACGCCGGTGAAGTTCCAAATGGCAGCAGTTTTGGATGTTTCTGGCGGATCCTCTGCAAATGGCGTGGCAGCGTTTACAAGTTGGTGTGGCGGGAGCTGCTGGCGTACCTCACTCTCTACTATACCATCAACTTGCTGTATCGGTTCGCGCTCACCGAGCATCAACAAAGAATATTCGAGAAGGTTCGACAGTACTTCGGCGCTCAGAGCGAGTCTATTCCGATGTCCTTCGTCCTGGGTTTCTACGTAAGCCTGGTAGTGAAGCGTTGGTGGGAGCAGTACAAGTTGCTGCCTTGGCCGGATACACTCGCGCTGTTCATCTCCGCGGGCATCCCCGGAGCGGACGAAACCGGGCGGCTGATGAGGCGGAATATAGTCAGATACGCCATTTTGGCATACGTGATCACCTTGCAGCGAGTCTCACTCAGGGTCAAGAGACGGTTCCCCACGTGGCAGCACGTTGTGGACTCCGGTCTTATGTTGGAGAGTGAGAGAAAGGTATTCGAGAAGATGGACGGTAAGAGCCCAATGTCTAAATACTGGATGCCCCTGGTGTGGGCGACGAATATCATCAACAGGGCGCGGAAGGAAGGCTTGATCACCAGCGACCACATCGTGCAAACTCTCTTGGTGGAGCTGTCTGACATCAGGCGACGGTTGGGAGCGCTTATCGGGTACGACACTGTGTGCGTGCCTCTCGTCTATACACAGGTTGTGACATTAGCCTTGTACACATACTTTGTGGCAGCACTGATGGGTCGGCAGCTGGTACCACCTGCTCCAGGTAGTACCTCCAAATACGAACCAGATGTTTACTTCCCGTTATTCACAGCCTTACAATTTTGTTTCTACGTAGGCTGGCTAAAGGTTGCGGAGGTTCTTATAAACCCATTCGGCGAAGATGACGATGACATTGAGCTTAACTGGTTAATCGACAGACACATAAAGGCGGCATACATGATAGTGGATGAAATGCACGAGGAACATCCAGAACTTCTCAAAGACCAGTACTGGGAGGAAGTAGTTCCAAAGGACTTGCCGTACACTGTTGCCTCAGAGCATTACCGCCGCCACGAGCCGCCCTGCTCCGCTGACCACTACAAAGTAAAAGCAGAAGACGCTGTATACGCCAATGTACAAGCACCTAGAAAAAGTCATGACGAAACATATGCTGATTATGAGAGTGTAGATACACCATTGGTAGAACGACGAAAAAACTGGTTTCAGAGACAGATATCAAGAATGGGTTCAGTTCGATCGGCTTCGACTGATTATTCTTCTGGTGGGCTATTTGGAAGAAACCGTCATAATTCCATGGTGTACTCCAGTCCAGAGACGGGCCAGCCCACTGCACCGCCTCAACAACACCATAAGATGTCATTGTACGAGAGACTCGTGGGCAGGAAGTCAGGGAGAGGACAGCATCGACAGAATTCTAAACATGGCAGTCAAAAGAGTAATGGTTCAGCAATACCGATAACATTACGCAATCGCCCACGTATTCCTACACCGGACGTGACCAAAGAAGTTATGGACCGTGAGAACCGCATTACAATGGGGATGCAGAATATGGGGGTAATTATGGCTCACCAGGGCTACCAAAACGAGGTGCCTGTTTTAGGAGCTCTCGTGCTCTCTCCAATCCAAGAGCTTGACAGTGGCTCTGTCAACAACACCTTGCATGCCGGACAGCCTGGAACTACAGCACTAGCACAAGCTGTGCTGTCTCCTGGTCTGACTACAGGATTAACTCCAATGCTGACAGCAGCTCCTGTGGTGTCGCCGGTAACATTGTCCCCCATGGGGGTCTCACAACTTCTCGGTAGTTCGACACCATCGACACCTCGCGCAGATCGCACACCGCCTGCGCCGCGTGCCACGGTCACTGAGCTTCCGTCTGATAGCGAACACAGCGGGTCAGGGACGCCGCCGGAATTCAATAGAAAGCCAAACGGCTCTAAACGAGGAGAAGTTTATGTATAA

Protein sequence:

>DPOGS206804-PA
MTISYAGEVPNGSSFGCFWRILCKWRGSVYKLVWRELLAYLTLYYTINLLYRFALTEHQQRIFEKVRQYFGAQSESIPMSFVLGFYVSLVVKRWWEQYKLLPWPDTLALFISAGIPGADETGRLMRRNIVRYAILAYVITLQRVSLRVKRRFPTWQHVVDSGLMLESERKVFEKMDGKSPMSKYWMPLVWATNIINRARKEGLITSDHIVQTLLVELSDIRRRLGALIGYDTVCVPLVYTQVVTLALYTYFVAALMGRQLVPPAPGSTSKYEPDVYFPLFTALQFCFYVGWLKVAEVLINPFGEDDDDIELNWLIDRHIKAAYMIVDEMHEEHPELLKDQYWEEVVPKDLPYTVASEHYRRHEPPCSADHYKVKAEDAVYANVQAPRKSHDETYADYESVDTPLVERRKNWFQRQISRMGSVRSASTDYSSGGLFGRNRHNSMVYSSPETGQPTAPPQQHHKMSLYERLVGRKSGRGQHRQNSKHGSQKSNGSAIPITLRNRPRIPTPDVTKEVMDRENRITMGMQNMGVIMAHQGYQNEVPVLGALVLSPIQELDSGSVNNTLHAGQPGTTALAQAVLSPGLTTGLTPMLTAAPVVSPVTLSPMGVSQLLGSSTPSTPRADRTPPAPRATVTELPSDSEHSGSGTPPEFNRKPNGSKRGEVYV-
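
Consistency check (illustrative sketch):

The nucleotide and protein records above can be checked against each other by translating the coding sequence. The Python sketch below is one minimal way to do that. It assumes the standard genetic code and that the record writes the stop codon as `-`, as the protein sequence above appears to do. The names `translate` and `CODON_TABLE` are hypothetical helpers chosen for this example and are not part of the OGS2.0 annotation pipeline.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 0, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven codons and last five codons of DPOGS206804-TA, copied from above.
first_codons = "ATGACTATATCGTACGCCGGT"
last_codons = "GAAGTTTATGTATAA"

print(translate(first_codons))
print(translate(last_codons))
```

Applied to the full DPOGS206804-TA sequence, the same function should yield 665 symbols: 1995 bp is 665 codons, which corresponds to the 664 aa listed for DPOGS206804-PA plus the terminal stop.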