Monarch geneset OGS2.0

DPOGS203399
TranscriptDPOGS203399-TA4053 bp
ProteinDPOGS203399-PA1350 aa
Genomic positionDPSCF300003 + 977131-991447
RNAseq coverage101x (Rank: top 61%)
Annotation
HeliconiusHMEL0166360.073.39% 
BombyxBGIBMGA012297-TA0.062.27% 
DrosophilaSCAP-PA0.042.68% 
EBI UniRef50UniRef50_F4X7I10.042.48%Sterol regulatory element-binding protein cleavage-activating protein n=6 Tax=Formicidae RepID=F4X7I1_ACREC
NCBI RefSeqXP_394934.20.042.61%PREDICTED: similar to SCAP CG33131-PA [Apis mellifera]
NCBI nr blastpgi|3071769430.043.39%Sterol regulatory element-binding protein cleavage-activating protein [Camponotus floridanus]
NCBI nr blastxgi|3838540020.042.88%PREDICTED: sterol regulatory element-binding protein cleavage-activating protein-like [Megachile rotundata]
Group
Gene OntologyGO:00055153.2e-31protein binding
KEGG pathwayppp:PHYPADRAFT_1135282e-17 
 K12385 (NPC1)maps-> Lysosome
InterPro domain[817-1259] IPR0110463.2e-31WD40 repeat-like-containing domain
[1273-1316] IPR0159435.9e-27WD40/YVTN repeat-like-containing domain
Orthology groupMCL13215 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203399-TA
ATGGCAGGGTCTACATTACCCGAAAAGGTTGCGCAAATATATTACACGTATGGTCTTTTCTGCTCATCTTACCCTGCAGTTGCGGTCGCGCTAGCACTCTCTGTCGTCTTGTTCTGCTGCTATCCACTTCTAAATGTACCACTACCAGGAAACATACCAGTTGTAATTAATATACATAACAAGGATCACCACAAAGTAGTGGACTGCAATGCAAATTGTGATTTTTCAGCAACATCAGTACAATTACAGTTTTTGCACAATGAAACACAAAAACTTCCACATGTGTGGGTGAAAGATAAACCGCTTCTTTATGTACATCAAATTATTATGAGAATAGGTGTGTCACCTTGGAATGACAATCTTAAAATGTGGGATGCGTTCCGTGCACCCCTTCAAGAGACATTTCGTTTACTTGAGGCAGTGAGGAATCATGAAGATCCGGAAACAAAAGAAACTCTGTTACAACACTGTTACCAAGTGGAAGGTATAAAGAGATCTGATAAGGCCACTGTTGAGACGGTACTGCCAGAATATAGCTGTCTCATTCTATCACCGGCTAATCTATGGCAACAGAATATAGACTCGTTCTCCCTTGATACAAACATAGTGAACACAGTTTATACTTATCAGGGACTTCAAAAGGGCAAGGTATCAATCGCTGAGATGGCCTTCGGCTTGCAACTGCGGGACACCGGCATCAAACGTTACCCGCTCAGAGCGAGGCCGAGGGTCTTACAATACGCTGTAACACTCTTCTATCAACATGCTGATGATAAGTTCATCAATTCCCTGACGAAGAAGCTCCGCGATATGTATCCCCTGTACCAGGACACTTCCCAGGCCTACACAGACGATATGACGCTCATATATTACCCGGGAAAGTTCAACTACCATGAACTGGTGCAGCTGATGATAACGTTCGTGGGTCTCTTCCTGTACGTTTATTTTTCGGTACGGAAGATTGAGTTCATTAAGTCCAAACTGGGCTTAGCAGCGTGCGCTGTCCTCAGTATAGCCGGCAGTCTTACAATGTCCACGGGAATATGCTTCTTCTTCGGTTTCTCTCTGAGCTTACAAGGGAAAGAAATCTTTCCGTATCTGGTGATAATAGTCGGGCTCGAGAACGTTCTGGTGCTAACCAAGAGCGTAACGTCGACGGATTCAAGATTGGACGTCAAAATCCGTCTAGCCCAAGGCCTCAGTAAGGAAGGATGGTCCATAACGAAAAATCTCCTCACAGAAATAACAATACTGACAGTCAGCTTCTTCACTTTCGTACCTTTCATACAGGAATTCTCAATTTTCATGATAGTCAGCCTATTAACAGACTACTTCATACAAATGGTGTTCTTCGCTACGATCCTCGGCATAGACGTGCGCAGAATGGAGTTCTATCCTGAGCGCAATTCAAAATTGCACTTGAAGGATTATTTCAATTCGAATAACGCTCTCAACTGGCGTTTCCACAGAGGTTATTCTGATGATAGTAGTTCACCACAAAGAATGACGAAATCCAAGTCCCATCCGCGGCTCAACGGCCTCGTTCAGACAAGTCCGACTGATGTTGTGGCCCAACGGAATCCGAGCCCCGGTTGTCAGGACCACAGAGTTCCTAAACGTATATGGCTTGTGAACATGTGGGCTAGAACACGGCTATTTCAACGAGCTTTCATGGTTTGGATGCTAGTGTGGATATCTATGATAGTGTATGACTCCGGTGTCGTAGATTATTTCATAGGCAGTGTTGAGAACAATGAAATTAACACAGTGCATAGAAAGAATATAAGGAACGAGCCAAAGCAGTCCGTGTATATGACATACAATAATGTGAACAGTACGGGAAGACATCCGCTCGTGTTCTCACCGCTCCTCGATACCTCCGATACAGAGAAGATCGCCATAGCTGCTAACGAGACGATACTATTGAAACACTCGCCATTACATCGACCGTTTTCGAGGGTGTCCCCGTACCATTGGTCCTCGATCCTGTCTCAGTACAATGAGTCGGTTGCGGGTAGATACGTGGCTATTCTGCCGCCGGTGCTAGTTAGTCATCGAGTTGGACCAGAAGTGGCGGTAGGGTTGAGGCATCCCGATGAAAAAGATCCGCCCCCGCTGAGGTGGCAGGCTCTGGCCGCTGCCTTGGATCCCATAGATACTTTACCAGAATTTGACCTCAAAGACGGTAAAGCACAGGCTCATCAGTTGGGGCAAAGTGCTGATTTGCCAATTTACCCCACAACTCCCATGGAAATATTACTCCTGGCCATACTCTGTACCATAAGTGTGGCTGTCATAGCATATATGATGGTAGTCCTATACAGATGCGTTTGTTCGCGCCACTACGCAGAATGGAGGGCGTCCTGGAACGATGACAACTGCCATAATAAGATCATAGCCAAACAGCCGGCTGTTCAGCTAGTGATGGAGGCTGTGCCGTTGGTGGTGGCCGGTCACAGTCAGGAGGTGGAGTGTCTGGTGACGGATGGTGAGAAGGTGGTCAGTTCCTGTCTCCAGGGAAACATCCGCGTATGGGACTCCCTCAACGGTGACCTCATCACTAATATCGATAGAAGCGCATACTTCAAACTTCAATCAGAGTTGTTTGACAGAGTCCACAGTAAAGCGAATTCCTCTGATGAGCATATTACTATTGAAGCATGCAAAAGTCCTGAAGATGTTACCCAAAAACAAAACACCGGACCAAGATTGCGGCGACCTCTGTCCACTCATTTGAGTAATTTACAGTTCAAGCAGAACAACAGCACCGATACCACTTATCTACCAGTATCTGAGAGCACCAAGTATGATTTCGCAAAGGCCTACAGAGATTTGTATCACACTGAACCAAATATATATGATGTCAGTGACAGCTACAGTAGTTTGATTACAAGAGACAATGAAGATGTAAGTGCGTTTAAACCAAAAACCAAAAGTGCTGTGAATAGTGTTAGTATGGATTTTGGTAGGAGTGATGGGTCAGATAATACTGGTGCTAGTTGGAGGGATCAGGCTCTGATGGACACCCCAGTGTGGTGTATGGACTTTTGTAATGATCTTATCATACTTGGCTGTGCCGATGGCAGGCTGGAGTTCTGGGAGGCCAGCACTGGAAAGTTGATGTGTGTGTATCAATCGTCAATGGCTGGTGTGAGTCACGTCCGTGTGCTGTCTTCCGGCAGGCGGGTCCTGGCCGCCAGTCTAACTGGACACCTCTCTCTCCTAAGACTGGACGCTTGCACCAGCTCGGGAGCTCACGTCGACTGGCGCTTCAGCACCGCTCATAGACGAACTCACAAACGGACGGGTTCCGCTGAGTTACTAAGAACCGGCAGCGGGTTCAATGAAAACCGCAAGAGCTTTTCATACGACGCTGACAACAATAACGACGAAGTAGTTTGCGTCCGCGTAGCTCACTGCAGGGCGCACCAGCAACCTATAACTGAACTCCAATCCGAGGGCGGGCGGGTACTTACCGGAGGGCAGGACCACGTGCTCAAGGTGTTCTCAAGTTCGGAGCTGACCGCCCTGTTGACGCTTCACGGACATTGCGGCCCCATAACCAGTTGTTTCATAGATCACGCGACGCCCACGATCGCCGGCAGCGGTTCCCAGGACGGTTTACTATGCGTTTGGGATCTTCATACAGTTGTCTGTATGCATACAATCGAAAGTATTTTTGAAGAGTCGATGTCTTTTTTAGCTGCTACCTCATATCGAGACGAACGACGAGACCTTTGTGTGCTGTCTGTTGATAATAAAATATTTAGTTTACACTCGACCTTACGGCTACGTGGTCTGAACTACATGAGTCGAATGCTGCCTCTCACTCACACACTGCTTGTGATGGGTGACCGCAGCGGATTGACAGCATACGATCTTAGCAGCGGGGATATTATACGAAGAGTCATGTTTGTTACAGGCCAGAGCGACGGCTGTATATTCGTACGTCAGATACTCCCGCTAAAGGACGCCATAGTCTGTGACTACGCGAACCAGCTTAGAATAGTCCGCTTCCCCTTAGTGTCGAAAATGGATATGAAGAATGAATAG

Protein sequence:

>DPOGS203399-PA
MAGSTLPEKVAQIYYTYGLFCSSYPAVAVALALSVVLFCCYPLLNVPLPGNIPVVINIHNKDHHKVVDCNANCDFSATSVQLQFLHNETQKLPHVWVKDKPLLYVHQIIMRIGVSPWNDNLKMWDAFRAPLQETFRLLEAVRNHEDPETKETLLQHCYQVEGIKRSDKATVETVLPEYSCLILSPANLWQQNIDSFSLDTNIVNTVYTYQGLQKGKVSIAEMAFGLQLRDTGIKRYPLRARPRVLQYAVTLFYQHADDKFINSLTKKLRDMYPLYQDTSQAYTDDMTLIYYPGKFNYHELVQLMITFVGLFLYVYFSVRKIEFIKSKLGLAACAVLSIAGSLTMSTGICFFFGFSLSLQGKEIFPYLVIIVGLENVLVLTKSVTSTDSRLDVKIRLAQGLSKEGWSITKNLLTEITILTVSFFTFVPFIQEFSIFMIVSLLTDYFIQMVFFATILGIDVRRMEFYPERNSKLHLKDYFNSNNALNWRFHRGYSDDSSSPQRMTKSKSHPRLNGLVQTSPTDVVAQRNPSPGCQDHRVPKRIWLVNMWARTRLFQRAFMVWMLVWISMIVYDSGVVDYFIGSVENNEINTVHRKNIRNEPKQSVYMTYNNVNSTGRHPLVFSPLLDTSDTEKIAIAANETILLKHSPLHRPFSRVSPYHWSSILSQYNESVAGRYVAILPPVLVSHRVGPEVAVGLRHPDEKDPPPLRWQALAAALDPIDTLPEFDLKDGKAQAHQLGQSADLPIYPTTPMEILLLAILCTISVAVIAYMMVVLYRCVCSRHYAEWRASWNDDNCHNKIIAKQPAVQLVMEAVPLVVAGHSQEVECLVTDGEKVVSSCLQGNIRVWDSLNGDLITNIDRSAYFKLQSELFDRVHSKANSSDEHITIEACKSPEDVTQKQNTGPRLRRPLSTHLSNLQFKQNNSTDTTYLPVSESTKYDFAKAYRDLYHTEPNIYDVSDSYSSLITRDNEDVSAFKPKTKSAVNSVSMDFGRSDGSDNTGASWRDQALMDTPVWCMDFCNDLIILGCADGRLEFWEASTGKLMCVYQSSMAGVSHVRVLSSGRRVLAASLTGHLSLLRLDACTSSGAHVDWRFSTAHRRTHKRTGSAELLRTGSGFNENRKSFSYDADNNNDEVVCVRVAHCRAHQQPITELQSEGGRVLTGGQDHVLKVFSSSELTALLTLHGHCGPITSCFIDHATPTIAGSGSQDGLLCVWDLHTVVCMHTIESIFEESMSFLAATSYRDERRDLCVLSVDNKIFSLHSTLRLRGLNYMSRMLPLTHTLLVMGDRSGLTAYDLSSGDIIRRVMFVTGQSDGCIFVRQILPLKDAIVCDYANQLRIVRFPLVSKMDMKNE-