Monarch geneset OGS2.0

DPOGS200308
TranscriptDPOGS200308-TA1344 bp
ProteinDPOGS200308-PA447 aa
Genomic positionDPSCF300026 - 126506-131315
RNAseq coverage477x (Rank: top 26%)
Annotation
HeliconiusHMEL0134840.090.95% 
BombyxBGIBMGA005584-TA0.080.26% 
DrosophilaSans-PB2e-9743.49% 
EBI UniRef50UniRef50_Q7QIQ91e-9742.91%AGAP007027-PA n=3 Tax=Diptera RepID=Q7QIQ9_ANOGA
NCBI RefSeqXP_001120815.12e-10345.60%PREDICTED: similar to CG13320-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3407251112e-10345.93%PREDICTED: Usher syndrome type-1G protein homolog isoform 1 [Bombus terrestris]
NCBI nr blastxgi|201299392e-9743.55%sans ortholog, isoform A [Drosophila melanogaster]
Group
Gene OntologyGO:00055151.1e-12protein binding
KEGG pathway 
InterPro domain[2-120] IPR0206831.5e-24Ankyrin repeat-containing domain
[371-434] IPR0109931.1e-12Sterile alpha motif homology
[374-431] IPR0211293.6e-11Sterile alpha motif, type 1
[370-435] IPR0016604.1e-11Sterile alpha motif domain
[380-435] IPR0137613e-10Sterile alpha motif-type
[32-61] IPR0021104.5e-06Ankyrin repeat
Orthology groupMCL11590 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200308-TA
ATGACTCCGACCTTGTGGGCTGCCTTTGAAGGTCACATAGAGGCACTGCGACTACTCTGTGGCAGAGGGGGTGAACCTGATAAATATGATTACTTCGGCAACACAGCCCTCCACCTAGCAGCTGCACGCGGTCATAAGGAATGCGTGACGTTTTTGGTAAACTTCGGTGCCAACCTGTACGCCATGGACGTAGACGGTCACACGGCCCAGGAGCTGGCCGCCATCAACGGAAGAGATGATATACTGCGTTTCCTCGACCAAACTATCGGCAAACTGGAAAATAACGACAAAAAAAAAGCGAAGTCCCTCAAAGAAAAGGCAAAGAAGGATCACGAGAAACTTCAAAAGCAGTACACTAAGAGGCAGAGTAAGGCGGAGGTGATGGCGGACAAAGAATTAAAGAAACTAGCCAAGGAATGGGACCACGGATACAACGAAGAAATAACGACCATGCCGCATAGACCAAGCAACGTGTTGCTGGCTCTGAAACAGAAAATGACACGCTCCTCGAGTCAAGGTAATCTTCTGGACGATCCTCGTCCGACGTACAGCGCGCTAGTGGGCACGGTGTCGTCAGGGGCTCGAGGCCGAGGCGCCGTCTACAAGAAAGCTCTCGCCAGCAAACTCAAGAACGGCACCCTTGGGAAAACCAGCGTTAGAGACGACTTCAAGGTAGGCGAGGTAGAGACGACAGGTCGTCGCTCGGTGACGTCATTGAGCGGAGTGCGTCGCGACTCAGAGGTCATGTACGTCGGCACCTTCGGAGCCGGTCCTCAACAAAGGGCGCCCGTCGCTGATGTCTTCACTGACAAACCATTACTCACCAGATCAGCGAGTCAACCCGACTTCTTGGCGGCGCAACAGGGGGAAGACAGCGGCATCGGACAGGAAGTGCTGCTGCAGGAACCGGCCAGTATATTTGACAGACCCGGGTTTGGTAGTGTTGCGTTTAGACGTTCCATAACAGCCACACTGAGCGCGATGCCGGCCAGCGAGGAGTTGTCCATAGGATCCGCGGGCTCCCTTGCAAGACACGCTTACCAACCAGCTGAGTGGGCCTCTACACAGTCAGGGAGTTCCACTATAACATCCGACGAGGAACCCGAGGCGGATGACACGGGATACTCGTCACTCGAACGCTTCTTGACGGCGTGGGGTCTGTCACAGTACATCCAGAAGTTCAAGGACGAGCAGATCGACCTTGACGCGCTGATGCTTCTCACCGAGAGCGACATGAAGAGCCTCGGGCTGCCGCTGGGACCGTACCGAAAGTTGGTCACAGCTGTTCAGGAGAGGAAGCAGGCTCTATCCCAACCGGGCCCCATGATAGATACCGCTATATAG

Protein sequence:

>DPOGS200308-PA
MTPTLWAAFEGHIEALRLLCGRGGEPDKYDYFGNTALHLAAARGHKECVTFLVNFGANLYAMDVDGHTAQELAAINGRDDILRFLDQTIGKLENNDKKKAKSLKEKAKKDHEKLQKQYTKRQSKAEVMADKELKKLAKEWDHGYNEEITTMPHRPSNVLLALKQKMTRSSSQGNLLDDPRPTYSALVGTVSSGARGRGAVYKKALASKLKNGTLGKTSVRDDFKVGEVETTGRRSVTSLSGVRRDSEVMYVGTFGAGPQQRAPVADVFTDKPLLTRSASQPDFLAAQQGEDSGIGQEVLLQEPASIFDRPGFGSVAFRRSITATLSAMPASEELSIGSAGSLARHAYQPAEWASTQSGSSTITSDEEPEADDTGYSSLERFLTAWGLSQYIQKFKDEQIDLDALMLLTESDMKSLGLPLGPYRKLVTAVQERKQALSQPGPMIDTAI-