用于生产芳香醇的方法
文献发布时间:2023-06-19 09:24:30
本发明是2014年9月19日申请的发明名称为“用于生产芳香醇的方法”的第201480051538.3号发明专利申请的分案申请。
技术领域
本领域涉及细胞色素P450s,及其生产倍半萜醇的用途。
背景技术
萜烃(比如α-檀香萜和β-檀香萜)通过生化途径(例如通过遗传改造的细胞)来生产。这些萜和源于该萜的醇是檀香油的主要组成成分,并且该醇是通常通过蒸馏檀香属物种(比如檀香木)的心材可商业获得的重要的香料成分。作为该醇的具体例,包括α-甜橙醇(α-sinensol)、β-甜橙醇(β-sinensol)、α-檀香醇、β-檀香醇、α-反- 香柠檬醇和表-β-檀香醇。虽然发展出了新型生化途径(包括基因工程细胞)来生成萜烃,但仍期望发现一种生化途径来生成并生产源于檀香萜的醇。更进一步期望使用一种生化途径,其不仅能生成该醇,并且更期望能够通过该生化途径来选择性地生产该醇的顺式异构体,比如α-异甜橙醇(iso-α-sinensol)、β-异甜橙醇、(Z)-α- 檀香醇、(Z)-β-檀香醇、(Z)-α-反-香柠檬醇和(Z)-表-β-檀香醇。
细胞色素P450s代表的是氧化酶类的酶族。P450s常用来催化单加氧酶反应。基于氨基酸序列的同源性,将细胞色素P450s酶分类为族和子族。相同子族的成员共享超过55%的氨基酸序列同一性并且具有通常相似的酶活性(底物和/或产物的选择性)。 CYP71AV1(NCBI登录号ABB82944.1,SEQ ID No.51和52)和 CYP71AV8(NCBI登录号ADM86719.1,SEQID No.1和2)是CYP71AV子族的两个成员,并且共享78%的氨基酸序列同一性。 CYP71AV1已被证明可氧化紫穗槐二烯(Teoh et al,FEBS letters 580 (2006)1411-1416)。CYP71AV8已被证明可氧化(+)-朱栾倍半萜、大根香叶烯A和紫穗槐二烯(Cankar et al,FEBSLett.585(1),178-182 (2011))。
已经报道有:在使用工程细胞的工序中,使用萜合酶来催化生产二萜或倍半萜。使用细胞色素P450多肽对该二萜或倍半萜进行进一步处理,从而催化由细胞产生的二萜或倍半萜的羟基化、氧化、脱甲基化或甲基化。
发明内容
本发明提供一种用于生产倍半萜醇的方法,包括:
i)使式(I)的萜烯与多肽接触,
该多肽具有与从由SEQ ID NO:2,SEQ ID NO:4,SEQ ID NO:6, SEQ ID NO:8,SEQID NO:28,SEQ ID NO:30,SEQ ID NO:32,SEQ ID NO: 34,SEQ ID NO:36,SEQ ID NO:38,SEQ ID NO:40,SEQ ID NO:42,SEQ ID NO:44,SEQ ID NO:50,SEQ ID NO:52,SEQ ID NO:54,SEQ ID NO:58, SEQ ID NO:60,SEQ ID NO:62,SEQ ID NO:64,SEQ ID NO:66,SEQ IDNO: 68,SEQ ID NO:71,SEQ ID NO:73,SEQ ID NO:79和SEQ ID NO:81构成的群组中选出的多肽至少有约45%序列同一性的氨基酸序列;以及
ii)任选地分离上述醇,该醇中,R是由9个碳构成的饱和的、单不饱和或多不饱和脂肪族基团,并且R可以是支链或由一个或多个非芳族环组成。
本发明更进一步提供一种用于生产倍半萜醇的方法,该倍半萜醇包括α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反-香柠檬醇和表-β-檀香醇、澳白檀醇(lancelol)和/或它们的混合物,该方法包括:
i)使α-法呢烯、β-法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯、表-β-檀香萜和/或β-甜没药烯与多肽接触,从而生产醇,该多肽具有与从由SEQ ID NO:2,SEQ ID NO:4,SEQID NO:6,SEQ ID NO:8, SEQ ID NO:28,SEQ ID NO:30,SEQ ID NO:32,SEQ ID NO:34,SEQID NO: 36,SEQ ID NO:38,SEQ ID NO:40,SEQ ID NO:42,SEQ ID NO:44,SEQ ID NO:50,SEQ ID NO:52,SEQ ID NO:54,SEQ ID NO:58,SEQ ID NO:60, SEQ ID NO:62,SEQ ID NO:64,SEQ ID NO:66,SEQ ID NO:68,SEQ ID NO: 71,SEQ ID NO:73,SEQ ID NO:79和SEQ IDNO:81构成的群组中选出的多肽至少有约45%序列同一性的氨基酸序列;
ii)任选地分离上述醇。
本发明还提供一种生产α-甜橙醇、β-甜橙醇、α-檀香醇、β- 檀香醇,α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α-檀香萜、β- 檀香萜、α-反-香柠檬烯、表-β-檀香萜和/或β-甜没药烯与具有P450s 单加氧酶活性的多肽接触,所生产出的倍半萜醇包含至少约36%的顺式异构体。
本发明更进一步提供一种分离的多肽,其具有单加氧酶活性并且包含氨基酸序列,该氨基酸序列与从由SEQ ID NO:71和SEQ ID NO:73构成的群组中选出的氨基酸序列有至少约45%、50%、 55%、50%、65%、70%、80%、90%、95%、98%或更多的同一性。
本发明更进一步提供一种分离的多肽,其具有单加氧酶活性并且包含氨基酸序列,该氨基酸序列与从由SEQ ID NO:79和SEQ ID NO:81构成的群组中选出的氨基酸序列有至少约45%、50%、 55%、50%、65%、70%、80%、90%、95%、98%或更多的同一性。
本发明还提供一种分离的多肽,其具有单加氧酶活性并且包含从由SEQ ID NO:28,SEQ ID NO:30,SEQ ID NO:32,SEQ ID NO:34, SEQ ID NO:36,SEQ ID NO:71,SEQ IDNO:73,SEQ ID NO:79和SEQ ID NO:81构成的群组中选出的氨基酸序列。
本发明更进一步提供一种用于生产倍半萜醇的方法,该倍半萜醇从由α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇和澳白檀醇或它们的混合物构成的群组中选出,该方法包括:
i)在适合于生产具有单加氧酶活性的p450多肽的条件下培养细胞,其中,该细胞:
a)生产丙烯酸焦磷酸酯萜前体;
b)表达P450还原酶;
c)表达具有α-法呢烯、β-法呢烯、α-檀香萜、β-檀香萜、α-反 -香柠檬烯和/或表-β-檀香萜的合酶活性,并且生产α-法呢烯、β- 法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜的多肽;并且
d)表达具有氨基酸序列的多肽,该氨基酸序列与从由SEQ ID NO:2,SEQ ID NO:4,SEQ ID NO:6,SEQ ID NO:8,SEQ ID NO:28,SEQ ID NO:30,SEQ ID NO:32,SEQ ID NO:34,SEQ ID NO:36,SEQ ID NO:38, SEQ ID NO:40,SEQ ID NO:42,SEQ ID NO:44,SEQ ID NO:50,SEQ ID NO: 52,SEQ ID NO:54,SEQ ID NO:58,SEQ ID NO:60,SEQ ID NO:62,SEQ IDNO:64,SEQ ID NO:66,SEQ ID NO:68,SEQ ID NO:71,SEQ ID NO:73, SEQ ID NO:79和SEQID NO:81构成的群组中选出的多肽至少有约 45%的序列同一性;并且
ii)任选地从该细胞中分离上述醇。
附图说明
具体实施方式
在一些实施方案中,提供一种用于生产倍半萜醇的方法,该倍半萜醇包括α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反- 香柠檬醇、表-β-檀香醇、澳白檀醇和/或它们的混合物,该方法包括:使α-法呢烯、β-法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:2有至少约45%、50%、55%、60%、65%、70%、80%、90%、95%或 98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:4有至少约45%、50%、55%、60%、65%、 70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:6有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:8有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:28有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:30有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:32有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:34有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:36有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:38有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:40有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:42有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:44有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:50有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:52有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:54有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:58有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:60有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:62有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:64有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:66有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:68有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:71有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:73有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:79有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:81有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。
本发明提供的用于生产多肽(其用于生产醇)的核苷酸序列具有与从由SEQ IDNO:1,SEQ ID NO:3,SEQ ID NO:5,SEQ ID NO:7,SEQ ID NO:27,SEQ ID NO:29,SEQ ID NO:31,SEQ ID NO:33,SEQ ID NO:35, SEQ ID NO:37,SEQ ID NO:39,SEQ ID NO:41,SEQ IDNO:43,SEQ ID NO: 49,SEQ ID NO:51,SEQ ID NO:53,SEQ ID NO:57,SEQ ID NO:59,SEQID NO:61,SEQ ID NO:63,SEQ ID NO:65,SEQ ID NO:67,SEQ ID NO:70, SEQ ID NO:72,SEQ ID NO:78和SEQ ID NO:80构成的群组中选出的序列至少有约45%、50%、55%、60%、65%、70%、80%、90%、95%或98%的同一性的核酸序列。本发明所提供的核苷酸序列是异源的,因为它们不是典型地也不是通常地由表达于其中的细胞来制造,并且其相对于其所引入的细胞通常不是内源的–其典型地从另一个细胞获得或可以合成出。
在另一实施方案中,提供一种用于生产倍半萜醇的方法,该倍半萜醇包括α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反- 香柠檬醇、表-β-檀香醇、澳白檀醇和/或它们的混合物,该方法包括:使反-α-法呢烯、反-β-法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯、表-β-檀香萜和/或β-甜没药烯与具有P450单加氧酶活性的多肽接触,所生产出的倍半萜醇包含至少约36%的顺式异构体,上述多肽包含氨基酸序列,该氨基酸序列与具有从由SEQID NO:28, SEQ ID NO:30,SEQ ID NO:58,SEQ ID NO:60,SEQ ID NO:62,SEQ ID NO: 64,SEQ ID NO:66,SEQ ID NO:68,SEQ ID NO:71和SEQ ID NO:73构成的群组中选出的氨基酸序列的多肽有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的序列同一性。
在另一实施方案中,提供一种用于生产倍半萜醇的方法,该倍半萜醇包括α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反- 香柠檬醇、表-β-檀香醇、澳白檀醇和/或它们的混合物,该方法包括:使反-α-法呢烯、反-β-法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯、表-β-檀香萜和/或β-甜没药烯与具有P450单加氧酶活性的多肽接触,所生产出的倍半萜醇包含至少约46%的顺式异构体,上述多肽包含氨基酸序列,该氨基酸序列与具有从由SEQID NO:30, SEQ ID NO:58,SEQ ID NO:60,SEQ ID NO:62,SEQ ID NO:66,SEQ ID NO: 68,SEQ ID NO:71和SEQ ID NO:73构成的群组中选出的氨基酸序列的多肽有至少约45%、50%、55%、60%、65%、70%、80%、90%、 95%或98%的序列同一性。
在另一实施方案中,提供一种用于生产倍半萜醇的方法,该倍半萜醇包括α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反- 香柠檬醇、表-β-檀香醇、澳白檀醇和/或它们的混合物,该方法包括:使反-α-法呢烯、反-β-法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯、表-β-檀香萜和/或β-甜没药烯与具有P450单加氧酶活性的多肽接触,所生产出的倍半萜醇包含至少约50%的顺式异构体,上述多肽包含氨基酸序列,该氨基酸序列与具有从由SEQID NO:58, SEQ ID NO:60,SEQ ID NO:62,SEQ ID NO:68,SEQ ID NO:71和SEQ ID NO:73构成的群组中选出的氨基酸序列的多肽有至少约45%、 50%、55%、60%、65%、70%、80%、90%、95%或98%的序列同一性。
在另一实施方案中,提供一种用于生产倍半萜醇的方法,该倍半萜醇包括α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反- 香柠檬醇、表-β-檀香醇、澳白檀醇和/或它们的混合物,该方法包括:使反-α-法呢烯、反-β-法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯、表-β-檀香萜和/或β-甜没药烯与具有P450单加氧酶活性的多肽接触,所生产出的倍半萜醇包含至少约72%的顺式异构体,上述多肽包含氨基酸序列,该氨基酸序列与含有从由SEQID NO:58, SEQ ID NO:60,SEQ ID NO:62,SEQ ID NO:68,SEQ ID NO:71和SEQ ID NO:73构成的群组中选出的氨基酸序列的多肽有至少约45%、 50%、55%、60%、65%、70%、80%、90%、95%或98%的序列同一性。
在另一实施方案中,提供一种用于生产倍半萜醇的方法,该倍半萜醇包括α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反- 香柠檬醇、表-β-檀香醇、澳白檀醇和/或它们的混合物,该方法包括:使反-α-法呢烯、反-β-法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯、表-β-檀香萜和/或β-甜没药烯与具有P450单加氧酶活性的多肽接触,所生产出的倍半萜醇包含至少约96%的顺式异构体,上述多肽包含氨基酸序列,该氨基酸序列与具有从由SEQID NO: 68,SEQ ID NO:71和SEQ ID NO:73构成的群组中选出的氨基酸序列的多肽有至少约45%、50%、55%、60%、65%、70%、80%、 90%、95%或98%的序列同一性。
在另一实施方案中,提供一种用于生产倍半萜醇的方法,该倍半萜醇包括α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反- 香柠檬醇、表-β-檀香醇、澳白檀醇和/或它们的混合物,该方法包括:使反-α-法呢烯、反-β-法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯、表-β-檀香萜和/或β-甜没药烯与具有P450单加氧酶活性的多肽接触,所生产出的倍半萜醇包含至少约100%的顺式异构体,上述多肽包含氨基酸序列,该氨基酸序列与具有从由SEQID NO:71和SEQ ID NO:73构成的群组中选出的氨基酸序列的多肽有至少约45%、50%、55%、60%、65%、70%、80%、90%、95%或98%的序列同一性。
还提供一种分离的核酸分子,其从如下构成的群组中选出:i) 具有从由SEQ IDNO:70和SEQ ID NO:72构成的群组中选出的核酸序列的核酸;以及ii)核酸分子,其编码具有P450单加氧酶活性的多肽,该多肽包含与从由SEQ ID NOs:71和SEQ ID NO:73 构成的群组中选出的氨基酸序列有至少约45%、50%、55%、50%、 65%、70%、80%、90%、95%或98%或更多的同一性的氨基酸序列。更特别的是,所编码的多肽具有从由SEQ ID NOs:71和SEQ ID NO:73构成的群组中选出的序列。
还提供一种分离的核酸分子,其从如下构成的群组中选出:i) 具有从由SEQ IDNO:78和SEQ ID NO:80构成的群组中选出的核酸序列的核酸;以及ii)核酸分子,其编码具有P450单加氧酶活性的多肽,该多肽包含与从由SEQ ID NOs:79和SEQ ID NO:82 构成的群组中选出的氨基酸序列有至少约45%、50%、55%、50%、 65%、70%、80%、90%、95%或98%或更多的同一性的氨基酸序列。更特别的是,所编码的多肽具有从由SEQ ID NOs:79和SEQ ID NO:82构成的群组中选出的序列。
还提供一种分离的核酸分子,其从如下构成的群组中选出:i) 具有从由SEQID.NO:27,29,31,33和35构成的群组中选出的核酸序列的核酸;以及ii)核酸分子,其编码具有P450单加氧酶活性的多肽,该多肽具有从由SEQ ID NO:28,SEQ ID NO:30,SEQ ID NO:32,SEQ ID NO:34和SEQ ID NO:36构成的群组中选出的序列。
在另一实施方案中提供一种用于生产具有P450单加氧酶活性的多肽的方法,该方法包括:把核酸转化到宿主细胞或非人类生物体的步骤,该核酸编码与从由SEQ ID NO:71和SEQ ID NO:73 构成的群组中选出的多肽有至少约45%、50%、55%、50%、65%、 70%、80%、90%、95%或98%的序列同一性的多肽;以及在允许生产该多肽的条件下培养该宿主细胞或生物体的步骤。
在更进一步的实施方案中提供一种用于生产具有P450单加氧酶活性的多肽的方法,该方法包括:把核酸转化到宿主细胞或非人类生物体的步骤,该核酸编码具有从由SEQID NO:71和SEQ ID NO:73构成的群组中选出的序列的多肽;以及在允许生产该多肽的条件下培养该宿主细胞或生物体的步骤。
在另一实施方案中提供一种用于生产具有P450单加氧酶活性的多肽的方法,该方法包括:把核酸转化到宿主细胞或非人类生物体的步骤,该核酸编码与从由SEQ ID NO:79和SEQ ID NO:81 构成的群组中选出的多肽有至少约45%、50%、55%、50%、65%、 70%、80%、90%、95%或98%的序列同一性的多肽;以及在允许生产该多肽的条件下培养该宿主细胞或生物体的步骤。
在更进一步的实施方案中提供一种用于生产具有P450单加氧酶活性的多肽的方法,该方法包括:把核酸转化到宿主细胞或非人类生物体的步骤,该核酸编码具有从由SEQID NO:79和SEQ ID NO:81构成的群组中选出的序列的多肽;以及在允许生产该多肽的条件下培养该宿主细胞或生物体的步骤。
上述醇可以转化成醛或羧酸,例如但不限于甜橙醛、檀香醛,香柠檬醛和澳白檀醛(lanceals)。所述醇、醛或酸可以进一步转化为衍生物,例如但不限于酯、酰胺、糖苷、醚或缩醛。
本发明所述的核酸和多肽可以分离自比如菊苣(Cichorium intybus L.)、巨大芽孢杆菌(Bacillus megaterium)、檀香树(Santalum Album) 和黄花蒿(Artemisia annua)。CYP71AV8,P450-BM3(CYP102A1)和 CYP71AV1包括变体均描述于此。
来自于植物菊苣(Cichorium intybus L.)的CYP71AV8已被定性为可区域选择性地氧化(+)-朱栾倍半萜生产反-诺卡醇、顺-诺卡醇和(+)-诺卡酮的P450单加氧酶。CYP71AV8也被发现催化大根香叶烯A和紫穗槐-4,11-二烯在C-12位置处的氧化(Cankar etal, FEBS Lett.585(1),178-182(2011))。野生型酶的氨基酸序列(NCBI登录号 NoADM86719.1,SEQ ID No 1和2)被用作设计在大肠杆菌(E.coli)中最优化表达的cDNA序列。
在真核生物中,P450单加氧酶是膜结合蛋白,并且这些蛋白质的N末端序列构成的膜锚定对这些酶的膜定位是至关重要的。蛋白质的这部分通常被富含脯氨酸结构域所划定,其对于酶活性的特异性的控制并不十分重要。因此,这个区域可以通过缺失、插入或突变进行修饰,而对催化活性不产生影响。然而,包含植物P450s的真核细胞色素P450s的N末端区域的特定修饰已经显示出在微生物中表达时具有对功能性重组蛋白的层级具有积极影响(Halkier et al(1995)Arch.Biochem.Biophys.322,369-377;Haudenschield et al(2000)Arch.Biochem.Biophys.379,127-136)。
在P450单加氧酶中,底物的识别和结合由被分布在蛋白质氨基酸序列的不同区域的几个氨基酸残基来控制。被定义为底物识别位点(SRS)的这些区域可以通过基于Gotoh所做的具体工作的简单的序列比对而被定位于任何P450的氨基酸序列中(Gotoh O(1992)J.Biol.Chem.267(1),83-90)。因此,在与底物相互作用并可以影响羟基化反应的区域选择性的CYP71AV8蛋白质残基为氨基酸Asn98 到Gly121、Thr198到Leu205、Lys232到Ile240、Asn282到Ala300、His355 到Arg367以及Thr469到Val 476。在这些区域内的一个或多个残基的修饰可以潜在地改变底物的特异性、其反应的立体化学性或者其区域选择性。作为通过P450来催化的反应的立体化学性的改变可见于Schalk et al(2002)Proc.Natl.Acad.Sci.USA 97(22),11948-11953。在该出版物中,植物P450酶的单个残基的变化会导致酶反应的区域专一性完全转变。
在这里“倍半萜合酶”或“具有倍半萜合酶活性的多肽”是指作为本申请的多肽,该多肽能够催化由从香叶基焦磷酸(GPP)、法呢基二磷酸(FPP)和香叶基香叶基焦磷酸(GGPP)构成的组中选出的无环焦磷酸萜前体合成为倍半萜分子或倍半萜分子的混合物。
α-檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜可通过例如美国专利公开号2011-0008836(公开日:2011年1月13日) 以及美国专利公开号2011-0281257(公开日:2011年11月27日) 中所述的合酶来制备,这两篇文献的整体并入于此。
根据本发明,多肽也意味着包括截短的多肽,条件是它们能够保持如任何上述实施方案所定义的P450单加氧酶活性。
在两个肽或核苷酸序列之间的同一性百分比是在已经生成这两个序列的比对时,在这两个序列中相同的氨基酸或核苷酸残基数目的函数。相同的残基定义为在这两个序列中在给定的比对位置上相同的残基。在此处使用的序列同一性的百分比是由最优比对,通过将在两个序列之间的相同残基的数目除以在最短的序列中的残基的总数然后乘以100而计算得到的。该最佳比对是同一性百分比可能是最高的比对。在该比对的一个或多个位置中的一或两个序列中引入间隙以便获得最佳比对。然后在序列同一性的百分比计算中将这些间隙作为不相同的残基来考虑。
以测定氨基酸或核酸序列同一性的百分比为目的的比对可以使用计算机程序以多种方式来实现,例如在万维网上可获得的公开可用的计算机程序。优选,可使用来自National Center for Biotechnology Information(NCBI)的地址为http://www.ncbi.nlm.nih.gov/BLAST/bl2seq/wblast2.cgi 的BLAST程序(Tatiana等,FEMSMicrobiol Lett.,1999,174:247-250, 1999),其参数设为默认,来获得肽或核苷酸序列的最优比对,以及来计算序列同一性的百分比。
特定的生物体或细胞,当其天然产生FPP时,或当其并不天然产生FPP但可经转化以产生FPP时,不管是用描述于此的核酸转化之前还是与所述核酸一起都意味着“能够产生FPP”。经转化的与天然存在的生物体或细胞相比产生更高量FPP的生物体或细胞也包括在“能够产生FPP的生物体或细胞”内。转化生物体(例如微生物)以便它们产生FPP的方法已经是本领域公知的。这些方法可以例如在文献中找到,例如在下列出版物中:Martin,V.J.,Pitera, D.J.,Withers,S.T.,Newman,J.D.和Keasling,J.D.Nat Biotechnol.,2003,21(7), 796-802(大肠杆菌(E.coli)的转化);Wu,S.,Schalk,M.,Clark,A.,Miles,R.B.,Coates,R.和Chappell,J.,Nat Biotechnol.,2006,24(11),1441-1447(植物的转化);Takahashi,S.,Yeo,Y.,Greenhagen,B.T.,McMullin,T.,Song,L., Maurina-Brunker,J.,Rosson,R.,Noel,J.,Chappell,J,Biotechnology andBioengineering,2007,97(1),170-181(酵母的转化)。
适合于在体内进行本发明方法的非人宿主生物体可以是任何非人的多细胞或单细胞生物体。在优选的实施方案中,用于在体内进行本发明的非人宿主生物体是植物、原核生物或真菌。可以使用任何植物、原核生物或真菌。特别地,可使用的植物是天然产生高数量萜的植物。在更优选的方案中,该植物选自茄科 (Solanaceae)、禾本科(Poaceae)、十字花科(Brassicaceae)、碟形花科 (Fabaceae)、锦葵科(Malvaceae)、菊科(Asteraceae)或唇形科(Lamiaceae)。例如,该植物选自烟草属(Nicotiana)、茄属(Solanum)、高粱属(Sorghum)、拟南芥属(Arabidopsis)、芸苔属(Brassica(油菜))、苜蓿属 (Medicago(紫花苜蓿))、棉属(Gossypium(棉花))、蒿属(Artemisia)、鼠尾草属(Salvia)和薄荷属(Mentha)。优选,该植物属于烟草(Nicotiana tabacum)种。
在更优选的实施方案中,用于在体内进行本发明方法的非人宿主生物体是微生物。可以使用任何微生物,但是根据更加优选的实施方案,所述微生物是细菌或酵母。最优选,所述细菌是大肠杆菌(E.coli),所述酵母是酿酒酵母(Saccharomyces cerevisiae)。
这些生物体中的一些天然不会产生FPP。为了适于进行本发明的方法,必须将这些生物体转化以产生所述前体。如以上所述,用上述任何实施方案描述的用核酸修饰之前或者同时,可以将它们原样转化。
还可以使用分离的高等真核细胞代替完整生物体作为宿主以便在体内实施本发明的方法。适合的真核细胞可以是任何非人的细胞,但是优选是植物细胞或真菌细胞。
在此处所使用的多肽指的是包含于此处标明的氨基酸序列的多肽或肽片段、以及截短的或变体多肽,条件是它们保持如以上定义的P450单加氧酶活性,而且它们与相应的多肽共有至少所定义的百分比的同一性。
变体多肽的实例是由选择性mRNA剪接或如此处所述形成多肽蛋白酶剪切而得到的天然存在的蛋白质。可归因于蛋白水解的变体包括,例如,当在不同类型的宿主细胞中表达时,由于从本发明多肽上蛋白水解移除一个或多个末端氨基酸而导致在N-或C- 末端的差异。本发明也包含如后所描述的用由本发明核酸的天然或人工突变而获得的核酸编码的多肽。
由在氨基和羧基末端融合另外的肽序列而产生的多肽变体也可用于本发明的方法。尤其是这样的融合可以提高多肽的表达,在期望的环境或表达系统中利于蛋白质的提纯或多肽酶活性的改善。这种另外的肽序列可以例如是信号肽。因此,本发明包含使用变体多肽的方法,例如通过与其它寡肽或多肽融合而获得的多肽和/或与信号肽连接的多肽。在本发明的方法中有利地还可以使用由与另外的功能蛋白(例如来源于萜生物合成路径的另外的蛋白质)融合而产生的多肽。
此处所使用的多肽指的是含有此处确定的氨基酸序列的多肽或肽片段、以及截短的或变体多肽,条件是它们保持如上定义的活性。
变体多肽的实例是由选择性mRNA剪接或此处所述形成多肽蛋白酶剪切而得到的天然存在的蛋白质。可归因于蛋白水解的变体包括,例如,当在不同类型的宿主细胞中表达时,由于从本发明多肽上蛋白水解去移除一个或多个末端氨基酸而导致在N-或C- 末端的差异。本发明还包含用如后所描述的由本发明核酸的天然或人工突变而获得的核酸编码的多肽。
由另外的肽序列在氨基和羧基末端融合而产生的多肽变体也包含于本发明的多肽之中。尤其是这样的融合可以在期望的环境或表达系统中提高多肽的表达,利于蛋白质的提纯或多肽酶活性的改善。这种另外的肽序列可以例如为信号肽。因此,本发明包括本发明的多肽的变体,例如通过与其它寡肽或多肽融合而获得的多肽变体和/或与信号肽连接的那些多肽变体。本发明的多肽还可以包括由与另外的功能蛋白(例如来源于萜生物合成路径的另外的蛋白质)融合而产生的多肽。
本发明的核酸可以定义为包括单链或双链形式的脱氧核糖核苷酸或核糖核苷酸聚合物(DNA和/或RNA)。术语“核苷酸序列”也应该被理解为包括分离片段形式或作为较大核酸组分的聚核苷酸分子或寡核苷酸分子。本发明的核酸还包含某些分离的核苷酸序列,其包括基本上不污染内源材料的那些核苷酸序列。本发明的核酸可以是截短的,条件是其编码本发明包含的如上所述的多肽。
另一个用于转化适于在体内实施本发明方法的宿主生物体或细胞的重要工具是包括本发明任何实施方案的核酸的表达载体。因此这种载体也是本发明的目的之一。
如此处所使用的“表达载体”包括任何直线的或环状的重组载体,其包括但不限于病毒载体、噬菌体和质粒。技术人员能够根据表达系统选择适合的载体。在一个实施方案中,该表达载体包括本发明的核酸,其可操作地连接到至少一种控制转录、翻译、开始和终止的调节序列,如转录的启动子、操纵子或增强子,或mRNA核糖体的结合部位,并且非强制选择地包括至少一种选择标记。当该调节序列功能上与本发明的核酸相关时,该核苷酸序列是“可操作地连接的”。
本发明的表达载体可在如下进一步公开的用于在包藏本发明核酸的宿主生物体和/或细胞中制备遗传转化的宿主生物体和/或细胞的方法和生产或制造具有P450单加氧酶活性的多肽的方法中使用。
经转化以包藏本发明至少一种核酸以便使其异源表达或过表达本发明至少一种多肽的重组非人宿主生物体和细胞也是实施本发明方法的十分有用的手段。因此,这种非人宿主生物体和细胞是本发明的另一个目的。
根据任何上述实施方案的核酸可用于转化该非人宿主生物体和细胞并且所表达的多肽可以是任何上述的多肽。
适合于在体内进行本发明方法的非人宿主生物体可以是任何非人的多细胞或单细胞生物体。在优选的实施方案中,非人宿主生物体是植物、原核生物或真菌。可以使用任何植物、原核生物或真菌。特别地,可使用的植物是天然产生高数量萜的植物。在更优选的方案中,该植物选自茄科(Solanaceae)、禾本科(Poaceae)、十字花科(Brassicaceae)、碟形花科(Fabaceae)、锦葵科(Malvaceae)、菊科(Asteraceae)或唇形科(Lamiaceae)。例如,该植物选自烟草属 (Nicotiana)、茄属(Solanum)、高粱属(Sorghum)、拟南芥属(Arabidopsis)、芸苔属(Brassica(油菜))、苜蓿属(Medicago(紫花苜蓿))、棉属(Gossypium(棉花))、蒿属(Artemisia)、鼠尾草属(Salvia)和薄荷属 (Mentha)。优选,该植物属于烟草(Nicotiana tabacum)种。
在更优选的实施方案中,用于在体内进行本发明方法的非人宿主生物体是微生物。可以使用任何微生物,但是根据更加优选的实施方案,所述微生物是细菌或酵母。最优选,所述细菌是大肠杆菌(E.coli),所述酵母是酿酒酵母(Saccharomyces cerevisiae)。
分离的高等真核细胞还可以经转化以代替完整生物体。作为高等真核细胞,可以是除酵母细胞外的任何非人的真核细胞。特别优选的高等真核细胞是植物细胞或真菌细胞。
术语“经转化”是指宿主经过遗传工程以便使其含有上述任何实施方案中所需要的每个核酸的一个、两个或更多个拷贝。优选地,术语“经转化”涉及异源表达由该核酸编码的多肽(该多肽使用所述核酸转化)以及过表达所述多肽的宿主。因此,在实施方案中,本发明提供了经转化的生物体,其中该多肽的表达量高于未经如此转化的相同生物体的表达量。
现有技术中已知有多种方法用于生成转基因宿主生物体或细胞,如植物、真菌、原核生物,或高等真核生物体的细胞培养物。适用于细菌、真菌、酵母、植物和哺乳动物细胞宿主的合适的克隆和表达载体的描述参见例如Pouwels等的Cloning Vectors:A LaboratoryManual,1985,Elsevier,New York和Sambrook等的Molecular Cloning:A LaboratoryManual,第二版,1989,Cold Spring Harbor Laboratory Press。尤其是用于高等植物和/或植物细胞的克隆和表达载体是技术人员可以得到的。参见例如Schardl等的Gene 61:1-11,1987。
转化宿主生物体或细胞以使其包藏转基因核酸的方法为技术人员所熟知。对于生产转基因植物,例如通用的方法包括:植物原生质体的电穿孔法、脂质体介导的转化法、土壤杆菌介导的转化法、聚乙二醇介导的转化法、粒子轰击法、植物细胞的显微注射法和应用病毒的转化法。
在一个实施方案中,转化的DNA被整合到非人宿主生物体和 /或细胞的染色体中,从而得到稳定的重组体系统。现有技术中任何公知的染色体整合方法可以应用于本发明的实践中,包括但不限定为重组酶介导的盒式交换(RMCE)、病毒位点特异性染色体插入法、腺病毒法和核内注射法。
“多肽变体”这里指的是这样一种多肽,其具有如上所述的活性且基本上与根据任何上述实施方案的多肽同源,但是其具有的氨基酸序列因为一处或多处缺失、插入或取代而不同于由本发明任一核酸所编码的氨基酸序列。
变体可以包括保守取代序列,意味着某一特定氨基酸残基被一具有类似理化特性的残基所取代。保守取代的实例包括:用一个脂肪族残基取代另一个脂肪族残基,例如,Ile、Val、Leu或Ala 之间相互取代;或者用一个极性残基取代另一个极性残基,例如在Lys和Arg之间、Glu和Asp之间,或Gln和Asn之间相互取代。参见Zubay,Biochemistry,1983,Addison-Wesley Pub.Co.。这类取代的效果可以用取代打分矩阵,如Altschul,J.Mol.Biol.,1991,219, 555-565中所述的PAM-120、PAM-200和PAM-250来计算。其他这类保守取代,例如具有近似疏水特性的完整区域的取代,已为本领域熟知。
天然生成的肽变体也包含在本发明范围之内。这种变体的实例是由于选择性mRNA剪接或者本文所述多肽的蛋白酶剪切而得到的蛋白质。蛋白水解引起的变体包括,例如,在不同类型宿主细胞中表达时,由本发明序列编码的多肽由于蛋白水解移除了一个或多个末端氨基酸而导致N-或C-末端的差异。
本发明多肽的变体可以用于获得例如酶活性期望的增强或减弱、区域化学(regiochemistry)或立体化学的修饰、或者底物利用或者产物分配的改变、对于底物的亲和力的增加、一种或多种想得到化合物的产量的改进的特异性、酶反应速率的增加、在特定环境(pH、温度、溶剂等等)中的更高活性或稳定性、或者在想要的表达系统中的改进的表达水平。变体或定位突变体可以通过现有技术已知的任何方法来生成。天然多肽的变体和衍生物能够通过分离其它或相同植物品系或物种(例如来自檀香属物种的植物)的天然产生的变体或者变体的核苷酸序列来获得,或者通过对编码本发明多肽的核苷酸序列进行人工规划突变来获得。天然氨基酸序列的改变可通过多种传统方法中的任一种完成。
由附属肽序列在本发明多肽的氨基和羧基末端融合产生的多肽变体可被用于增强多肽的表达,在期望的环境或表达系统中利于蛋白的纯化或提高多肽的酶活性。这种附属肽序列例如可为信号肽。因此,本发明包含本发明多肽的变体,例如通过与其它寡肽或多肽和/或连接到信号肽上的多肽融合获得的那些变体。包含在本发明范围内的融合多肽还包括由融合其它功能蛋白质如来自萜生物合成路径的其它蛋白质而产生的融合多肽。
本发明生产出的上述醇可以通过提取而从在自然界中产生的醇中分离,例如使用公知的方法来提取(例如,从檀香木中提取)。本发明产生出的醇用作可用于香料的芳香化合物。
aaCPR 黄花蒿(Armisia annua)细胞色素P450还原酶
Bp 碱基对
Kb 千碱基
DNA 脱氧核糖核酸
cDNA 互补DNA
ClASS 黄皮(Clausena lansium)(+)-α-檀香萜合酶
CPRm 椒样薄荷(Mentha piperita)细胞色素P450还原酶
DTT 二硫苏糖醇
EDTA 乙二胺四乙酸
FPP 法呢基焦磷酸
GC 气相色谱
IPTG 异丙基-D-硫代半乳糖苷
LB 溶菌肉汤
MS 质谱
MTBE 甲基叔丁基醚
PCR 聚合酶链式反应
RMCE 重组酶介导的盒式交换
RNA 核糖核酸
mRNA 信使核糖核酸
SaSAS 檀香树(Santalum album)(+)-α-檀香烯/(-)-β-檀香烯合酶
下述实施例仅为示例,并不意味着限制本发明的发明内容、说明书或权利要求书中所规定的范围。
实施例
优化的CYP71AV8 cDNA序列在细菌中的表达
CYP71AV8的膜锚定区域被重新设计以引入如下所示的修饰。
在优化的CYP71AV8序列中,修饰5’-末端以将膜锚定区域的首个氨基酸替代为显示出提升在细菌细胞中膜结合P450s的异源表达性的多肽序列(Alkier,B.A.etal.Arch.Biochem.Biophys.322,369-377 (1995),Haudenschield,et alArch.Biochem.Biophys.379,127-136(2000))。此外,作为整个cDNA,密码子的使用适于匹配大肠杆菌(E.coli)密码子的使用。因此,多个cDNA’被设计为与CYP71AV8不同的3’末端修饰和优化:
-CYP71AV8-65188:在此结构中,前22个密码子被用于编码MALLLAVFWSALIILV肽的序列代替(SEQ ID NO 3和4)。
-CYP71AV8-P2:整个锚定编码序列被源于薄荷 (Haudenschield,et alArch.Biochem.Biophys.379,127-136(2000)中的PM2) 的优化的柠檬烯羟化酶的锚定序列代替(SEQ ID NO 5和6)。
CYP71AV8-P2O:该结构编码与上述结构相同的蛋白质,但膜锚定区域更进一步进行了密码子优化(SEQ ID NO 7和8)。
在图1中,比较不同的CYP71AV8变体的N-末端区域的氨基酸序列,在图2中,比较三种结构的DNA序列。该三种优化的 CYP71AV8 cDNA在体内合成(DNA2.0,Menlo Park,CA,USA),并且克隆为NdeI-HindIII片段插入于pCWori表达质粒(Barnes,H.J. MethodEnzymol.272,3-14;(1996))。
在细菌细胞中CYP71AV8的功能性表达
对于异源表达,将CYP71AV8表达质粒转化到JM109大肠杆菌细胞(实施例1)。将转化体的单菌落用在含50μg/mL氨苄青霉素的5毫升LB培养基的接种培养物上。在37℃使细胞生长10~12 小时。然后将培养物接种到添加了50μg/m氨苄青霉素和1mM硫胺盐酸盐的250毫升TB培养基(极品肉汤)中。该培养物在28℃以适当摇动(200rpm)下温育3~4小时,之后添加75毫克/升δ-氨基乙酰丙酸(σ)和1mM IPTG(异丙基-β-D-1-硫代半乳糖苷),然后在 28℃以200rpm摇动下将培养物保持24~48小时。
P450酶的表达可以定性评估,并通过大肠杆菌的蛋白组成的 CO结合光谱(Omura,T.&Sato,R.(1964)J.Biol.Chem.239,2379-2387)定量测量。对于蛋白提取,将细胞离心(10分钟,5000g,4℃)并在 35毫升冰的缓冲液1(100mM的Tris-HCl,pH 7.5,20%甘油,0.5mMEDTA)中重悬。添加1体积的于水中的0.3mg/ml溶菌酶 (Sigma-Aldrich),并在4℃搅拌该悬浮液10~15分钟。在4℃将悬浮液于7000g离心10分钟,将粒状物在20ml缓冲液2(25mMKPO
按照此步骤,测定了重组CYP71AV8在450nm处具有最大吸光度的典型的CO光谱,验证了正确折叠形成有功能P450酶。
在细菌中CYP71AV8与植物P450还原酶的共表达
为了重组植物p450的活性,第二膜蛋白的存在是必要的。该蛋白,即P450还原酶(CPR),参与到将电子从辅助因子NAPDH(还原形式的烟酰胺腺嘌呤二核苷酸磷酸)转移到P450活性位点。已经表明来自一种植物的CPR可以完善来自另一植物的P450酶的活性(Jensen和Moller(2010)Phytochemsitry 71,132-141)。多个编码 CPRDNA序列已被报道来自不同的植物。我们首先选择一个分离自椒样薄荷(Mentha piperita)(CPRm,未公开数据,SEQ ID NO 10) 的CPR,优化全长cDNA的密码子使用(SEQ ID No 9),并且将其克隆至pACYCDuet-1表达质粒(Novagen)的NcoI和HindIII限制性位点以提供质粒pACYC-CPRm。
使用pCWori-CYP71AV8-65188和pACYCDuet-CPRm这两种质粒在大肠杆菌细胞中共表达CYP71AV8和CPRm。将这两种质粒共转化到BL21 Star
采用表达CYP71AV8的大肠杆菌细胞进行的(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反-香柠檬烯和(+)-表-β-檀香萜的生物转化
如上所述采用被改造成根据异源甲羟戊酸途径生产法呢基二磷酸(FPP)并表达植物的倍半萜合酶的大肠杆菌细胞来制备在生物转化测定中用作底物的不同的倍半萜烃类。该大肠杆菌宿主细胞的改造以及应用如专利WO2013064411或Schalk et al(2013)J.Am.Chem.Soc.134,18900-18903所述。简而言之,制备表达质粒使其含两个操纵子,该操纵子由编码完整的甲羟戊酸途径的基因的酶组成。体外合成第一合成的操纵子(DNA2.0,MenloPark,CA,USA),该操纵子由大肠杆菌乙酰乙酰辅酶A硫解酶(atoB)、金黄色葡萄球菌的HMG-CoA合酶(mvaS)、金黄色葡萄球菌HMG-CoA还原酶 (mvaA)和酿酒酵母FPP合酶(ERG20)基因组成,并将其连接到用 NcoI-BamHI消化的pACYCDuet-1载体(Invitrogen)上,得到 pACYC-29258。包含甲羟戊酸激酶(MvaK1)、磷酸甲激酶 (MvaK2)、甲羟戊酸磷酸脱羧酶(MvaD)和异戊烯基二磷酸异构酶(idi)的第二操纵子扩增自肺炎链球菌(ATCCBAA-334)的基因组 DNA,并且将该第二操纵子连接到pACYC-29258的第二多克隆位点以提供质粒pACYC-29258-4506。因此,这种质粒包含编码引起乙酰辅酶A变为FPP的生物合成途径的任何酶的基因。质粒 pACYC-29258-4506与质粒pET101-Cont2_1(包含编码黄皮 (Clausena lansium)(+)-α-檀香萜合酶(ClASS)的cDNA,WO2009109597) 或者质粒pETDuet-SCH10-Tps8201-opt(包含编码檀香树(Santalum album)(+)-α-檀香萜/(-)-β-檀香萜合酶(SaSAS)的cDNA,WO2010067309)共转化到大肠杆菌细胞(BL21 Star
通过在使用上述作为底物列出的倍半萜分子的大肠杆菌中的生物转化来评价CYP71AV8的酶活性。如实施例3所述那样培养并获取用pACYCDuet-CPRm和pCWori-CYP71AV8-65188转化的 BL21 Star
在这些条件下,观察到(+)-α-檀香萜的氧化。转化的主要产物是(E)-α-檀香醇。检测到通过大肠杆菌内源性酶从(E)-α-檀香醇的转化而成的其他产物:(E)-α-檀香醛(通过醇脱氢酶而产生)和 (E)-α-二氢檀香醇(通过烯酸还原酶而产生)(图3A)。相似的,使用(+)-α-檀香萜或(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反-香柠檬烯和(+)-表-β-檀香萜的混合物作为底物,观察到(E)-α-檀香醇、 (E)-β-檀香醇,(E)-α-反-香柠檬醇和(E)-表-β-檀香醇的形成,也获得了更进一步的代谢产物(图3B)。本实施例表明CYP71AV8可用于(+)-α-檀香萜、(-)-β-檀香萜以及相似结构的分子的末端氧化。
从单种质粒中构建合成操纵子以共表达CYP71AV8和CPR
数个双顺反子操纵子被设计为在特殊的启动子的控制下从单种质粒中表达P450酶和CPR。优化的CYP71AV8 cDNAs的三种变体(实施例1)与两种CPR cDNAs结合:密码子优化的CPRm cDNA(实施例2)以及用于编码黄花蒿(Artemisia annua)CPR(NCBI 登录号ABM88789.1,SEQ ID No 12)的密码子优化的cDNA(Seq ID No 11)。因此,设计出六种结构(Seq ID No 13-18),每种结构均包含P450 cDNA,接下来是包括核糖体结合位点的接头序列(RBS) 和CPR cDNA(图4)。该结构通过PCR来制备:P450和CPR cDNAs 是分别扩增的,并具有5’和3’突出端(overhang),其适于在pCWori+ 质粒的NdeI-HindIII位点中采用In-
为了评价不同N末端修饰对P450s和与CPRs耦合的影响,将六种质粒转入到大肠杆菌BL21 Star
在工程细胞中体内生产含氧倍半萜
(+)-α-檀香萜和(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反-香柠檬烯、(+)-表-β-檀香萜或其它相似结构的分子的氧化产物也可直接产自被改造成从比如葡萄糖或甘油的碳源来生产倍半萜的大肠杆菌细胞。制备出包含由P450、CPR和萜类合成酶构成的合成操纵子的pCWori+质粒构成的质粒(Barnes H.J(1996)Method Enzymol.272, 3-14)。作为P450,采用CYP71AV8-P2或CYP71AV8-P2O cDNA,作为萜类合成酶,采用黄皮(Clausena lansium)(+)-α-檀香萜合酶 cDNA(ClASS)(WO2009109597)或编码檀香树(Santalum album)(+)-α-檀香萜/(-)-β-檀香萜合酶(SaSAS)的cDNA (WO2010067309)。采用下述工序来构造出四种质粒。设计并合成出ClASS cDNA的密码子优化版本(SEQ ID NO 19-20)(DNA 2.0),并且克隆到pETDUET-1质粒(Novagen)的NdeI-KpnI位点以提供质粒pETDuet-Tps2opt。作为SaSAS,设计出优化的全长cDNA (SEQ ID NO 21-22),合成并克隆到pJexpress414质粒(DNA2.0)以提供质粒pJ414-SaTps8201-1-FLopt。设计出各种结构引物,用于采用In-
将四种质粒中的任一种与带来完整的甲羟戊酸途径的质粒 pACYC-29258-4506共转化到大肠杆菌BL21 Star
在生物转化实验中观察到所有所得的菌株产生出倍半萜烃类以及相应的含氧产物(图6)。该实验显示出使用表达CYP71AV8 的工程细胞,可生产出倍半萜(E)-α-檀香醇、(E)-β-檀香醇和其它相似结构的分子。
使用CYP71AV8变体以生产(E)-α-檀香醇和(E)-β-檀香醇
根据上述实施例,我们示出了CYP71AV8对(+)-α-檀香萜和 (-)-β-檀香萜的“末端反式碳”具有高度选择性,并且专门生产(E)-α- 檀香醇、(E)-β-檀香醇。在本实施例中,我们描述了一种定点诱变的方法来修饰CYP71AV8酶活性,以便产生(Z)-α-檀香醇和(Z)-β-檀香醇。首先选出L358作为控制酶活性的活性位点残基。 CYP71AV8的一系列变体通过将用作编码L358的密码子替代为用于编码其它氨基酸的密码子从而生产出。通过两步PCR工序来导入突变,该两步PCR工序使用简并寡核苷酸(包含NBT(N=A,C,G,T; B=C,G,T)密码子以替代L358编码密码子)和特异性寡核苷酸的组合。此种寡核苷酸的组合允许将L358编码密码子变为编码包括所有具有疏水性侧链的氨基酸的其它12种残基的密码子。采用诱变反向引物AV8-L358-rev (5'-CACGCGGCATCACCAGCGGAVNCGGCGGATGCAGGCGCAGGGTTTCTTTAATC-3')和引物AV8-pcw-fw(5’- CATCGATGCTTAGGAGGTCATATGGCTCTGTTATTAGCAG-3’) 实施第一步PCR以扩增cDNA的5’部分。采用引物AV8-L358-fw (5'-TCCGCTGGTGATGCCGCGTGAGTGC-3')和AV8-CPR-rev(5'- ATATATCTCCTTCTTAAAGTTAGTCGACTCATTAGGTG-3')来扩增第二PCR产物。对于这两步扩增,均采用 pCWori-CYP71AV8-P2-CPRm-ClASS作为模板。第二轮扩增采用上述两种PCR产物作为模板和引物AV8-L358-fw+ AV8-CPR-rev,并且允许扩增全长CYP71AV8变体cDNAs。所有 PCR反应均可按照生产商的指导而采用PfuUltra II融合HS DNA 聚合酶(Stratagene)。可采用Gibson装配预混液(NewEngland Biolabs),将经修饰的cDNA连接到通过NdeI-SalI消解的 pCWori-CYP71AV8-P2-CPRm-ClASS。最终结构通过测序而被控制,并且可为各种期望的CYP71AV8变体选择一种质粒克隆体。也可通过将Leu358替代为Ala、Phe、Thr、Ser、Val、Gly、Ile、Met、 Pro、Tyr、Trp和Arg来生产出其它12种变体(SEQ ID NO 27~50)。
采用如实施例6所述的体内倍半萜生产方法来进行各种 CYP71AV8变体的评价。简单来说,含有任一种CYP71AV8变体 cDNA、CPRm cDNA和ClASS cDNA的pCWori+质粒与pACYC-29258质粒一起共转化到KRX大肠杆菌细胞(Promega)中。如实施例6所述的那样选择并培养经转化的细胞,并且评价倍半萜的生产。如图7所示,与野生型P450酶相比,生产出除了反式氧化产物的某些变体(Z)-α-檀香醇。对于各变体,通过将所生产的 (Z)-α-檀香醇的总量除以含氧α-檀香萜衍生物的总量来计算顺反式氧化率。各种变体的此种计算的结果示于如下表1:
表1.CYP71AV8野生型酶的区域选择性以及α-檀香萜的氧化的活性位点变体
表1中所示的上述数据表现出,CYP71AV8可被改造并用于生产(Z)-α-檀香醇。特别是L358T、L358S、L358A和L358F变体可被用作以高达46%的顺式末端碳的选择性进行的(+)-α-檀香萜的末端氧化.
在类似方法中,评价了CYP71AV8变体的(Z)-β-檀香醇的生产。通过将上述质粒中的ClASS cDNA替代为SaSAS cDNA来制备新的质粒。因此,可通过限制酶HindIII和EcoRI来消解质粒 pCW-CYP71AV8-L358F-CPRm-ClASS,从而去除ClASS cDNA。同时,可通过相同的酶来消解pCWori-CYP71AV8-P2-CPRm-SaSAS,从而恢复SaSAS的cDNA兼容粘性末端。采用T4DNA连接酶 (NEW England Biolabs)将线性化载体与消解的插入段连接。如上所述,使用如此获得的质粒用于在相同条件下在大肠杆菌细胞中进行的含氧倍半萜的体内生产。图8示出通过CYP71AV8-L358F 形成的产物的分析的GCMS概况,显示出经修饰的CYP71AV8酶也可用于生产(Z)-β-檀香醇。
CYP71AV族的其它成员的评价
通过具有檀香萜骨架的倍半萜的氧化来评价CYP71AV1 (NCBI登录号ABB82944.1)。制备出结构类似于实施例5的质粒的质粒:设计出包含用于编码N-末端经修饰的CYP71AV1蛋白质 (SEQ ID NO 53和54)的优化的cDNA和aaCPR cDNA(实施例5) 的双顺反子操纵子,并且进行体内合成(DNA2.0),并作为双顺反子操纵子而克隆至pCWori+质粒。上述质粒用于转化KRX大肠杆菌细胞(Promega)。如实施例3那样培养经转化的细胞并且引起蛋白质表达。如实施例4那样进行使用(+)-α-檀香萜作为底物的生物转化实验。如图9所示,可获得与CYP71AV8相同的产物(即(E)-α- 檀香醇和(E)-α-檀香醛),这显示出CYP71AVP450族的其它成员也可被用于檀香萜的末端氧化。
使用CYP71AV1,制备出包含CYP71AV1 cDNA、aaCPR和 (+)-α-檀香萜合酶cDNA(ClASS)的合成操纵子。通过NdeI和 HindIII来消解包含CYP71AV8-P2-CPRm-ClASS操纵子的 pCWori+质粒(实施例6),从而切除P450编码cDNA。同时,如上一段中所述的那样,以相同的酶来消解从而从双顺反子操纵子恢复出CYP71AV1 cDNA,采用T4 DNA连接酶(NEWEngland Biolabs)而连接至经消解的上述pCWori质粒,产生质粒 pCWori-CYP71AV1-CPRm-ClASS。该质粒与质粒 pACYC-29258-4506被用于共转化大肠杆菌BL21 Star
P450-BM3(CYP102A1)突变体库的构建
通过将五种疏水性氨基酸(丙氨酸,缬氨酸,苯丙氨酸,亮氨酸和异亮氨酸)系统地合并到靠近P450-BM3的血红素的中心的两个位置,从而构建出24种变种的P450-BM3突变体库。改变这两个氨基酸的侧链大小已经显示出可显著改变紧靠在血红素基团的底物结合腔的形状(Appl Microbiol Biotechnol 2006,70:53;Adv Synth Catal 2006,348:763)。已知晓第一热点(Phe 87)改变底物的特异性和区域选择性,同时,也已预测到第二位置(Ala328)在氧化时与所有底物相互作用(ChemBiochem 2009,10:853)。可采用 QuickChange
α-檀香萜:P450-BM3库的体外筛查
如先前报道的那样,将24种P450-BM3突变体和酶的野生型版本异源表达到大肠杆菌BL21(DE3)细胞中(Adv.Synth.Catal.2003, 345:802)。简单来说,将经转化细胞的单菌落用于接种到2毫升的添加了30μg/ml卡那霉素的LB培养基中,并且伴随着轨道振荡(150rpm)在37℃下生长,直至OD
如实施例4所述,制备在生物转化实验中用作底物的α-檀香萜。转化在包含~0.5μM CYP酶、2%(v/v)DMSO和0.2mMμ-檀香萜底物的1ml的50mM磷酸钾缓冲液中进行。通过加入0.1mM NADPH开始反应,并且伴随着温和摇动在室温下进行22小时.
然后,在装有FS-Supreme-5色柱(30m×0.25mm×0.25μm)的 GC/MS QP-2010仪器(Shimadzu,Japan)上分析样品,氦作为载气 (流速:0.68毫升/分钟;线速度:30厘米/秒)。使用电喷雾离子化来收集质谱。注射器温度设定为250℃。色柱烘箱设为50℃ 1min, 然后以30℃/min的速度升温到170℃,随后以5℃/min的速度升温到185℃,维持等温3min,然后以5℃/min的速度升温到200℃,然后以30℃/min的速度升温到300℃,最后维持等温1min。
P450-BM3库的α-檀香萜体内筛查
同样使用被改造成从简单碳源生产(+)-α-檀香萜的细菌菌株以在体内筛查P450-BM3突变体库。为此,将实施例4的FPP-高产菌株用含有源于黄皮(Clausena lansium)(ClASS)(WO2009109597) (SEQ ID No 19和20)的密码子优化版本(+)-α-檀香萜合酶的pETDuet-1质粒进行转化,并且分别将各P450-BM3变体克隆到载体的第一和第二多克隆位点(MCS)。或者,将(+)-α-檀香萜合酶 cDNA克隆至pET101表达质粒(Novagen)中,并且将来自库的各 P450-BM3突变体克隆到pCDFDuet-1载体(Novagen)中。所得到的重组载体被共转化到FPP-高产菌株中。
将经转化细胞的单菌落用于接种到5毫升的添加了合适抗生素的LB培养基中。以250rpm在37℃下过夜温育培养物。第二天,将200μl的过夜培养物接种到添加了3%甘油、1mM盐酸硫胺素(Sigma-Aldrich,St Louis,MI)和75μg/Lδ-氨基乙酰丙酸 (Sigma-Aldrich)的2mL的极品肉汤(TB)培养基中,并且以250rpm 在37℃下温育。在4~6小时的培养后(或当培养物在600nm的光密度达到2~3的值时),将培养物冷却到28℃,并且通过0.1mM IPTG来引起蛋白质表达。在此时,将10%(v/v)的十二烷加入到生长培养基。在伴随轨道振荡(250rpm)温育48h后,通过1体积的甲基叔丁基醚(MTBE)来对细胞培养基进行两次萃取,并且通过 GC/MS分析溶剂萃取物。在装有DB1色柱(30m x 0.25mm x 0.25mm膜厚;Agilent)以及5975系列质谱仪的Agilent 6890系列 GC系统中进行GC/MS。载气为恒定流速1毫升/分钟的氦气。注射处于无分流模式,并且将进样口温度设定在250℃,烘箱温度设定为以10℃/min的速度从50℃到225℃,然后以20℃/min的速度升到320℃。基于保留指数的一致性和可信标准的质谱来确认产物的身份。
P450-BM3突变体库的体外(实施例10)与体内筛查提供了可比较的结果,将该结果归纳于表2。P450-BM3野生型(SEQ ID No 55和56)没有显示出(+)-α-檀香萜有任何可检测到的活性,而6种P450-BM3变体能将α-檀香萜转换为所期望的α-檀香醇。这些变体揭示出对于(+)-α-檀香萜的顺式末端碳的氧化的45%~96%的优选性。单一突变体#23(A328V)(SEQID No 67和68)和双突变体#7 (F87I/A328I)(SEQ ID No 57和58)、#17(F87V/A328I)(SEQID No 59和60)和#18(F87V/A328L)(SEQ ID No 61和62)显示出在 72%~96%范围内的最高的区域选择性(表2和图10)。两种附属变体#19(F87V/A328V)(SEQ ID No 63和64)和#20(F87V/A328F)(SEQ ID No 65和66)对于顺式羟基化的选择性较差 (45%~50%的范围),并且会生成附属氧化产物。
表2.通过P450-BM3变体的α-檀香萜向α-檀香醇的转化
上述结果表明:P450-BM3活性位点突变能够使非天然底物 (+)-α-檀香萜结合。选定的P450-BM3变体引入了这些突变体表明:有选择地使(+)-α-檀香萜的顺式末端碳羟基化,从而生产嗅觉明显的化合物(Z)-α-檀香醇(图10)。
使用P450-BM3双突变体的(Z)-α-檀香醇、(Z)-β-檀香醇、 (Z)-α-反-香柠檬醇和(Z)-表-β-檀香醇的体内生产
测试在α-檀香萜筛查中确定的一种P450-BM3变体(变体#17;表2)的氧化由(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反-香柠檬烯和(+)- 表-β-檀香萜构成的倍半萜烃的檀香油状混合物的能力。为此,将实施例4所述的FPP-高产细菌菌株用包含编码檀香树(Santalumalbum)(+)-α-檀香萜/(-)-β-檀香萜合酶(WO2010067309)(SEQ ID No 21和22)的密码子优化cDNA的重组pETDuet-1表达载体转化到第一MCS中,并且将P450-BM3变体#17转化到第二MCS中。细胞生长、引发条件、培养物提取和产物分析基本上如实施例11 中记载的那样进行。
如图11所示,可通过P450-BM3双突变体将(+)-α-檀香萜、 (-)-β-檀香萜、(-)-α-反-香柠檬烯和(+)-表-β-檀香萜高效地氧化,从而生产(Z)-α-檀香醇、(Z)-β-檀香醇、(Z)-α-反-香柠檬醇和(Z)-表-β- 檀香醇。值得一提的是,在该实验条件下,只能检测到倍半萜醇的所期望的顺式异构体。这些数据显示出巨大芽孢杆菌(Bacillus megaterium)CYP102A1(P450-BM3)可高效地改造以选择性地使 (+)-α-檀香萜、(-)-β-檀香萜以及其它结构相关的萜烯(比如香柠檬烯倍半萜)的顺式末端碳羟基化,从而生产出发现于檀香油中的关键的倍半萜醇。
从B&T World Seeds(Aigues-Vives,France)和Sandeman Seeds (Lalongue,France)获得檀香树的种子。通过2.5%的次氯酸(HCIO)对种子进行120分钟的第一次表面灭菌,并且在灭菌超纯水中洗涤三次。然后将种子去壳并置于添加了15克/升的蔗糖和7.8g/L 的琼脂、pH值5.7的MS基础培养基上(Murashige&Skoog,1962, PhysiologiaPlantarum 15,473-497)。在9~18天后,通常观察到大约40%的发芽率。在发芽后5~10周,将从无菌发芽的种子中获得的檀香幼苗转移到土壤中。由于檀香品种是根半寄生物,因此土壤适于近距离接触6个月~1岁龄的柑桔(甜橙Citrus sinensis)植物。收获檀香植物的根,并且将其转移至土壤中2-3年后,从宿主植物的根处分离。这些根的提取物的GC-MS分析显示出檀香油所特有的倍半萜的存在。采用Concert
采用Illumina总RNA测序技术和Illumina HiSeq 2000测序仪来对整体转录物进行测序。产生出108.7百万个配对read 2×100 bp。采用CLC-生物基因组工作平台的DeNovo整合应用程序 (CLCBo,Denmark)来整合测定序列。总共将平均长度683bp的 82’479个contig整合。采用tBlastn algorithm来检索这些contig (Altschul et al,J.Mol.Biol.215,403-410,1990),并且用例如CYP71AV1 序列(NCBI登录号ABB82944.1)的已知P450氨基酸序列作为查询序列。本方式允许识别编码有特征性细胞色素P450基序的蛋白质的数个contig。某个选定的contig SCH37-Ct816(SED ID NO 69) 包含编码500个氨基酸的蛋白SaCP816(SEQ ID NO 71)的1503bp 长度的开放阅读框架(ORF)(SEQ ID NO 70)。该氨基酸显示出与已知的细胞色素P450序列(最为接近的源自欧亚种葡萄(Vitisvinifera) 的P450,CYP71D 10(NCBI登录号AAB94588.1))具有同源性,具有62%的氨基酸序列同一性。
作为通过SCH37-Ct816编码的蛋白质的功能特性,该蛋白质在大肠杆菌细胞中异源表达。对ORF序列进行修饰来提高在大肠杆菌内的表达:用编码MALLLAVFWSALIILV肽的密码子来代替前17个密码子,并且对整个ORF序列的密码子使用进行修饰以便匹配大肠杆菌密码子使用。用于编码经修饰的SaCP816(SEQ ID NO 73)的本cDNA SaCP120293(SEQ ID NO72)在体外合成 (DNA2.0)并克隆到pJExpress404质粒(DNA2.0)中。如实施例2所述那样进行异源表达。
双顺反子操纵子被设计为在独特的启动子的控制下从单一的质粒中表达P450酶和CPR。优化的SaCP120293cDNA与CPRm cDNA(SEQ ID No 9,实施例3)结合以便制备顺序包含P450 cDNA、包括核糖体结合位点(RBS)的接头序列和CPRm cDNA的双顺反子结构(SEQ IDNO 74)。该结构是通过用PCR分别扩增 P450和CPR cDNAs来制备的,并具有5’和3’突出端,其适于在 pCWori+质粒(Barnes H.J(1996)Method Enzymol.272,3-14)的 NdeI-HindIII位点中采用In-
JM109大肠杆菌细胞用SaCP816-CPRm-pCWori表达质粒进行转化。经转化的细胞进行生长,并且如实施例2所述那样制备包含重组蛋白的无细胞提取物。该蛋白组分用于倍半萜分子的酶促转化的评价(实施例16)。
如实施例4所述那样制备在生物转化实验中用作底物的不同倍半萜烃。
将提取自表达重组SaCP816和CPRm蛋白(实施例15)的大肠杆菌细胞的粗蛋白用于这些倍半萜分子的体外氧化。在包含20~50 微升蛋白质提取物、500微M NADPH(还原了的烟酰胺腺嘌呤二核苷酸磷酸)、5微M FAD(黄素腺嘌呤二核苷酸)、5微M FMN (黄素单核苷酸)和300微M倍半萜(即(α)-檀香萜或(+)-α-檀香萜、 (-)-β-檀香萜、(-)-α-反-香柠檬烯和(+)-表-β-檀香萜的混合物)的1 mL的100mM Tris-HCL pH 7.4缓冲液中进行该实验。在聚四氟乙烯密封玻璃管中伴随温和搅拌温育2个小时后,在冰上停止反应,并通过1体积MTBE(甲基叔丁基醚,Sigma)来萃取。如实施例4所述那样通过GCMS来分析提取物。
在这些条件下,观察到了(+)-α-檀香萜、(-)-β-檀香萜、(-)-α- 反-香柠檬烯和(+)-表-β-檀香萜的氧化。图12表示通过SaCP816 对(+)-α-檀香萜进行氧化来提供(Z)-α-檀香醇的情况。图13表示通过SaCP816来氧化(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反-香柠檬烯和(+)-表-β-檀香萜生成(Z)-α-檀香醇、(Z)-β-檀香醇、(Z)-α-反-香柠檬醇和(Z)-表-β-檀香醇的情况。在所有实验中,均没有观测到可检测量的倍半萜醇的相应反式异构体(每种倍半萜醇的反式和顺式异构体在用于这些实验的色谱条件下均很容易被分离)。
本实验显示出,从檀香树(Santalum album)分离的细胞色素 P450酶SaCP816可用于选择性羟基化(+)-α-檀香萜、(-)-β-檀香萜和相似倍半萜结构的顺式末端碳。
(+)-α-檀香萜以及(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反-香柠檬烯、(+)-表-β-檀香萜或其它相似结构的分子的氧化产物均可直接产自被改造成从比如葡萄糖或甘油的碳源来生产倍半萜的大肠杆菌细胞。制备由包含合成操纵子的pCWori+质粒构成的质粒,该合成操纵子由SaCP120293cDNA(SEQ ID No 72)、CPRm cDNA (SEQ ID No 9)、萜类合成酶编码cDNA构成。作为萜类合成酶,使用黄皮(Clausena lansium)(+)-α-檀香萜合酶cDNA(ClASS) (WO2009109597)或用于编码檀香树(Santalum album)(+)-α-檀香萜 /(-)-β-檀香萜合酶(SaSAS)(WO2010067309)的cDNA。
通过与如实施例6所述的工序相似的工序来构建出两种质粒。如实施例6那样扩增密码子优化的(+)-α-檀香萜合酶cDNA(SEQ ID NO 19)和(+)-α-檀香萜/(-)-β-檀香萜合酶cDNA(SEQ ID NO 21),并采用In-
将这两种质粒中任一种与带来完整的甲羟戊酸途径的质粒 pACYC-29258-4506共转化到大肠杆菌XRX细胞(Promega)中进行这些操纵子的性能评价(实施例4)。从羧苄青霉素(50μg/ml)和氯霉素(34μg/ml)的LB-琼脂糖平板上选出经转化的细胞。使用单个菌落接种到补充有合适的抗生素的5毫升LB培养基中。将培养物以250rpm在37℃下温育过夜。第二天,将在玻璃培养管中包含 100μg/L羧苄青霉素和17μg/l氯霉素的2mL TB培养基接种到200μl LB预培养物,并且以250rpm并在37℃下温育。在6个小时的培养后(或当培养物在600nm的光密度达到值3时),将培养物冷却至20℃,并且通过添加0.1mM IPTG(异丙基β-D-1-硫代半乳糖苷)、δ-氨基乙酰丙酸(Sigma)和2%(v/v)的癸烷来引起蛋白质的表达。在伴随250rpm的摇动的温育48小时后,如实施例4所述,通过1体积的MTBE来提取整个培养液并通过GCMS 进行分析。
在体外实验中,也发现了所有所得的菌株生产出倍半萜烃和相应的含氧产物(图14)。该实验显示出使用表达SaCP816的工程细胞,可生产出倍半萜(Z)-α-檀香醇、(Z)-β-檀香醇和其它相似结构的分子。
如实施例13所述,在转录自檀香树(Santalum album)根中确认多种P450-编码contig序列。除SCH37-Ct816以外,选出另一 contig序列:SCH37-
作为通过SCH37-Ct10374编码的蛋白质的功能特性,该蛋白质在大肠杆菌细胞中异源表达。对ORF序列进行修饰来提高在大肠杆菌内的表达:用编码MALLLAVFWSALII肽的密码子来代替前18个密码子,并且对整个ORF序列的密码子使用进行修饰以便匹配大肠杆菌密码子使用。用于编码经修饰的SaCP10374(SEQ ID NO 81)的新型cDNA SaCP120292(SEQID NO 80)在体外合成 (DNA2.0)并克隆到pJExpress404质粒(DNA2.0)中。
如实施例2所述那样进行异源表达。按照此步骤,测定新型重组檀香树(S.abum)P450在450nm处具有最大吸光度的典型的CO 光谱,验证了正确折叠形成有功能P450酶。
为了重新构建该P450酶的活性,共表达P450还原酶。为此目的,以如实施例15所述的类似方式设计双顺反子操纵子,以便在独特的启动子的控制下从单个质粒中表达SaCP10374和CPRm (薄荷P450还原酶)。优化的SaCP12092 cDNA与CPRm cDNA 结合以便制备顺序包含P450 cDNA、包括核糖体结合位点(RBS) 的接头序列和CPRm cDNA的双顺反子结构(SEQ ID NO 82)。该结构是通过如实施例15所述的PCR制备的。并且将其克隆到 pCWori+质粒(Barnes H.J(1996)Method Enzymol.272,3-14)中以提供质粒SaCP10374-CPRm-pCWori。
JM109大肠杆菌细胞用这些双顺反子表达质粒进行转化。经转化的细胞进行生长,并且如实施例2所述那样制备包含重组蛋白的无细胞提取物。这些膜蛋白组分用于倍半萜分子的酶促转化的评价(实施例21)。
如实施例4所述那样制备在生物转化实验中用作底物的不同的倍半萜烃(即(α)-檀香萜或(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反- 香柠檬烯和(+)-表-β-檀香萜的混合物)。
将提取自表达重组SaCP10374和CPRm蛋白(实施例20)的大肠杆菌细胞的粗蛋白用于倍半萜分子的体外氧化,如实施例16所述那样进行实验。在聚四氟乙烯密封玻璃管中伴随温和搅拌而温育2个小时后,在冰上停止反应并通过1体积MTBE(甲基叔丁基醚,Sigma)来萃取。如实施例4所述那样通过GCMS来分析提取物。
在这些条件下,观察到了通过SaCP10374进行的(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反-香柠檬烯和(+)-表-β-檀香萜的氧化。图 15、16表示通过SaCP10374对(+)-α-檀香萜、(-)-β-檀香萜、(-)-α- 反-香柠檬烯和(+)-表-β-檀香萜进行氧化来提供(E)-α-檀香醇、 (E)-β-檀香醇、(E)-α-反-香柠檬醇和(E)-表-β-檀香醇的情况。在所有实验中,均没有观察到可检测量的倍半萜醇的相应的反式异构体(每种倍半萜醇的反式和顺式异构体在用于这些实验的色谱条件下均很容易被分离)。
本实验显示出,从檀香树(Santalum album)分离的细胞色素 P450酶SaCP10374可用于选择性羟基化(+)-α-檀香萜、(-)-β-檀香萜和相似倍半萜结构的顺式末端碳。
采用如实施例4的方法,制备出类似于檀香萜的多种倍半萜烃。采用包含编码檀香树(Santalum album)(-)-倍半香桧烯B合酶 (NCBI登录号ADP37190.1)SaTps647的cDNA或编码檀香树 (Santalum album)(-)-β-甜没药烯合酶(NCBI登录号ADP37189.1) SaTps30的cDNA中任一种的pETDuet表达质粒与如实施例4所述的pACYC-29258-4506质粒的结合,来生产(-)-倍半香桧烯B和 (-)-β-甜没药烯。从Bedoukian(Dambury,Ct,USA)获得β-法呢烯,从Treatt(Suffolk,UK)获得α-法呢烯,并且从柑橘油中提纯出(-)-α- 反-香柠檬烯。
将提取自表达重组SaCP816或SaCP10374的大肠杆菌细胞的粗蛋白连同CPRm蛋白(实施例15和20)用于倍半萜分子的体外氧化。如实施例16所述那样进行通过GCMS分析的实验和产物确定。
在这些条件下,观察到了(E)-β-法呢烯、(E)-α-法呢烯、(-)-倍半香桧烯B、(-)-β-甜没药烯和(-)-α-反-香柠檬烯的氧化(图17~21)。对于所有这些化合物,檀香树(S.album)P450s对于终端偕二甲基基团(图27中的R1或R2)的两个碳原子中的其中一个具有区域选择性。SaCP816催化在相对于末端双键的顺式位置(图27的R1)的甲基碳原子的选择性氧化,然而SaCP10374仅在相对于末端双键的反式位置(图27中的R2)的甲基基团的碳原子处催化相同底物的氧化。各倍半萜醇的反式和顺式异构体在用于这些实验的色谱条件下均很容易被分离。因大肠杆菌内源乙醇脱氢酶活性,当反式甲基氧化时,那么会形成相应的醛。
这些实验显示出,分离自檀香树(Santalum album)的细胞色素 P450酶、SaCP816和SaCP10374可分别用于选择性羟基化具有类似于β-法呢烯、α-法呢烯、(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反- 香柠檬烯、(-)-倍半香桧烯B或(-)-β-甜没药烯的结构的各种倍半萜分子的顺式末端和反式末端的碳。
如实施例21和22所述的氧化的倍半萜分子也可使用被改造成从比如葡萄糖或甘油的碳源生产倍半萜的大肠杆菌细胞中直接生产出。制备出由包含合成操纵子的pCWori+质粒构成的质粒,该合成操纵子由SaCP120293cDNA(SEQ ID No 72)或SaCP120292 (SEQ IDNo 80)、CPRm cDNA(SEQ ID No 9)、萜类合成酶编码 cDNA(编码黄花蒿(Artemisia annua)β-法呢烯合酶cDNA(NCBI登录号AAX39387.1.1)、云杉(Picea abies)α-法呢烯合酶(NCBI登录号AAS47697.1)、檀香树(S.album)(-)-倍半香桧烯B(NCBI登录号ADP37190.1)、檀香树(S.album)(-)-β-甜没药烯合酶(NCBI登录号ADP37189.1)、黄皮(Clausena lansium)α-檀香萜合酶(NCBI登录号ADR71055.1)或檀香树(S.album)α-/β-檀香萜合酶(NCBI登录号ADP30867.1))构成。
带有合成操纵子的不同组合的质粒按照如下工序来制备。分别通过(E)-β-法呢烯合酶和(E)-α-法呢烯合酶cDNAs以PCR方式扩增质粒pD444-SR-AaBFS(包含编码黄花蒿(Artemisia annua) (E)-β-法呢烯合酶(NCBI登录号AAX39387.1)AaBFS的优化的 cDNA)、质粒pD444-SR-PaAFS(包含编码云杉(Picea abies)(E)-α- 法呢烯合酶(NCBI登录号AAS47697.1)PaAFS的优化的cDNA)。将质粒pETDuet-SaTps647和pETDuet-SaTps30(实施例22)作为模板,并且分别通过倍半香桧烯B合酶和甜没药烯合酶cDNAs以 PCR的方式扩增。对于每个结构,设计引物用于采用In-
将PCR产物连接到采用HindIII restriction酶消解的质粒 SaCP816-CPRm-pCWori(SEQ ID No 74)或 SaCP10374-CPRm-pCWOri(SEQ ID NO 82)中,并且采用 In-
如实施例17所述那样采用上述质粒在大肠杆菌细胞中进行含氧倍半萜的体内生产。在体外实验中也可观察到,所有用这些质粒转化的重组细菌细胞生产出了所期望的倍半萜烃以及相应的含氧产物(图22~26)。
序列表
<110> 弗门尼舍有限公司
<120> 用于生产芳香醇的方法
<130> P219547WO
<150> US 61880149
<151> 2013-09-19
<160> 104
<170> PatentIn version 3.5
<210> 1
<211> 1500
<212> DNA
<213> 菊苣(Cichorium intybus)
<400> 1
atggagattt ctatccccac tacccttggc cttgccgtca tcatcttcat cattttcaag 60
ttgctaacgc gtaccacatc aaagaaaaac ctactcccag agccatggag actaccaata 120
atcggacaca tgcatcatct gataggtacg atgccacatc gtggtgtcat ggaactagcc 180
aggaagcatg gatctctcat gcatctacaa cttggagaag tgtccactat tgtggtctca 240
tccccacgtt gggcaaaaga ggttctgaca acgtacgata ttacgtttgc aaacagaccg 300
gagactttaa ccggtgagat tgttgcatat cacaataccg atattgtcct tgctccgtat 360
ggtgaatact ggaggcagtt gcgaaagctt tgcaccttgg agcttttaag caacaagaaa 420
gtgaagtcgt ttcagtccct tcgtgaggag gaatgttgga atctggttaa agacattcga 480
tcaactgggc agggatcccc aatcaatctt tcagaaaaca ttttcaagat gattgccacc 540
atacttagta gggcagcatt cggaaaggga atcaaagacc aaatgaaatt tacagaatta 600
gtaaaagaaa tactaaggct tacgggaggt tttgatgtgg cggacatctt tccttctaaa 660
aagttacttc accatctttc aggcaagaga gctaagttaa ccaacataca caataagctt 720
gacaatttga tcaacaatat catcgctgag caccctggaa accgtacaag ctcatcacag 780
gagactctac ttgatgttct gttaagactg aaagaaagcg cagagtttcc attgacagca 840
gacaatgtca aagcagtcat tttggatatg tttggagctg gcacggatac ttcgtcagcc 900
acaattgaat gggcaatctc agaattgata aggtgtccga gagccatgga gaaagttcaa 960
acagaattaa ggcaagcact aaatggaaag gaaaggatcc aagaagaaga tctacaggaa 1020
ctaaattacc taaagctagt gatcaaagaa acattgaggt tgcatccacc actaccgttg 1080
gttatgccta gagagtgtag ggagccatgt gtgttggggg gatacgatat acccagcaag 1140
acgaaactta ttgtcaacgt gtttgccata aacagggatc ctgaatactg gaaagatgct 1200
gaaactttca tgccagagag atttgaaaac agccccatca ctgtaatggg ttcagagtat 1260
gagtatctcc cgtttggtgc aggaagaaga atgtgtccag gcgctgccct tggtttagcc 1320
aacgtggaac ttcctcttgc tcatatactt tactacttca attggaagct cccaaatgga 1380
aaaacatttg aagacttgga catgactgag agctttggag ccactgtcca aagaaagacg 1440
gagttgttac tagttccaac ggatttccaa acacttacgg catctactta atgactcgag 1500
<210> 2
<211> 496
<212> PRT
<213> 菊苣(Cichorium intybus)
<400> 2
Met Glu Ile Ser Ile Pro Thr Thr Leu Gly Leu Ala Val Ile Ile Phe
1 5 10 15
Ile Ile Phe Lys Leu Leu Thr Arg Thr Thr Ser Lys Lys Asn Leu Leu
20 25 30
Pro Glu Pro Trp Arg Leu Pro Ile Ile Gly His Met His His Leu Ile
35 40 45
Gly Thr Met Pro His Arg Gly Val Met Glu Leu Ala Arg Lys His Gly
50 55 60
Ser Leu Met His Leu Gln Leu Gly Glu Val Ser Thr Ile Val Val Ser
65 70 75 80
Ser Pro Arg Trp Ala Lys Glu Val Leu Thr Thr Tyr Asp Ile Thr Phe
85 90 95
Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu Ile Val Ala Tyr His Asn
100 105 110
Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu Tyr Trp Arg Gln Leu Arg
115 120 125
Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn Lys Lys Val Lys Ser Phe
130 135 140
Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn Leu Val Lys Asp Ile Arg
145 150 155 160
Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu Ser Glu Asn Ile Phe Lys
165 170 175
Met Ile Ala Thr Ile Leu Ser Arg Ala Ala Phe Gly Lys Gly Ile Lys
180 185 190
Asp Gln Met Lys Phe Thr Glu Leu Val Lys Glu Ile Leu Arg Leu Thr
195 200 205
Gly Gly Phe Asp Val Ala Asp Ile Phe Pro Ser Lys Lys Leu Leu His
210 215 220
His Leu Ser Gly Lys Arg Ala Lys Leu Thr Asn Ile His Asn Lys Leu
225 230 235 240
Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu His Pro Gly Asn Arg Thr
245 250 255
Ser Ser Ser Gln Glu Thr Leu Leu Asp Val Leu Leu Arg Leu Lys Glu
260 265 270
Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn Val Lys Ala Val Ile Leu
275 280 285
Asp Met Phe Gly Ala Gly Thr Asp Thr Ser Ser Ala Thr Ile Glu Trp
290 295 300
Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg Ala Met Glu Lys Val Gln
305 310 315 320
Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys Glu Arg Ile Gln Glu Glu
325 330 335
Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu Val Ile Lys Glu Thr Leu
340 345 350
Arg Leu His Pro Pro Leu Pro Leu Val Met Pro Arg Glu Cys Arg Glu
355 360 365
Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro Ser Lys Thr Lys Leu Ile
370 375 380
Val Asn Val Phe Ala Ile Asn Arg Asp Pro Glu Tyr Trp Lys Asp Ala
385 390 395 400
Glu Thr Phe Met Pro Glu Arg Phe Glu Asn Ser Pro Ile Thr Val Met
405 410 415
Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly Ala Gly Arg Arg Met Cys
420 425 430
Pro Gly Ala Ala Leu Gly Leu Ala Asn Val Glu Leu Pro Leu Ala His
435 440 445
Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro Asn Gly Lys Thr Phe Glu
450 455 460
Asp Leu Asp Met Thr Glu Ser Phe Gly Ala Thr Val Gln Arg Lys Thr
465 470 475 480
Glu Leu Leu Leu Val Pro Thr Asp Phe Gln Thr Leu Thr Ala Ser Thr
485 490 495
<210> 3
<211> 1473
<212> DNA
<213> 人工序列
<220>
<223> CYP71AV8-65188 DNA 序列
<400> 3
atggcactct tactggcagt attctggtcc gccctgatca ttcttgtaac ccgcacgact 60
agcaaaaaga atctgttgcc ggagccatgg cgtctgccga ttatcggtca catgcaccat 120
ttgatcggca ccatgccgca tcgtggtgtt atggaactgg cccgtaagca tggcagcctg 180
atgcacctgc aactgggtga agtctctacg attgttgtca gcagcccgcg ttgggcgaaa 240
gaggtcttga ccacctatga tatcaccttc gccaatcgcc cggaaaccct gactggcgag 300
atcgtcgcat accacaacac ggatatcgtc ctggcgccgt atggtgagta ttggcgtcaa 360
ctgcgtaaac tgtgcacgct ggagctgctg agcaacaaga aagtgaagag cttccagagc 420
ctgcgcgaag aagagtgttg gaacctggtc aaggacatcc gcagcaccgg ccaaggtagc 480
ccaatcaatc tgtcggagaa cattttcaag atgattgcga cgattctgag ccgtgctgcg 540
ttcggtaagg gtattaagga tcaaatgaag tttaccgaac tggtgaaaga aatcctgcgt 600
ctgaccggcg gttttgatgt cgctgacatc ttccctagca agaagttgct gcaccacctg 660
agcggcaagc gtgcaaaact gaccaatatc cataacaagc tggataatct gatcaataac 720
atcatcgcag agcacccggg caaccgtacc tcgtcctccc aggaaacgct gctggacgtt 780
ctgctgcgcc tgaaagagtc tgcggagttt ccgctgaccg ccgacaacgt taaagcagtg 840
atcctggata tgttcggcgc tggtacggat accagcagcg cgacgatcga gtgggcgatt 900
agcgagctga ttcgctgccc tcgcgcgatg gagaaagtgc agacggaatt gcgtcaggca 960
ctgaatggca aagagcgtat tcaggaagag gatttgcagg agctgaatta tctgaagctg 1020
gtgattaaag aaaccctgcg cctgcatccg ccgttgccgc tggtgatgcc gcgtgagtgc 1080
cgtgaaccgt gtgttttggg cggttacgac attccgagca aaacgaagct gatcgttaat 1140
gttttcgcga ttaaccgtga cccggaatac tggaaagacg cggaaacgtt tatgccggag 1200
cgttttgaga atagcccgat taccgttatg ggttccgagt acgaatacct gccatttggt 1260
gctggtcgtc gtatgtgtcc tggtgcagcg ctgggtctgg ccaacgtgga actgccgctg 1320
gcgcacattc tgtactattt caactggaaa ctgccgaacg gcaagacctt cgaagatttg 1380
gacatgaccg agagctttgg tgccactgtg cagcgcaaaa ccgagctgct gctggttccg 1440
accgactttc aaacgctgac tgcgagcacc taa 1473
<210> 4
<211> 490
<212> PRT
<213> 人工序列
<220>
<223> CYP71AV8-65188 氨基酸序列
<400> 4
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val
1 5 10 15
Thr Arg Thr Thr Ser Lys Lys Asn Leu Leu Pro Glu Pro Trp Arg Leu
20 25 30
Pro Ile Ile Gly His Met His His Leu Ile Gly Thr Met Pro His Arg
35 40 45
Gly Val Met Glu Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln
50 55 60
Leu Gly Glu Val Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys
65 70 75 80
Glu Val Leu Thr Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr
85 90 95
Leu Thr Gly Glu Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala
100 105 110
Pro Tyr Gly Glu Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu
115 120 125
Leu Leu Ser Asn Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu
130 135 140
Glu Cys Trp Asn Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser
145 150 155 160
Pro Ile Asn Leu Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu
165 170 175
Ser Arg Ala Ala Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr
180 185 190
Glu Leu Val Lys Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala
195 200 205
Asp Ile Phe Pro Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg
210 215 220
Ala Lys Leu Thr Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn
225 230 235 240
Ile Ile Ala Glu His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr
245 250 255
Leu Leu Asp Val Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu
260 265 270
Thr Ala Asp Asn Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly
275 280 285
Thr Asp Thr Ser Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile
290 295 300
Arg Cys Pro Arg Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala
305 310 315 320
Leu Asn Gly Lys Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn
325 330 335
Tyr Leu Lys Leu Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Leu
340 345 350
Pro Leu Val Met Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly
355 360 365
Tyr Asp Ile Pro Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile
370 375 380
Asn Arg Asp Pro Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu
385 390 395 400
Arg Phe Glu Asn Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr
405 410 415
Leu Pro Phe Gly Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly
420 425 430
Leu Ala Asn Val Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn
435 440 445
Trp Lys Leu Pro Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu
450 455 460
Ser Phe Gly Ala Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro
465 470 475 480
Thr Asp Phe Gln Thr Leu Thr Ala Ser Thr
485 490
<210> 5
<211> 1509
<212> DNA
<213> 人工序列
<220>
<223> CYP71AV8-P2 DNA 序列
<400> 5
atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60
atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120
ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180
ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240
tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300
accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360
atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420
ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480
ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540
ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600
atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660
gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720
aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780
cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840
gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900
acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960
gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020
gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080
catccgccgt tgccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140
tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200
gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260
gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320
gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380
tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440
actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac gctgactgcg 1500
agcacctaa 1509
<210> 6
<211> 502
<212> PRT
<213> 人工序列
<220>
<223> CYP71AV8-P2 氨基酸序列
<400> 6
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val
1 5 10 15
Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys
20 25 30
Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly
35 40 45
His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu
50 55 60
Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val
65 70 75 80
Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr
85 90 95
Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu
100 105 110
Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu
115 120 125
Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn
130 135 140
Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn
145 150 155 160
Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu
165 170 175
Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala
180 185 190
Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys
195 200 205
Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro
210 215 220
Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr
225 230 235 240
Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu
245 250 255
His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val
260 265 270
Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn
275 280 285
Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser
290 295 300
Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg
305 310 315 320
Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys
325 330 335
Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu
340 345 350
Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Leu Pro Leu Val Met
355 360 365
Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro
370 375 380
Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro
385 390 395 400
Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn
405 410 415
Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly
420 425 430
Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val
435 440 445
Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro
450 455 460
Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala
465 470 475 480
Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln
485 490 495
Thr Leu Thr Ala Ser Thr
500
<210> 7
<211> 1509
<212> DNA
<213> 人工序列
<220>
<223> CYP71AV8-P2O DNA 序列
<400> 7
atggcactgt tgctggctgt cttttggtct gctctgatta ttttggtggt tacctacacc 60
atctccctgc tgattaacca gtggcgtaaa ccgaaaccac agggtaaatt cccgccgggt 120
ccgtggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180
ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240
tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300
accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360
atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420
ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480
ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540
ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600
atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660
gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720
aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780
cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840
gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900
acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960
gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020
gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080
catccgccgt tgccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140
tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200
gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260
gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320
gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380
tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440
actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac gctgactgcg 1500
agcacctaa 1509
<210> 8
<211> 502
<212> PRT
<213> 人工序列
<220>
<223> CYP71AV8-P2O 氨基酸序列
<400> 8
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val
1 5 10 15
Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys
20 25 30
Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly
35 40 45
His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu
50 55 60
Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val
65 70 75 80
Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr
85 90 95
Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu
100 105 110
Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu
115 120 125
Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn
130 135 140
Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn
145 150 155 160
Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu
165 170 175
Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala
180 185 190
Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys
195 200 205
Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro
210 215 220
Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr
225 230 235 240
Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu
245 250 255
His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val
260 265 270
Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn
275 280 285
Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser
290 295 300
Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg
305 310 315 320
Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys
325 330 335
Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu
340 345 350
Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Leu Pro Leu Val Met
355 360 365
Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro
370 375 380
Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro
385 390 395 400
Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn
405 410 415
Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly
420 425 430
Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val
435 440 445
Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro
450 455 460
Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala
465 470 475 480
Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln
485 490 495
Thr Leu Thr Ala Ser Thr
500
<210> 9
<211> 2133
<212> DNA
<213> 椒样薄荷(Mentha piperita)
<400> 9
atggaaccta gctctcagaa actgtctccg ttggaatttg ttgctgctat cctgaagggc 60
gactacagca gcggtcaggt tgaaggtggt ccaccgccag gtctggcagc tatgttgatg 120
gaaaataagg atttggtgat ggttctgacg acgtccgtgg cagtcctgat cggctgtgtc 180
gtggtcctgg catggcgtcg tgcggcaggt agcggtaagt acaagcaacc tgaactgcct 240
aaactggtgg tcccgaaagc agccgaaccg gaggaggcag aggatgataa aaccaagatc 300
agcgtgtttt tcggcaccca aaccggtacg gcagaaggtt tcgcgaaggc ttttgttgaa 360
gaggccaagg cgcgttatca gcaggcccgt ttcaaagtta tcgacctgga cgactatgcg 420
gcagacgatg acgagtacga agagaaactg aagaaggaaa acttggcatt cttcttcttg 480
gcgtcctacg gtgacggcga gccgacggac aacgcggcac gcttttacaa atggtttacg 540
gagggtaagg accgtggtga atggctgaac aatctgcagt acggcgtttt tggtctgggt 600
aaccgtcaat atgagcattt caataagatc gccattgtcg tcgatgatct gatcttcgag 660
caaggtggca agaagctggt tccggtgggt ctgggtgacg atgaccagtg cattgaggat 720
gattttgcgg cgtggcgtga actggtctgg ccggaactgg ataaactgct gcgtaacgaa 780
gacgacgcta ccgtggcaac cccgtacagc gccgctgtgc tgcaataccg cgtggttttc 840
cacgatcaca ttgacggcct gattagcgaa aacggtagcc cgaacggtca tgctaatggc 900
aataccgtgt acgatgcgca acacccgtgc cgtagcaacg tcgcggtcaa gaaggaattg 960
catactccgg cgagcgatcg cagctgcacc cacctggaat ttaacattag cggtaccggc 1020
ctgatgtacg agacgggtga ccacgtcggt gtgtattgcg agaacctgtt ggaaaccgtg 1080
gaggaggccg agaagttgtt gaacctgagc ccgcagacgt acttctccgt tcacaccgac 1140
aacgaggacg gtacgccgtt gagcggcagc agcctgccgc caccgtttcc gccgtgcacc 1200
ttgcgcacgg cattgaccaa atacgcagac ttgacttctg caccgaaaaa gtcggtgctg 1260
gtggcgctgg ccgagtacgc atctgaccag ggtgaagcgg atcgtttgcg tttcttggcg 1320
agcccgagcg gcaaagagga atatgcacag tacatcttgg caagccagcg cacgctgctg 1380
gaggtcatgg cggagttccc gtcggcgaaa ccgccgctgg gtgtcttttt cgcgggtgtc 1440
gctccgcgcc tgcagccgcg tttctattcc attagctcta gcccgaagat cgcaccgttc 1500
cgtattcacg tgacctgcgc cctggtttat gacaaatccc ctaccggtcg cgttcataag 1560
ggcatctgta gcacgtggat gaaaaatgcg gtcccgctgg aagaaagcaa cgattgttcc 1620
tgggctccga tcttcgtccg caacagcaac ttcaagctgc cgaccgaccc gaaggttccg 1680
attatcatga ttggtccggg taccggtctg gccccttttc gtggcttttt gcaagagcgc 1740
ttggcgttga aagagagcgg tgctgaattg ggtccggcga tcttgttctt tggttgccgt 1800
aaccgtaaaa tggactttat ttacgaggat gaactgaatg atttcgtcaa agcgggcgtt 1860
gtcagcgagc tgatcgtcgc ttttagccgc gaaggcccga tgaaagaata cgtgcaacac 1920
aaaatgagcc aacgtgcctc cgatgtgtgg aacatcatta gcgacggtgg ttatgtttat 1980
gtttgcggtg acgcgaaggg tatggctcgt gatgttcacc gtaccctgca taccatcgca 2040
caggagcaag gtagcatgtc cagctcggag gccgaaggta tggtcaaaaa cctgcaaacc 2100
accggtcgtt acctgcgtga tgtgtggtaa taa 2133
<210> 10
<211> 709
<212> PRT
<213> 椒样薄荷(Mentha piperita)
<400> 10
Met Glu Pro Ser Ser Gln Lys Leu Ser Pro Leu Glu Phe Val Ala Ala
1 5 10 15
Ile Leu Lys Gly Asp Tyr Ser Ser Gly Gln Val Glu Gly Gly Pro Pro
20 25 30
Pro Gly Leu Ala Ala Met Leu Met Glu Asn Lys Asp Leu Val Met Val
35 40 45
Leu Thr Thr Ser Val Ala Val Leu Ile Gly Cys Val Val Val Leu Ala
50 55 60
Trp Arg Arg Ala Ala Gly Ser Gly Lys Tyr Lys Gln Pro Glu Leu Pro
65 70 75 80
Lys Leu Val Val Pro Lys Ala Ala Glu Pro Glu Glu Ala Glu Asp Asp
85 90 95
Lys Thr Lys Ile Ser Val Phe Phe Gly Thr Gln Thr Gly Thr Ala Glu
100 105 110
Gly Phe Ala Lys Ala Phe Val Glu Glu Ala Lys Ala Arg Tyr Gln Gln
115 120 125
Ala Arg Phe Lys Val Ile Asp Leu Asp Asp Tyr Ala Ala Asp Asp Asp
130 135 140
Glu Tyr Glu Glu Lys Leu Lys Lys Glu Asn Leu Ala Phe Phe Phe Leu
145 150 155 160
Ala Ser Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg Phe Tyr
165 170 175
Lys Trp Phe Thr Glu Gly Lys Asp Arg Gly Glu Trp Leu Asn Asn Leu
180 185 190
Gln Tyr Gly Val Phe Gly Leu Gly Asn Arg Gln Tyr Glu His Phe Asn
195 200 205
Lys Ile Ala Ile Val Val Asp Asp Leu Ile Phe Glu Gln Gly Gly Lys
210 215 220
Lys Leu Val Pro Val Gly Leu Gly Asp Asp Asp Gln Cys Ile Glu Asp
225 230 235 240
Asp Phe Ala Ala Trp Arg Glu Leu Val Trp Pro Glu Leu Asp Lys Leu
245 250 255
Leu Arg Asn Glu Asp Asp Ala Thr Val Ala Thr Pro Tyr Ser Ala Ala
260 265 270
Val Leu Gln Tyr Arg Val Val Phe His Asp His Ile Asp Gly Leu Ile
275 280 285
Ser Glu Asn Gly Ser Pro Asn Gly His Ala Asn Gly Asn Thr Val Tyr
290 295 300
Asp Ala Gln His Pro Cys Arg Ser Asn Val Ala Val Lys Lys Glu Leu
305 310 315 320
His Thr Pro Ala Ser Asp Arg Ser Cys Thr His Leu Glu Phe Asn Ile
325 330 335
Ser Gly Thr Gly Leu Met Tyr Glu Thr Gly Asp His Val Gly Val Tyr
340 345 350
Cys Glu Asn Leu Leu Glu Thr Val Glu Glu Ala Glu Lys Leu Leu Asn
355 360 365
Leu Ser Pro Gln Thr Tyr Phe Ser Val His Thr Asp Asn Glu Asp Gly
370 375 380
Thr Pro Leu Ser Gly Ser Ser Leu Pro Pro Pro Phe Pro Pro Cys Thr
385 390 395 400
Leu Arg Thr Ala Leu Thr Lys Tyr Ala Asp Leu Thr Ser Ala Pro Lys
405 410 415
Lys Ser Val Leu Val Ala Leu Ala Glu Tyr Ala Ser Asp Gln Gly Glu
420 425 430
Ala Asp Arg Leu Arg Phe Leu Ala Ser Pro Ser Gly Lys Glu Glu Tyr
435 440 445
Ala Gln Tyr Ile Leu Ala Ser Gln Arg Thr Leu Leu Glu Val Met Ala
450 455 460
Glu Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala Gly Val
465 470 475 480
Ala Pro Arg Leu Gln Pro Arg Phe Tyr Ser Ile Ser Ser Ser Pro Lys
485 490 495
Ile Ala Pro Phe Arg Ile His Val Thr Cys Ala Leu Val Tyr Asp Lys
500 505 510
Ser Pro Thr Gly Arg Val His Lys Gly Ile Cys Ser Thr Trp Met Lys
515 520 525
Asn Ala Val Pro Leu Glu Glu Ser Asn Asp Cys Ser Trp Ala Pro Ile
530 535 540
Phe Val Arg Asn Ser Asn Phe Lys Leu Pro Thr Asp Pro Lys Val Pro
545 550 555 560
Ile Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe
565 570 575
Leu Gln Glu Arg Leu Ala Leu Lys Glu Ser Gly Ala Glu Leu Gly Pro
580 585 590
Ala Ile Leu Phe Phe Gly Cys Arg Asn Arg Lys Met Asp Phe Ile Tyr
595 600 605
Glu Asp Glu Leu Asn Asp Phe Val Lys Ala Gly Val Val Ser Glu Leu
610 615 620
Ile Val Ala Phe Ser Arg Glu Gly Pro Met Lys Glu Tyr Val Gln His
625 630 635 640
Lys Met Ser Gln Arg Ala Ser Asp Val Trp Asn Ile Ile Ser Asp Gly
645 650 655
Gly Tyr Val Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Arg Asp Val
660 665 670
His Arg Thr Leu His Thr Ile Ala Gln Glu Gln Gly Ser Met Ser Ser
675 680 685
Ser Glu Ala Glu Gly Met Val Lys Asn Leu Gln Thr Thr Gly Arg Tyr
690 695 700
Leu Arg Asp Val Trp
705
<210> 11
<211> 1992
<212> DNA
<213> 黄花蒿(Artemisia annua)
<400> 11
atggcactgg acaaactgga cctgtacgta atcatcacct tagtcgtcgc cgtggccgcg 60
tattttgcga aaaatcgccg ctcgtctagc gcagccaaga aagccgcgga gagcccggtt 120
attgtcgtcc cgaagaaggt tacggaggac gaagtggacg acggtcgtaa aaaggtcacg 180
gtgttcttcg gcacgcagac tggtaccgct gaaggtttcg cgaaggcgct ggttgaagaa 240
gcaaaggcgc gctatgaaaa ggcagtgttc aaggttatcg atctggacga ttacgccgca 300
gaggacgacg aatacgagga gaagttgaaa aaggagtccc tcgccttctt cttcctggcg 360
acgtacggcg atggtgagcc gaccgataac gcagctcgtt tctacaagtg gttcaccgag 420
ggtgaggaga agggtgagtg gctggataaa ctgcaatatg cggtctttgg tctgggcaac 480
cgccaatatg agcacttcaa taagatcgca aaggttgtgg atgagaaact ggtcgagcag 540
ggtgccaagc gcctggtgcc ggttggcatg ggtgatgacg atcagtgcat cgaggatgac 600
ttcaccgcct ggaaggagct ggtgtggccg gagctggacc aactgttgcg cgacgaagat 660
gacaccagcg ttgcgacgcc gtataccgcg gcagttggcg aatatcgtgt tgtttttcat 720
gataagccgg aaacctacga tcaggatcaa ctgaccaatg gtcatgctgt gcatgacgcg 780
cagcacccgt gcagaagcaa tgttgctgtt aagaaagaat tgcactctcc gctgtccgat 840
cgcagctgca cccacctgga atttgacatc agcaataccg gtttgagcta cgaaacgggc 900
gatcacgtcg gtgtgtatgt ggaaaatctg agcgaagttg tcgatgaggc tgagaagctg 960
atcggtttac caccgcacac ctacttcagc gtgcatactg acaatgagga tggcacccca 1020
ctgggcggtg ctagcctgcc accgcctttc ccgccttgca ccctgcgcaa agccctcgct 1080
agctacgctg atgtgctgag cagcccgaag aagagcgcac tgctggcact ggcagcacac 1140
gctaccgatt ccaccgaagc cgatcgcctg aagtttttcg ctagcccggc aggcaaggac 1200
gagtatgcgc agtggattgt cgcgagccac cgtagcctgc tggaagtgat ggaggcgttc 1260
ccgagcgcga agcctccgct cggcgtcttt ttcgcatcgg ttgcgcctcg cctgcaaccg 1320
cgttattact caatcagcag ctctccgaaa ttcgcgccga atcgtattca cgttacttgc 1380
gcgctggttt atgagcaaac tccgagcggt cgtgttcaca agggcgtttg ctctacctgg 1440
atgaaaaacg cggttcctat gacggagagc caagactgta gctgggctcc gatttatgtt 1500
cgcacgtcta actttcgcct gcctagcgac ccgaaggtgc cagtgattat gattggtccg 1560
ggtaccggtc tggcaccgtt ccgcggtttc ctgcaagaac gtctggcaca gaaagaagct 1620
ggtacggaat tgggcaccgc aattctgttc tttggttgtc gtaatcgtaa agtggacttt 1680
atctatgagg atgaactgaa caacttcgtg gaaaccggtg ccctgagcga attggtgacg 1740
gctttttctc gtgagggtgc gaccaaagaa tacgtgcagc acaagatgac gcagaaagca 1800
agcgacattt ggaatctgct gtccgaaggt gcgtacctgt atgtctgtgg cgacgcgaag 1860
ggcatggcaa aagacgttca ccgtaccctg cacaccattg tgcaggagca aggtagcctg 1920
gactcttcga aggcggaatt gtacgtcaaa aacctgcaaa tggccggtcg ttatctgcgt 1980
gacgtttggt aa 1992
<210> 12
<211> 663
<212> PRT
<213> 黄花蒿(Artemisia annua)
<400> 12
Met Ala Leu Asp Lys Leu Asp Leu Tyr Val Ile Ile Thr Leu Val Val
1 5 10 15
Ala Val Ala Ala Tyr Phe Ala Lys Asn Arg Arg Ser Ser Ser Ala Ala
20 25 30
Lys Lys Ala Ala Glu Ser Pro Val Ile Val Val Pro Lys Lys Val Thr
35 40 45
Glu Asp Glu Val Asp Asp Gly Arg Lys Lys Val Thr Val Phe Phe Gly
50 55 60
Thr Gln Thr Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Val Glu Glu
65 70 75 80
Ala Lys Ala Arg Tyr Glu Lys Ala Val Phe Lys Val Ile Asp Leu Asp
85 90 95
Asp Tyr Ala Ala Glu Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu
100 105 110
Ser Leu Ala Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr
115 120 125
Asp Asn Ala Ala Arg Phe Tyr Lys Trp Phe Thr Glu Gly Glu Glu Lys
130 135 140
Gly Glu Trp Leu Asp Lys Leu Gln Tyr Ala Val Phe Gly Leu Gly Asn
145 150 155 160
Arg Gln Tyr Glu His Phe Asn Lys Ile Ala Lys Val Val Asp Glu Lys
165 170 175
Leu Val Glu Gln Gly Ala Lys Arg Leu Val Pro Val Gly Met Gly Asp
180 185 190
Asp Asp Gln Cys Ile Glu Asp Asp Phe Thr Ala Trp Lys Glu Leu Val
195 200 205
Trp Pro Glu Leu Asp Gln Leu Leu Arg Asp Glu Asp Asp Thr Ser Val
210 215 220
Ala Thr Pro Tyr Thr Ala Ala Val Gly Glu Tyr Arg Val Val Phe His
225 230 235 240
Asp Lys Pro Glu Thr Tyr Asp Gln Asp Gln Leu Thr Asn Gly His Ala
245 250 255
Val His Asp Ala Gln His Pro Cys Arg Ser Asn Val Ala Val Lys Lys
260 265 270
Glu Leu His Ser Pro Leu Ser Asp Arg Ser Cys Thr His Leu Glu Phe
275 280 285
Asp Ile Ser Asn Thr Gly Leu Ser Tyr Glu Thr Gly Asp His Val Gly
290 295 300
Val Tyr Val Glu Asn Leu Ser Glu Val Val Asp Glu Ala Glu Lys Leu
305 310 315 320
Ile Gly Leu Pro Pro His Thr Tyr Phe Ser Val His Thr Asp Asn Glu
325 330 335
Asp Gly Thr Pro Leu Gly Gly Ala Ser Leu Pro Pro Pro Phe Pro Pro
340 345 350
Cys Thr Leu Arg Lys Ala Leu Ala Ser Tyr Ala Asp Val Leu Ser Ser
355 360 365
Pro Lys Lys Ser Ala Leu Leu Ala Leu Ala Ala His Ala Thr Asp Ser
370 375 380
Thr Glu Ala Asp Arg Leu Lys Phe Phe Ala Ser Pro Ala Gly Lys Asp
385 390 395 400
Glu Tyr Ala Gln Trp Ile Val Ala Ser His Arg Ser Leu Leu Glu Val
405 410 415
Met Glu Ala Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala
420 425 430
Ser Val Ala Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser
435 440 445
Pro Lys Phe Ala Pro Asn Arg Ile His Val Thr Cys Ala Leu Val Tyr
450 455 460
Glu Gln Thr Pro Ser Gly Arg Val His Lys Gly Val Cys Ser Thr Trp
465 470 475 480
Met Lys Asn Ala Val Pro Met Thr Glu Ser Gln Asp Cys Ser Trp Ala
485 490 495
Pro Ile Tyr Val Arg Thr Ser Asn Phe Arg Leu Pro Ser Asp Pro Lys
500 505 510
Val Pro Val Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg
515 520 525
Gly Phe Leu Gln Glu Arg Leu Ala Gln Lys Glu Ala Gly Thr Glu Leu
530 535 540
Gly Thr Ala Ile Leu Phe Phe Gly Cys Arg Asn Arg Lys Val Asp Phe
545 550 555 560
Ile Tyr Glu Asp Glu Leu Asn Asn Phe Val Glu Thr Gly Ala Leu Ser
565 570 575
Glu Leu Val Thr Ala Phe Ser Arg Glu Gly Ala Thr Lys Glu Tyr Val
580 585 590
Gln His Lys Met Thr Gln Lys Ala Ser Asp Ile Trp Asn Leu Leu Ser
595 600 605
Glu Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Lys
610 615 620
Asp Val His Arg Thr Leu His Thr Ile Val Gln Glu Gln Gly Ser Leu
625 630 635 640
Asp Ser Ser Lys Ala Glu Leu Tyr Val Lys Asn Leu Gln Met Ala Gly
645 650 655
Arg Tyr Leu Arg Asp Val Trp
660
<210> 13
<211> 3534
<212> DNA
<213> 人工序列
<220>
<223> pCWori-CYP71AV8-P2-aaCPR 插入DNA 序列
<400> 13
catatggctc tgttattagc agttttttgg tcggcgctta taatcctcgt agtaacctac 60
accatatccc tcctaatcaa ccaatggcga aaaccgaaac cccaagggaa gttccccccg 120
ggcccatggc gtctgccgat tatcggtcac atgcaccatt tgatcggcac catgccgcat 180
cgtggtgtta tggaactggc ccgtaagcat ggcagcctga tgcacctgca actgggtgaa 240
gtctctacga ttgttgtcag cagcccgcgt tgggcgaaag aggtcttgac cacctatgat 300
atcaccttcg ccaatcgccc ggaaaccctg actggcgaga tcgtcgcata ccacaacacg 360
gatatcgtcc tggcgccgta tggtgagtat tggcgtcaac tgcgtaaact gtgcacgctg 420
gagctgctga gcaacaagaa agtgaagagc ttccagagcc tgcgcgaaga agagtgttgg 480
aacctggtca aggacatccg cagcaccggc caaggtagcc caatcaatct gtcggagaac 540
attttcaaga tgattgcgac gattctgagc cgtgctgcgt tcggtaaggg tattaaggat 600
caaatgaagt ttaccgaact ggtgaaagaa atcctgcgtc tgaccggcgg ttttgatgtc 660
gctgacatct tccctagcaa gaagttgctg caccacctga gcggcaagcg tgcaaaactg 720
accaatatcc ataacaagct ggataatctg atcaataaca tcatcgcaga gcacccgggc 780
aaccgtacct cgtcctccca ggaaacgctg ctggacgttc tgctgcgcct gaaagagtct 840
gcggagtttc cgctgaccgc cgacaacgtt aaagcagtga tcctggatat gttcggcgct 900
ggtacggata ccagcagcgc gacgatcgag tgggcgatta gcgagctgat tcgctgccct 960
cgcgcgatgg agaaagtgca gacggaattg cgtcaggcac tgaatggcaa agagcgtatt 1020
caggaagagg atttgcagga gctgaattat ctgaagctgg tgattaaaga aaccctgcgc 1080
ctgcatccgc cgttgccgct ggtgatgccg cgtgagtgcc gtgaaccgtg tgttttgggc 1140
ggttacgaca ttccgagcaa aacgaagctg atcgttaatg ttttcgcgat taaccgtgac 1200
ccggaatact ggaaagacgc ggaaacgttt atgccggagc gttttgagaa tagcccgatt 1260
accgttatgg gttccgagta cgaatacctg ccatttggtg ctggtcgtcg tatgtgtcct 1320
ggtgcagcgc tgggtctggc caacgtggaa ctgccgctgg cgcacattct gtactatttc 1380
aactggaaac tgccgaacgg caagaccttc gaagatttgg acatgaccga gagctttggt 1440
gccactgtgc agcgcaaaac cgagctgctg ctggttccga ccgactttca aacgctgact 1500
gcgagcacct aatgagtcga cagaggaaga tataccatgg cactggacaa actggacctg 1560
tacgtaatca tcaccttagt cgtcgccgtg gccgcgtatt ttgcgaaaaa tcgccgctcg 1620
tctagcgcag ccaagaaagc cgcggagagc ccggttattg tcgtcccgaa gaaggttacg 1680
gaggacgaag tggacgacgg tcgtaaaaag gtcacggtgt tcttcggcac gcagactggt 1740
accgctgaag gtttcgcgaa ggcgctggtt gaagaagcaa aggcgcgcta tgaaaaggca 1800
gtgttcaagg ttatcgatct ggacgattac gccgcagagg acgacgaata cgaggagaag 1860
ttgaaaaagg agtccctcgc cttcttcttc ctggcgacgt acggcgatgg tgagccgacc 1920
gataacgcag ctcgtttcta caagtggttc accgagggtg aggagaaggg tgagtggctg 1980
gataaactgc aatatgcggt ctttggtctg ggcaaccgcc aatatgagca cttcaataag 2040
atcgcaaagg ttgtggatga gaaactggtc gagcagggtg ccaagcgcct ggtgccggtt 2100
ggcatgggtg atgacgatca gtgcatcgag gatgacttca ccgcctggaa ggagctggtg 2160
tggccggagc tggaccaact gttgcgcgac gaagatgaca ccagcgttgc gacgccgtat 2220
accgcggcag ttggcgaata tcgtgttgtt tttcatgata agccggaaac ctacgatcag 2280
gatcaactga ccaatggtca tgctgtgcat gacgcgcagc acccgtgcag aagcaatgtt 2340
gctgttaaga aagaattgca ctctccgctg tccgatcgca gctgcaccca cctggaattt 2400
gacatcagca ataccggttt gagctacgaa acgggcgatc acgtcggtgt gtatgtggaa 2460
aatctgagcg aagttgtcga tgaggctgag aagctgatcg gtttaccacc gcacacctac 2520
ttcagcgtgc atactgacaa tgaggatggc accccactgg gcggtgctag cctgccaccg 2580
cctttcccgc cttgcaccct gcgcaaagcc ctcgctagct acgctgatgt gctgagcagc 2640
ccgaagaaga gcgcactgct ggcactggca gcacacgcta ccgattccac cgaagccgat 2700
cgcctgaagt ttttcgctag cccggcaggc aaggacgagt atgcgcagtg gattgtcgcg 2760
agccaccgta gcctgctgga agtgatggag gcgttcccga gcgcgaagcc tccgctcggc 2820
gtctttttcg catcggttgc gcctcgcctg caaccgcgtt attactcaat cagcagctct 2880
ccgaaattcg cgccgaatcg tattcacgtt acttgcgcgc tggtttatga gcaaactccg 2940
agcggtcgtg ttcacaaggg cgtttgctct acctggatga aaaacgcggt tcctatgacg 3000
gagagccaag actgtagctg ggctccgatt tatgttcgca cgtctaactt tcgcctgcct 3060
agcgacccga aggtgccagt gattatgatt ggtccgggta ccggtctggc accgttccgc 3120
ggtttcctgc aagaacgtct ggcacagaaa gaagctggta cggaattggg caccgcaatt 3180
ctgttctttg gttgtcgtaa tcgtaaagtg gactttatct atgaggatga actgaacaac 3240
ttcgtggaaa ccggtgccct gagcgaattg gtgacggctt tttctcgtga gggtgcgacc 3300
aaagaatacg tgcagcacaa gatgacgcag aaagcaagcg acatttggaa tctgctgtcc 3360
gaaggtgcgt acctgtatgt ctgtggcgac gcgaagggca tggcaaaaga cgttcaccgt 3420
accctgcaca ccattgtgca ggagcaaggt agcctggact cttcgaaggc ggaattgtac 3480
gtcaaaaacc tgcaaatggc cggtcgttat ctgcgtgacg tttggtaaaa gctt 3534
<210> 14
<211> 3534
<212> DNA
<213> 人工序列
<220>
<223> pCWori-CYP71AV8-P2O-aaCPR 插入DNA 序列
<400> 14
catatggcac tgttgctggc tgtcttttgg tctgctctga ttattttggt ggttacctac 60
accatctccc tgctgattaa ccagtggcgt aaaccgaaac cacagggtaa attcccgccg 120
ggtccgtggc gtctgccgat tatcggtcac atgcaccatt tgatcggcac catgccgcat 180
cgtggtgtta tggaactggc ccgtaagcat ggcagcctga tgcacctgca actgggtgaa 240
gtctctacga ttgttgtcag cagcccgcgt tgggcgaaag aggtcttgac cacctatgat 300
atcaccttcg ccaatcgccc ggaaaccctg actggcgaga tcgtcgcata ccacaacacg 360
gatatcgtcc tggcgccgta tggtgagtat tggcgtcaac tgcgtaaact gtgcacgctg 420
gagctgctga gcaacaagaa agtgaagagc ttccagagcc tgcgcgaaga agagtgttgg 480
aacctggtca aggacatccg cagcaccggc caaggtagcc caatcaatct gtcggagaac 540
attttcaaga tgattgcgac gattctgagc cgtgctgcgt tcggtaaggg tattaaggat 600
caaatgaagt ttaccgaact ggtgaaagaa atcctgcgtc tgaccggcgg ttttgatgtc 660
gctgacatct tccctagcaa gaagttgctg caccacctga gcggcaagcg tgcaaaactg 720
accaatatcc ataacaagct ggataatctg atcaataaca tcatcgcaga gcacccgggc 780
aaccgtacct cgtcctccca ggaaacgctg ctggacgttc tgctgcgcct gaaagagtct 840
gcggagtttc cgctgaccgc cgacaacgtt aaagcagtga tcctggatat gttcggcgct 900
ggtacggata ccagcagcgc gacgatcgag tgggcgatta gcgagctgat tcgctgccct 960
cgcgcgatgg agaaagtgca gacggaattg cgtcaggcac tgaatggcaa agagcgtatt 1020
caggaagagg atttgcagga gctgaattat ctgaagctgg tgattaaaga aaccctgcgc 1080
ctgcatccgc cgttgccgct ggtgatgccg cgtgagtgcc gtgaaccgtg tgttttgggc 1140
ggttacgaca ttccgagcaa aacgaagctg atcgttaatg ttttcgcgat taaccgtgac 1200
ccggaatact ggaaagacgc ggaaacgttt atgccggagc gttttgagaa tagcccgatt 1260
accgttatgg gttccgagta cgaatacctg ccatttggtg ctggtcgtcg tatgtgtcct 1320
ggtgcagcgc tgggtctggc caacgtggaa ctgccgctgg cgcacattct gtactatttc 1380
aactggaaac tgccgaacgg caagaccttc gaagatttgg acatgaccga gagctttggt 1440
gccactgtgc agcgcaaaac cgagctgctg ctggttccga ccgactttca aacgctgact 1500
gcgagcacct aatgagtcga cagaggaaga tataccatgg cactggacaa actggacctg 1560
tacgtaatca tcaccttagt cgtcgccgtg gccgcgtatt ttgcgaaaaa tcgccgctcg 1620
tctagcgcag ccaagaaagc cgcggagagc ccggttattg tcgtcccgaa gaaggttacg 1680
gaggacgaag tggacgacgg tcgtaaaaag gtcacggtgt tcttcggcac gcagactggt 1740
accgctgaag gtttcgcgaa ggcgctggtt gaagaagcaa aggcgcgcta tgaaaaggca 1800
gtgttcaagg ttatcgatct ggacgattac gccgcagagg acgacgaata cgaggagaag 1860
ttgaaaaagg agtccctcgc cttcttcttc ctggcgacgt acggcgatgg tgagccgacc 1920
gataacgcag ctcgtttcta caagtggttc accgagggtg aggagaaggg tgagtggctg 1980
gataaactgc aatatgcggt ctttggtctg ggcaaccgcc aatatgagca cttcaataag 2040
atcgcaaagg ttgtggatga gaaactggtc gagcagggtg ccaagcgcct ggtgccggtt 2100
ggcatgggtg atgacgatca gtgcatcgag gatgacttca ccgcctggaa ggagctggtg 2160
tggccggagc tggaccaact gttgcgcgac gaagatgaca ccagcgttgc gacgccgtat 2220
accgcggcag ttggcgaata tcgtgttgtt tttcatgata agccggaaac ctacgatcag 2280
gatcaactga ccaatggtca tgctgtgcat gacgcgcagc acccgtgcag aagcaatgtt 2340
gctgttaaga aagaattgca ctctccgctg tccgatcgca gctgcaccca cctggaattt 2400
gacatcagca ataccggttt gagctacgaa acgggcgatc acgtcggtgt gtatgtggaa 2460
aatctgagcg aagttgtcga tgaggctgag aagctgatcg gtttaccacc gcacacctac 2520
ttcagcgtgc atactgacaa tgaggatggc accccactgg gcggtgctag cctgccaccg 2580
cctttcccgc cttgcaccct gcgcaaagcc ctcgctagct acgctgatgt gctgagcagc 2640
ccgaagaaga gcgcactgct ggcactggca gcacacgcta ccgattccac cgaagccgat 2700
cgcctgaagt ttttcgctag cccggcaggc aaggacgagt atgcgcagtg gattgtcgcg 2760
agccaccgta gcctgctgga agtgatggag gcgttcccga gcgcgaagcc tccgctcggc 2820
gtctttttcg catcggttgc gcctcgcctg caaccgcgtt attactcaat cagcagctct 2880
ccgaaattcg cgccgaatcg tattcacgtt acttgcgcgc tggtttatga gcaaactccg 2940
agcggtcgtg ttcacaaggg cgtttgctct acctggatga aaaacgcggt tcctatgacg 3000
gagagccaag actgtagctg ggctccgatt tatgttcgca cgtctaactt tcgcctgcct 3060
agcgacccga aggtgccagt gattatgatt ggtccgggta ccggtctggc accgttccgc 3120
ggtttcctgc aagaacgtct ggcacagaaa gaagctggta cggaattggg caccgcaatt 3180
ctgttctttg gttgtcgtaa tcgtaaagtg gactttatct atgaggatga actgaacaac 3240
ttcgtggaaa ccggtgccct gagcgaattg gtgacggctt tttctcgtga gggtgcgacc 3300
aaagaatacg tgcagcacaa gatgacgcag aaagcaagcg acatttggaa tctgctgtcc 3360
gaaggtgcgt acctgtatgt ctgtggcgac gcgaagggca tggcaaaaga cgttcaccgt 3420
accctgcaca ccattgtgca ggagcaaggt agcctggact cttcgaaggc ggaattgtac 3480
gtcaaaaacc tgcaaatggc cggtcgttat ctgcgtgacg tttggtaaaa gctt 3534
<210> 15
<211> 3684
<212> DNA
<213> 人工序列
<220>
<223> pCWori-CYP71AV8-P2-CPRm 插入DNA 序列
<400> 15
catatggctc tgttattagc agttttttgg tcggcgctta taatcctcgt agtaacctac 60
accatatccc tcctaatcaa ccaatggcga aaaccgaaac cccaagggaa gttccccccg 120
ggcccatggc gtctgccgat tatcggtcac atgcaccatt tgatcggcac catgccgcat 180
cgtggtgtta tggaactggc ccgtaagcat ggcagcctga tgcacctgca actgggtgaa 240
gtctctacga ttgttgtcag cagcccgcgt tgggcgaaag aggtcttgac cacctatgat 300
atcaccttcg ccaatcgccc ggaaaccctg actggcgaga tcgtcgcata ccacaacacg 360
gatatcgtcc tggcgccgta tggtgagtat tggcgtcaac tgcgtaaact gtgcacgctg 420
gagctgctga gcaacaagaa agtgaagagc ttccagagcc tgcgcgaaga agagtgttgg 480
aacctggtca aggacatccg cagcaccggc caaggtagcc caatcaatct gtcggagaac 540
attttcaaga tgattgcgac gattctgagc cgtgctgcgt tcggtaaggg tattaaggat 600
caaatgaagt ttaccgaact ggtgaaagaa atcctgcgtc tgaccggcgg ttttgatgtc 660
gctgacatct tccctagcaa gaagttgctg caccacctga gcggcaagcg tgcaaaactg 720
accaatatcc ataacaagct ggataatctg atcaataaca tcatcgcaga gcacccgggc 780
aaccgtacct cgtcctccca ggaaacgctg ctggacgttc tgctgcgcct gaaagagtct 840
gcggagtttc cgctgaccgc cgacaacgtt aaagcagtga tcctggatat gttcggcgct 900
ggtacggata ccagcagcgc gacgatcgag tgggcgatta gcgagctgat tcgctgccct 960
cgcgcgatgg agaaagtgca gacggaattg cgtcaggcac tgaatggcaa agagcgtatt 1020
caggaagagg atttgcagga gctgaattat ctgaagctgg tgattaaaga aaccctgcgc 1080
ctgcatccgc cgttgccgct ggtgatgccg cgtgagtgcc gtgaaccgtg tgttttgggc 1140
ggttacgaca ttccgagcaa aacgaagctg atcgttaatg ttttcgcgat taaccgtgac 1200
ccggaatact ggaaagacgc ggaaacgttt atgccggagc gttttgagaa tagcccgatt 1260
accgttatgg gttccgagta cgaatacctg ccatttggtg ctggtcgtcg tatgtgtcct 1320
ggtgcagcgc tgggtctggc caacgtggaa ctgccgctgg cgcacattct gtactatttc 1380
aactggaaac tgccgaacgg caagaccttc gaagatttgg acatgaccga gagctttggt 1440
gccactgtgc agcgcaaaac cgagctgctg ctggttccga ccgactttca aaccctgact 1500
gcgagcacct aatgagtcga ctaactttaa gaaggagata tatccatgga acctagctct 1560
cagaaactgt ctccgttgga atttgttgct gctatcctga agggcgacta cagcagcggt 1620
caggttgaag gtggtccacc gccaggtctg gcagctatgt tgatggaaaa taaggatttg 1680
gtgatggttc tgacgacgtc cgtggcagtc ctgatcggct gtgtcgtggt cctggcatgg 1740
cgtcgtgcgg caggtagcgg taagtacaag caacctgaac tgcctaaact ggtggtcccg 1800
aaagcagccg aaccggagga ggcagaggat gataaaacca agatcagcgt gtttttcggc 1860
acccaaaccg gtacggcaga aggtttcgcg aaggcttttg ttgaagaggc caaggcgcgt 1920
tatcagcagg cccgtttcaa agttatcgac ctggacgact atgcggcaga cgatgacgag 1980
tacgaagaga aactgaagaa ggaaaacttg gcattcttct tcttggcgtc ctacggtgac 2040
ggcgagccga cggacaacgc ggcacgcttt tacaaatggt ttacggaggg taaggaccgt 2100
ggtgaatggc tgaacaatct gcagtacggc gtttttggtc tgggtaaccg tcaatatgag 2160
catttcaata agatcgccat tgtcgtcgat gatctgatct tcgagcaagg tggcaagaag 2220
ctggttccgg tgggtctggg tgacgatgac cagtgcattg aggatgattt tgcggcgtgg 2280
cgtgaactgg tctggccgga actggataaa ctgctgcgta acgaagacga cgctaccgtg 2340
gcaaccccgt acagcgccgc tgtgctgcaa taccgcgtgg ttttccacga tcacattgac 2400
ggcctgatta gcgaaaacgg tagcccgaac ggtcatgcta atggcaatac cgtgtacgat 2460
gcgcaacacc cgtgccgtag caacgtcgcg gtcaagaagg aattgcatac tccggcgagc 2520
gatcgcagct gcacccacct ggaatttaac attagcggta ccggcctgat gtacgagacg 2580
ggtgaccacg tcggtgtgta ttgcgagaac ctgttggaaa ccgtggagga ggccgagaag 2640
ttgttgaacc tgagcccgca gacgtacttc tccgttcaca ccgacaacga ggacggtacg 2700
ccgttgagcg gcagcagcct gccgccaccg tttccgccgt gcaccttgcg cacggcattg 2760
accaaatacg cagacttgac ttctgcaccg aaaaagtcgg tgctggtggc gctggccgag 2820
tacgcatctg accagggtga agcggatcgt ttgcgtttct tggcgagccc gagcggcaaa 2880
gaggaatatg cacagtacat cttggcaagc cagcgcacgc tgctggaggt catggcggag 2940
ttcccgtcgg cgaaaccgcc gctgggtgtc tttttcgcgg gtgtcgctcc gcgcctgcag 3000
ccgcgtttct attccattag ctctagcccg aagatcgcac cgttccgtat tcacgtgacc 3060
tgcgccctgg tttatgacaa atcccctacc ggtcgcgttc ataagggcat ctgtagcacg 3120
tggatgaaaa atgcggtccc gctggaagaa agcaacgatt gttcctgggc tccgatcttc 3180
gtccgcaaca gcaacttcaa gctgccgacc gacccgaagg ttccgattat catgattggt 3240
ccgggtaccg gtctggcccc ttttcgtggc tttttgcaag agcgcttggc gttgaaagag 3300
agcggtgctg aattgggtcc ggcgatcttg ttctttggtt gccgtaaccg taaaatggac 3360
tttatttacg aggatgaact gaatgatttc gtcaaagcgg gcgttgtcag cgagctgatc 3420
gtcgctttta gccgcgaagg cccgatgaaa gaatacgtgc aacacaaaat gagccaacgt 3480
gcctccgatg tgtggaacat cattagcgac ggtggttatg tttatgtttg cggtgacgcg 3540
aagggtatgg ctcgtgatgt tcaccgtacc ctgcatacca tcgcacagga gcaaggtagc 3600
atgtccagct cggaggccga aggtatggtc aaaaacctgc aaaccaccgg tcgttacctg 3660
cgtgatgtgt ggtaataaaa gctt 3684
<210> 16
<211> 3684
<212> DNA
<213> 人工序列
<220>
<223> pCWori-CYP71AV8-P2O-CPRm 插入DNA 序列
<400> 16
catatggcac tgttgctggc tgtcttttgg tctgctctga ttattttggt ggttacctac 60
accatctccc tgctgattaa ccagtggcgt aaaccgaaac cacagggtaa attcccgccg 120
ggtccgtggc gtctgccgat tatcggtcac atgcaccatt tgatcggcac catgccgcat 180
cgtggtgtta tggaactggc ccgtaagcat ggcagcctga tgcacctgca actgggtgaa 240
gtctctacga ttgttgtcag cagcccgcgt tgggcgaaag aggtcttgac cacctatgat 300
atcaccttcg ccaatcgccc ggaaaccctg actggcgaga tcgtcgcata ccacaacacg 360
gatatcgtcc tggcgccgta tggtgagtat tggcgtcaac tgcgtaaact gtgcacgctg 420
gagctgctga gcaacaagaa agtgaagagc ttccagagcc tgcgcgaaga agagtgttgg 480
aacctggtca aggacatccg cagcaccggc caaggtagcc caatcaatct gtcggagaac 540
attttcaaga tgattgcgac gattctgagc cgtgctgcgt tcggtaaggg tattaaggat 600
caaatgaagt ttaccgaact ggtgaaagaa atcctgcgtc tgaccggcgg ttttgatgtc 660
gctgacatct tccctagcaa gaagttgctg caccacctga gcggcaagcg tgcaaaactg 720
accaatatcc ataacaagct ggataatctg atcaataaca tcatcgcaga gcacccgggc 780
aaccgtacct cgtcctccca ggaaacgctg ctggacgttc tgctgcgcct gaaagagtct 840
gcggagtttc cgctgaccgc cgacaacgtt aaagcagtga tcctggatat gttcggcgct 900
ggtacggata ccagcagcgc gacgatcgag tgggcgatta gcgagctgat tcgctgccct 960
cgcgcgatgg agaaagtgca gacggaattg cgtcaggcac tgaatggcaa agagcgtatt 1020
caggaagagg atttgcagga gctgaattat ctgaagctgg tgattaaaga aaccctgcgc 1080
ctgcatccgc cgttgccgct ggtgatgccg cgtgagtgcc gtgaaccgtg tgttttgggc 1140
ggttacgaca ttccgagcaa aacgaagctg atcgttaatg ttttcgcgat taaccgtgac 1200
ccggaatact ggaaagacgc ggaaacgttt atgccggagc gttttgagaa tagcccgatt 1260
accgttatgg gttccgagta cgaatacctg ccatttggtg ctggtcgtcg tatgtgtcct 1320
ggtgcagcgc tgggtctggc caacgtggaa ctgccgctgg cgcacattct gtactatttc 1380
aactggaaac tgccgaacgg caagaccttc gaagatttgg acatgaccga gagctttggt 1440
gccactgtgc agcgcaaaac cgagctgctg ctggttccga ccgactttca aacgctgact 1500
gcgagcacct aatgagtcga ctaactttaa gaaggagata tatccatgga acctagctct 1560
cagaaactgt ctccgttgga atttgttgct gctatcctga agggcgacta cagcagcggt 1620
caggttgaag gtggtccacc gccaggtctg gcagctatgt tgatggaaaa taaggatttg 1680
gtgatggttc tgacgacgtc cgtggcagtc ctgatcggct gtgtcgtggt cctggcatgg 1740
cgtcgtgcgg caggtagcgg taagtacaag caacctgaac tgcctaaact ggtggtcccg 1800
aaagcagccg aaccggagga ggcagaggat gataaaacca agatcagcgt gtttttcggc 1860
acccaaaccg gtacggcaga aggtttcgcg aaggcttttg ttgaagaggc caaggcgcgt 1920
tatcagcagg cccgtttcaa agttatcgac ctggacgact atgcggcaga cgatgacgag 1980
tacgaagaga aactgaagaa ggaaaacttg gcattcttct tcttggcgtc ctacggtgac 2040
ggcgagccga cggacaacgc ggcacgcttt tacaaatggt ttacggaggg taaggaccgt 2100
ggtgaatggc tgaacaatct gcagtacggc gtttttggtc tgggtaaccg tcaatatgag 2160
catttcaata agatcgccat tgtcgtcgat gatctgatct tcgagcaagg tggcaagaag 2220
ctggttccgg tgggtctggg tgacgatgac cagtgcattg aggatgattt tgcggcgtgg 2280
cgtgaactgg tctggccgga actggataaa ctgctgcgta acgaagacga cgctaccgtg 2340
gcaaccccgt acagcgccgc tgtgctgcaa taccgcgtgg ttttccacga tcacattgac 2400
ggcctgatta gcgaaaacgg tagcccgaac ggtcatgcta atggcaatac cgtgtacgat 2460
gcgcaacacc cgtgccgtag caacgtcgcg gtcaagaagg aattgcatac tccggcgagc 2520
gatcgcagct gcacccacct ggaatttaac attagcggta ccggcctgat gtacgagacg 2580
ggtgaccacg tcggtgtgta ttgcgagaac ctgttggaaa ccgtggagga ggccgagaag 2640
ttgttgaacc tgagcccgca gacgtacttc tccgttcaca ccgacaacga ggacggtacg 2700
ccgttgagcg gcagcagcct gccgccaccg tttccgccgt gcaccttgcg cacggcattg 2760
accaaatacg cagacttgac ttctgcaccg aaaaagtcgg tgctggtggc gctggccgag 2820
tacgcatctg accagggtga agcggatcgt ttgcgtttct tggcgagccc gagcggcaaa 2880
gaggaatatg cacagtacat cttggcaagc cagcgcacgc tgctggaggt catggcggag 2940
ttcccgtcgg cgaaaccgcc gctgggtgtc tttttcgcgg gtgtcgctcc gcgcctgcag 3000
ccgcgtttct attccattag ctctagcccg aagatcgcac cgttccgtat tcacgtgacc 3060
tgcgccctgg tttatgacaa atcccctacc ggtcgcgttc ataagggcat ctgtagcacg 3120
tggatgaaaa atgcggtccc gctggaagaa agcaacgatt gttcctgggc tccgatcttc 3180
gtccgcaaca gcaacttcaa gctgccgacc gacccgaagg ttccgattat catgattggt 3240
ccgggtaccg gtctggcccc ttttcgtggc tttttgcaag agcgcttggc gttgaaagag 3300
agcggtgctg aattgggtcc ggcgatcttg ttctttggtt gccgtaaccg taaaatggac 3360
tttatttacg aggatgaact gaatgatttc gtcaaagcgg gcgttgtcag cgagctgatc 3420
gtcgctttta gccgcgaagg cccgatgaaa gaatacgtgc aacacaaaat gagccaacgt 3480
gcctccgatg tgtggaacat cattagcgac ggtggttatg tttatgtttg cggtgacgcg 3540
aagggtatgg ctcgtgatgt tcaccgtacc ctgcatacca tcgcacagga gcaaggtagc 3600
atgtccagct cggaggccga aggtatggtc aaaaacctgc aaaccaccgg tcgttacctg 3660
cgtgatgtgt ggtaataaaa gctt 3684
<210> 17
<211> 3498
<212> DNA
<213> 人工序列
<220>
<223> pCWori-CYP71AV8-65188-aaCPR 插入DNA 序列
<400> 17
catatggcac tcttactggc agtattctgg tccgccctga tcattcttgt aacccgcacg 60
actagcaaaa agaatctgtt gccggagcca tggcgtctgc cgattatcgg tcacatgcac 120
catttgatcg gcaccatgcc gcatcgtggt gttatggaac tggcccgtaa gcatggcagc 180
ctgatgcacc tgcaactggg tgaagtctct acgattgttg tcagcagccc gcgttgggcg 240
aaagaggtct tgaccaccta tgatatcacc ttcgccaatc gcccggaaac cctgactggc 300
gagatcgtcg cataccacaa cacggatatc gtcctggcgc cgtatggtga gtattggcgt 360
caactgcgta aactgtgcac gctggagctg ctgagcaaca agaaagtgaa gagcttccag 420
agcctgcgcg aagaagagtg ttggaacctg gtcaaggaca tccgcagcac cggccaaggt 480
agcccaatca atctgtcgga gaacattttc aagatgattg cgacgattct gagccgtgct 540
gcgttcggta agggtattaa ggatcaaatg aagtttaccg aactggtgaa agaaatcctg 600
cgtctgaccg gcggttttga tgtcgctgac atcttcccta gcaagaagtt gctgcaccac 660
ctgagcggca agcgtgcaaa actgaccaat atccataaca agctggataa tctgatcaat 720
aacatcatcg cagagcaccc gggcaaccgt acctcgtcct cccaggaaac gctgctggac 780
gttctgctgc gcctgaaaga gtctgcggag tttccgctga ccgccgacaa cgttaaagca 840
gtgatcctgg atatgttcgg cgctggtacg gataccagca gcgcgacgat cgagtgggcg 900
attagcgagc tgattcgctg ccctcgcgcg atggagaaag tgcagacgga attgcgtcag 960
gcactgaatg gcaaagagcg tattcaggaa gaggatttgc aggagctgaa ttatctgaag 1020
ctggtgatta aagaaaccct gcgcctgcat ccgccgttgc cgctggtgat gccgcgtgag 1080
tgccgtgaac cgtgtgtttt gggcggttac gacattccga gcaaaacgaa gctgatcgtt 1140
aatgttttcg cgattaaccg tgacccggaa tactggaaag acgcggaaac gtttatgccg 1200
gagcgttttg agaatagccc gattaccgtt atgggttccg agtacgaata cctgccattt 1260
ggtgctggtc gtcgtatgtg tcctggtgca gcgctgggtc tggccaacgt ggaactgccg 1320
ctggcgcaca ttctgtacta tttcaactgg aaactgccga acggcaagac cttcgaagat 1380
ttggacatga ccgagagctt tggtgccact gtgcagcgca aaaccgagct gctgctggtt 1440
ccgaccgact ttcaaacgct gactgcgagc acctaatgag tcgacagagg aagatatacc 1500
atggcactgg acaaactgga cctgtacgta atcatcacct tagtcgtcgc cgtggccgcg 1560
tattttgcga aaaatcgccg ctcgtctagc gcagccaaga aagccgcgga gagcccggtt 1620
attgtcgtcc cgaagaaggt tacggaggac gaagtggacg acggtcgtaa aaaggtcacg 1680
gtgttcttcg gcacgcagac tggtaccgct gaaggtttcg cgaaggcgct ggttgaagaa 1740
gcaaaggcgc gctatgaaaa ggcagtgttc aaggttatcg atctggacga ttacgccgca 1800
gaggacgacg aatacgagga gaagttgaaa aaggagtccc tcgccttctt cttcctggcg 1860
acgtacggcg atggtgagcc gaccgataac gcagctcgtt tctacaagtg gttcaccgag 1920
ggtgaggaga agggtgagtg gctggataaa ctgcaatatg cggtctttgg tctgggcaac 1980
cgccaatatg agcacttcaa taagatcgca aaggttgtgg atgagaaact ggtcgagcag 2040
ggtgccaagc gcctggtgcc ggttggcatg ggtgatgacg atcagtgcat cgaggatgac 2100
ttcaccgcct ggaaggagct ggtgtggccg gagctggacc aactgttgcg cgacgaagat 2160
gacaccagcg ttgcgacgcc gtataccgcg gcagttggcg aatatcgtgt tgtttttcat 2220
gataagccgg aaacctacga tcaggatcaa ctgaccaatg gtcatgctgt gcatgacgcg 2280
cagcacccgt gcagaagcaa tgttgctgtt aagaaagaat tgcactctcc gctgtccgat 2340
cgcagctgca cccacctgga atttgacatc agcaataccg gtttgagcta cgaaacgggc 2400
gatcacgtcg gtgtgtatgt ggaaaatctg agcgaagttg tcgatgaggc tgagaagctg 2460
atcggtttac caccgcacac ctacttcagc gtgcatactg acaatgagga tggcacccca 2520
ctgggcggtg ctagcctgcc accgcctttc ccgccttgca ccctgcgcaa agccctcgct 2580
agctacgctg atgtgctgag cagcccgaag aagagcgcac tgctggcact ggcagcacac 2640
gctaccgatt ccaccgaagc cgatcgcctg aagtttttcg ctagcccggc aggcaaggac 2700
gagtatgcgc agtggattgt cgcgagccac cgtagcctgc tggaagtgat ggaggcgttc 2760
ccgagcgcga agcctccgct cggcgtcttt ttcgcatcgg ttgcgcctcg cctgcaaccg 2820
cgttattact caatcagcag ctctccgaaa ttcgcgccga atcgtattca cgttacttgc 2880
gcgctggttt atgagcaaac tccgagcggt cgtgttcaca agggcgtttg ctctacctgg 2940
atgaaaaacg cggttcctat gacggagagc caagactgta gctgggctcc gatttatgtt 3000
cgcacgtcta actttcgcct gcctagcgac ccgaaggtgc cagtgattat gattggtccg 3060
ggtaccggtc tggcaccgtt ccgcggtttc ctgcaagaac gtctggcaca gaaagaagct 3120
ggtacggaat tgggcaccgc aattctgttc tttggttgtc gtaatcgtaa agtggacttt 3180
atctatgagg atgaactgaa caacttcgtg gaaaccggtg ccctgagcga attggtgacg 3240
gctttttctc gtgagggtgc gaccaaagaa tacgtgcagc acaagatgac gcagaaagca 3300
agcgacattt ggaatctgct gtccgaaggt gcgtacctgt atgtctgtgg cgacgcgaag 3360
ggcatggcaa aagacgttca ccgtaccctg cacaccattg tgcaggagca aggtagcctg 3420
gactcttcga aggcggaatt gtacgtcaaa aacctgcaaa tggccggtcg ttatctgcgt 3480
gacgtttggt aaaagctt 3498
<210> 18
<211> 3648
<212> DNA
<213> 人工序列
<220>
<223> pCWori-CYP71AV8-65188-CPRm 插入DNA 序列
<400> 18
catatggcac tcttactggc agtattctgg tccgccctga tcattcttgt aacccgcacg 60
actagcaaaa agaatctgtt gccggagcca tggcgtctgc cgattatcgg tcacatgcac 120
catttgatcg gcaccatgcc gcatcgtggt gttatggaac tggcccgtaa gcatggcagc 180
ctgatgcacc tgcaactggg tgaagtctct acgattgttg tcagcagccc gcgttgggcg 240
aaagaggtct tgaccaccta tgatatcacc ttcgccaatc gcccggaaac cctgactggc 300
gagatcgtcg cataccacaa cacggatatc gtcctggcgc cgtatggtga gtattggcgt 360
caactgcgta aactgtgcac gctggagctg ctgagcaaca agaaagtgaa gagcttccag 420
agcctgcgcg aagaagagtg ttggaacctg gtcaaggaca tccgcagcac cggccaaggt 480
agcccaatca atctgtcgga gaacattttc aagatgattg cgacgattct gagccgtgct 540
gcgttcggta agggtattaa ggatcaaatg aagtttaccg aactggtgaa agaaatcctg 600
cgtctgaccg gcggttttga tgtcgctgac atcttcccta gcaagaagtt gctgcaccac 660
ctgagcggca agcgtgcaaa actgaccaat atccataaca agctggataa tctgatcaat 720
aacatcatcg cagagcaccc gggcaaccgt acctcgtcct cccaggaaac gctgctggac 780
gttctgctgc gcctgaaaga gtctgcggag tttccgctga ccgccgacaa cgttaaagca 840
gtgatcctgg atatgttcgg cgctggtacg gataccagca gcgcgacgat cgagtgggcg 900
attagcgagc tgattcgctg ccctcgcgcg atggagaaag tgcagacgga attgcgtcag 960
gcactgaatg gcaaagagcg tattcaggaa gaggatttgc aggagctgaa ttatctgaag 1020
ctggtgatta aagaaaccct gcgcctgcat ccgccgttgc cgctggtgat gccgcgtgag 1080
tgccgtgaac cgtgtgtttt gggcggttac gacattccga gcaaaacgaa gctgatcgtt 1140
aatgttttcg cgattaaccg tgacccggaa tactggaaag acgcggaaac gtttatgccg 1200
gagcgttttg agaatagccc gattaccgtt atgggttccg agtacgaata cctgccattt 1260
ggtgctggtc gtcgtatgtg tcctggtgca gcgctgggtc tggccaacgt ggaactgccg 1320
ctggcgcaca ttctgtacta tttcaactgg aaactgccga acggcaagac cttcgaagat 1380
ttggacatga ccgagagctt tggtgccact gtgcagcgca aaaccgagct gctgctggtt 1440
ccgaccgact ttcaaacgct gactgcgagc acctaatgag tcgactaact ttaagaagga 1500
gatatatcca tggaacctag ctctcagaaa ctgtctccgt tggaatttgt tgctgctatc 1560
ctgaagggcg actacagcag cggtcaggtt gaaggtggtc caccgccagg tctggcagct 1620
atgttgatgg aaaataagga tttggtgatg gttctgacga cgtccgtggc agtcctgatc 1680
ggctgtgtcg tggtcctggc atggcgtcgt gcggcaggta gcggtaagta caagcaacct 1740
gaactgccta aactggtggt cccgaaagca gccgaaccgg aggaggcaga ggatgataaa 1800
accaagatca gcgtgttttt cggcacccaa accggtacgg cagaaggttt cgcgaaggct 1860
tttgttgaag aggccaaggc gcgttatcag caggcccgtt tcaaagttat cgacctggac 1920
gactatgcgg cagacgatga cgagtacgaa gagaaactga agaaggaaaa cttggcattc 1980
ttcttcttgg cgtcctacgg tgacggcgag ccgacggaca acgcggcacg cttttacaaa 2040
tggtttacgg agggtaagga ccgtggtgaa tggctgaaca atctgcagta cggcgttttt 2100
ggtctgggta accgtcaata tgagcatttc aataagatcg ccattgtcgt cgatgatctg 2160
atcttcgagc aaggtggcaa gaagctggtt ccggtgggtc tgggtgacga tgaccagtgc 2220
attgaggatg attttgcggc gtggcgtgaa ctggtctggc cggaactgga taaactgctg 2280
cgtaacgaag acgacgctac cgtggcaacc ccgtacagcg ccgctgtgct gcaataccgc 2340
gtggttttcc acgatcacat tgacggcctg attagcgaaa acggtagccc gaacggtcat 2400
gctaatggca ataccgtgta cgatgcgcaa cacccgtgcc gtagcaacgt cgcggtcaag 2460
aaggaattgc atactccggc gagcgatcgc agctgcaccc acctggaatt taacattagc 2520
ggtaccggcc tgatgtacga gacgggtgac cacgtcggtg tgtattgcga gaacctgttg 2580
gaaaccgtgg aggaggccga gaagttgttg aacctgagcc cgcagacgta cttctccgtt 2640
cacaccgaca acgaggacgg tacgccgttg agcggcagca gcctgccgcc accgtttccg 2700
ccgtgcacct tgcgcacggc attgaccaaa tacgcagact tgacttctgc accgaaaaag 2760
tcggtgctgg tggcgctggc cgagtacgca tctgaccagg gtgaagcgga tcgtttgcgt 2820
ttcttggcga gcccgagcgg caaagaggaa tatgcacagt acatcttggc aagccagcgc 2880
acgctgctgg aggtcatggc ggagttcccg tcggcgaaac cgccgctggg tgtctttttc 2940
gcgggtgtcg ctccgcgcct gcagccgcgt ttctattcca ttagctctag cccgaagatc 3000
gcaccgttcc gtattcacgt gacctgcgcc ctggtttatg acaaatcccc taccggtcgc 3060
gttcataagg gcatctgtag cacgtggatg aaaaatgcgg tcccgctgga agaaagcaac 3120
gattgttcct gggctccgat cttcgtccgc aacagcaact tcaagctgcc gaccgacccg 3180
aaggttccga ttatcatgat tggtccgggt accggtctgg ccccttttcg tggctttttg 3240
caagagcgct tggcgttgaa agagagcggt gctgaattgg gtccggcgat cttgttcttt 3300
ggttgccgta accgtaaaat ggactttatt tacgaggatg aactgaatga tttcgtcaaa 3360
gcgggcgttg tcagcgagct gatcgtcgct tttagccgcg aaggcccgat gaaagaatac 3420
gtgcaacaca aaatgagcca acgtgcctcc gatgtgtgga acatcattag cgacggtggt 3480
tatgtttatg tttgcggtga cgcgaagggt atggctcgtg atgttcaccg taccctgcat 3540
accatcgcac aggagcaagg tagcatgtcc agctcggagg ccgaaggtat ggtcaaaaac 3600
ctgcaaacca ccggtcgtta cctgcgtgat gtgtggtaat aaaagctt 3648
<210> 19
<211> 1665
<212> DNA
<213> 人工序列
<220>
<223> α-檀香萜合酶优化的 cDNA 序列
<400> 19
atggaccaca tgtctaccca gcaggttagc tccgagaata tcgttcgcaa cgcggcgaac 60
ttccacccga atatctgggg taatcatttc ttgacgtgtc caagccagac gatcgattct 120
tggacgcaac aacaccataa agagctgaaa gaagaggtcc gcaagatgat ggtgagcgac 180
gcaaacaaac cggcacaacg tctgcgtctg attgacaccg ttcaacgttt gggcgtggcg 240
tatcatttcg aaaaagaaat cgatgacgct ctggaaaaga tcggtcacga tccgtttgac 300
gataaggatg acctgtatat cgttagcctg tgttttcgcc tgctgcgtca gcatggcatc 360
aagattagct gcgatgtttt tgagaagttc aaagacgacg atggcaagtt taaggcttcc 420
ctgatgaatg atgtccaagg tatgctgtcg ttgtatgaag cggcccacct ggcaattcat 480
ggcgaggaca tcctggatga ggctattgtc tttacgacca cccacctgaa gagcaccgtt 540
tctaactccc cggtcaattc cacctttgcg gaacagattc gccacagcct gcgtgtgccg 600
ctgcgtaagg cagtcccgcg tttggagagc cgctacttcc tggatatcta tagccgtgac 660
gacctgcacg acaagactct gctgaacttt gccaaactgg acttcaacat cctgcaggcg 720
atgcaccaga aagaggcaag cgagatgacc cgttggtggc gtgatttcga tttcctgaag 780
aagctgccgt acattcgtga tcgcgtggtt gaactgtact tttggatttt ggtcggtgtg 840
agctaccaac cgaaattcag cacgggtcgt atctttttga gcaagattat ctgtctggaa 900
accctggtgg acgacacgtt tgatgcgtac ggtactttcg acgaactggc cattttcacc 960
gaggccgtta cgcgttggga cctgggtcat cgcgacgcgc tgcctgagta catgaaattc 1020
attttcaaga ccctgattga tgtgtacagc gaggcggaac aagagctggc aaaagagggc 1080
cgctcctata gcattcacta tgcgatccgt agcttccagg agttggtcat gaagtacttt 1140
tgcgaggcga aatggctgaa taagggttat gttccgagcc tggatgacta caagagcgtc 1200
agcctgcgca gcatcggctt cctgccgatc gccgtggctt cttttgtttt catgggcgac 1260
attgctacga aagaggtttt tgagtgggaa atgaataacc cgaaaatcat catcgcagcc 1320
gaaaccattt tccgctttct ggatgacatt gcaggtcatc gcttcgaaca aaaacgtgag 1380
cacagcccga gcgcaatcga gtgctacaaa aaccaacatg gtgtctcgga agaagaggca 1440
gtgaaagcgc tgagcttgga ggtcgccaat tcgtggaaag acattaacga agagctgctg 1500
ctgaacccta tggcaattcc actgccgttg ctgcaggtga tcctggattt gagccgtagc 1560
gcggacttca tgtacggtaa tgcgcaggac cgtttcacgc actccaccat gatgaaagat 1620
caagttgacc tggttctgaa agatccggtg aaactggacg attaa 1665
<210> 20
<211> 554
<212> PRT
<213> 人工序列
<220>
<223> α-檀香萜合酶氨基酸序列
<400> 20
Met Asp His Met Ser Thr Gln Gln Val Ser Ser Glu Asn Ile Val Arg
1 5 10 15
Asn Ala Ala Asn Phe His Pro Asn Ile Trp Gly Asn His Phe Leu Thr
20 25 30
Cys Pro Ser Gln Thr Ile Asp Ser Trp Thr Gln Gln His His Lys Glu
35 40 45
Leu Lys Glu Glu Val Arg Lys Met Met Val Ser Asp Ala Asn Lys Pro
50 55 60
Ala Gln Arg Leu Arg Leu Ile Asp Thr Val Gln Arg Leu Gly Val Ala
65 70 75 80
Tyr His Phe Glu Lys Glu Ile Asp Asp Ala Leu Glu Lys Ile Gly His
85 90 95
Asp Pro Phe Asp Asp Lys Asp Asp Leu Tyr Ile Val Ser Leu Cys Phe
100 105 110
Arg Leu Leu Arg Gln His Gly Ile Lys Ile Ser Cys Asp Val Phe Glu
115 120 125
Lys Phe Lys Asp Asp Asp Gly Lys Phe Lys Ala Ser Leu Met Asn Asp
130 135 140
Val Gln Gly Met Leu Ser Leu Tyr Glu Ala Ala His Leu Ala Ile His
145 150 155 160
Gly Glu Asp Ile Leu Asp Glu Ala Ile Val Phe Thr Thr Thr His Leu
165 170 175
Lys Ser Thr Val Ser Asn Ser Pro Val Asn Ser Thr Phe Ala Glu Gln
180 185 190
Ile Arg His Ser Leu Arg Val Pro Leu Arg Lys Ala Val Pro Arg Leu
195 200 205
Glu Ser Arg Tyr Phe Leu Asp Ile Tyr Ser Arg Asp Asp Leu His Asp
210 215 220
Lys Thr Leu Leu Asn Phe Ala Lys Leu Asp Phe Asn Ile Leu Gln Ala
225 230 235 240
Met His Gln Lys Glu Ala Ser Glu Met Thr Arg Trp Trp Arg Asp Phe
245 250 255
Asp Phe Leu Lys Lys Leu Pro Tyr Ile Arg Asp Arg Val Val Glu Leu
260 265 270
Tyr Phe Trp Ile Leu Val Gly Val Ser Tyr Gln Pro Lys Phe Ser Thr
275 280 285
Gly Arg Ile Phe Leu Ser Lys Ile Ile Cys Leu Glu Thr Leu Val Asp
290 295 300
Asp Thr Phe Asp Ala Tyr Gly Thr Phe Asp Glu Leu Ala Ile Phe Thr
305 310 315 320
Glu Ala Val Thr Arg Trp Asp Leu Gly His Arg Asp Ala Leu Pro Glu
325 330 335
Tyr Met Lys Phe Ile Phe Lys Thr Leu Ile Asp Val Tyr Ser Glu Ala
340 345 350
Glu Gln Glu Leu Ala Lys Glu Gly Arg Ser Tyr Ser Ile His Tyr Ala
355 360 365
Ile Arg Ser Phe Gln Glu Leu Val Met Lys Tyr Phe Cys Glu Ala Lys
370 375 380
Trp Leu Asn Lys Gly Tyr Val Pro Ser Leu Asp Asp Tyr Lys Ser Val
385 390 395 400
Ser Leu Arg Ser Ile Gly Phe Leu Pro Ile Ala Val Ala Ser Phe Val
405 410 415
Phe Met Gly Asp Ile Ala Thr Lys Glu Val Phe Glu Trp Glu Met Asn
420 425 430
Asn Pro Lys Ile Ile Ile Ala Ala Glu Thr Ile Phe Arg Phe Leu Asp
435 440 445
Asp Ile Ala Gly His Arg Phe Glu Gln Lys Arg Glu His Ser Pro Ser
450 455 460
Ala Ile Glu Cys Tyr Lys Asn Gln His Gly Val Ser Glu Glu Glu Ala
465 470 475 480
Val Lys Ala Leu Ser Leu Glu Val Ala Asn Ser Trp Lys Asp Ile Asn
485 490 495
Glu Glu Leu Leu Leu Asn Pro Met Ala Ile Pro Leu Pro Leu Leu Gln
500 505 510
Val Ile Leu Asp Leu Ser Arg Ser Ala Asp Phe Met Tyr Gly Asn Ala
515 520 525
Gln Asp Arg Phe Thr His Ser Thr Met Met Lys Asp Gln Val Asp Leu
530 535 540
Val Leu Lys Asp Pro Val Lys Leu Asp Asp
545 550
<210> 21
<211> 1728
<212> DNA
<213> 人工序列
<220>
<223> SaTps8201-1-FL_optEc (α-檀香萜合酶优化的全长cDNA)包括RBS区域和限制位点
<400> 21
aggaggtaaa acatatggac agcagcaccg ccaccgcaat gaccgcacca ttcatcgacc 60
cgacggatca tgtgaatctg aaaaccgaca cggatgcgag cgaaaatcgt cgtatgggta 120
actacaagcc gagcatttgg aactacgatt ttctgcagtc cctggcgacg caccacaaca 180
ttgttgaaga gcgtcacctg aagctggcag agaaactgaa aggtcaagtg aaattcatgt 240
tcggtgcgcc gatggagcca ttggctaagt tggagctggt tgatgtggtg caacgcttgg 300
gtctgaacca cctgttcgag actgaaatca aagaagctct gttcagcatc tacaaagatg 360
gcagcaatgg ctggtggttt ggccatctgc atgctacctc tttgcgcttc cgtctgttgc 420
gccaatgtgg cctgtttatc ccgcaggacg ttttcaaaac ctttcaaaac aagaccggtg 480
agtttgacat gaagctgtgg gacaacgtta agggcctgct gagcctgtac gaggcgagct 540
acctgggctg gaagggcgag aacatcttgg atgaagcaaa ggcgttcacg accaagtgcc 600
tgaagagcgc atgggagaac attagcgaga agtggctggc gaagcgtgtt aaacatgcgt 660
tggcgctgcc gctgcactgg cgtgttccgc gtattgaagc acgctggttt atcgaggtgt 720
acgaacaaga ggccaatatg aatccgacgc tgctgaaact ggcgaaactg gacttcaaca 780
tggtccaaag cattcaccag aaagaaatcg gtgaactggc ccgctggtgg gttactaccg 840
gcctggacaa gctggatttc gcacgcaaca atctgttgca gtcttatatg tggagctgcg 900
ccatcgcgtc cgacccgaaa ttcaaactgg cgcgtgaaac cattgtcgag atcggttccg 960
tgttgacggt tgtcgacgac ggctatgatg tgtacggttc tatggatgag ctggacctgt 1020
acaccagctc ggtggagcgt tggtcctgtg tcaaaattga caagctgcct aatacgctga 1080
agctgatctt tatgtctatg ttcaacaaaa ccaacgaggt gggtctgcgt gttcaacacg 1140
agcgtggtta caatagcatc ccgaccttca ttaaggcgtg ggtggaacag tgtaagagct 1200
atcaaaaaga ggcgcgttgg tttcatggtg gtcacacgcc tccgctggaa gaatacagcc 1260
tgaacggtct ggtcagcatt ggttttccgc tgttgctgat caccggctat gttgcgattg 1320
ctgagaatga agcagccctg gataaagtcc acccgctgcc ggacctgctg cattattcca 1380
gcttgctgag ccgtctgatt aatgatatcg gcactagccc ggatgaaatg gcgcgtggtg 1440
acaatctgaa gagcattcac tgctatatga atgaaaccgg tgccagcgaa gaggtcgcac 1500
gcgagcacat caaaggcgtc atcgaagaga attggaaaat tctgaaccag tgttgctttg 1560
accagtccca gttccaggag ccgttcatca cgtttaacct gaacagcgtg cgcggctcgc 1620
atttcttcta tgaatttggt gatggttttg gtgttaccga cagctggacc aaggtggata 1680
tgaaaagcgt cctgattgat ccgattccgc tgggtgaaga gtaagctt 1728
<210> 22
<211> 569
<212> PRT
<213> 人工序列
<220>
<223> SaTps8201-1-FL (α/β-檀香萜合酶全长)氨基酸序列
<400> 22
Met Asp Ser Ser Thr Ala Thr Ala Met Thr Ala Pro Phe Ile Asp Pro
1 5 10 15
Thr Asp His Val Asn Leu Lys Thr Asp Thr Asp Ala Ser Glu Asn Arg
20 25 30
Arg Met Gly Asn Tyr Lys Pro Ser Ile Trp Asn Tyr Asp Phe Leu Gln
35 40 45
Ser Leu Ala Thr His His Asn Ile Val Glu Glu Arg His Leu Lys Leu
50 55 60
Ala Glu Lys Leu Lys Gly Gln Val Lys Phe Met Phe Gly Ala Pro Met
65 70 75 80
Glu Pro Leu Ala Lys Leu Glu Leu Val Asp Val Val Gln Arg Leu Gly
85 90 95
Leu Asn His Leu Phe Glu Thr Glu Ile Lys Glu Ala Leu Phe Ser Ile
100 105 110
Tyr Lys Asp Gly Ser Asn Gly Trp Trp Phe Gly His Leu His Ala Thr
115 120 125
Ser Leu Arg Phe Arg Leu Leu Arg Gln Cys Gly Leu Phe Ile Pro Gln
130 135 140
Asp Val Phe Lys Thr Phe Gln Asn Lys Thr Gly Glu Phe Asp Met Lys
145 150 155 160
Leu Trp Asp Asn Val Lys Gly Leu Leu Ser Leu Tyr Glu Ala Ser Tyr
165 170 175
Leu Gly Trp Lys Gly Glu Asn Ile Leu Asp Glu Ala Lys Ala Phe Thr
180 185 190
Thr Lys Cys Leu Lys Ser Ala Trp Glu Asn Ile Ser Glu Lys Trp Leu
195 200 205
Ala Lys Arg Val Lys His Ala Leu Ala Leu Pro Leu His Trp Arg Val
210 215 220
Pro Arg Ile Glu Ala Arg Trp Phe Ile Glu Val Tyr Glu Gln Glu Ala
225 230 235 240
Asn Met Asn Pro Thr Leu Leu Lys Leu Ala Lys Leu Asp Phe Asn Met
245 250 255
Val Gln Ser Ile His Gln Lys Glu Ile Gly Glu Leu Ala Arg Trp Trp
260 265 270
Val Thr Thr Gly Leu Asp Lys Leu Asp Phe Ala Arg Asn Asn Leu Leu
275 280 285
Gln Ser Tyr Met Trp Ser Cys Ala Ile Ala Ser Asp Pro Lys Phe Lys
290 295 300
Leu Ala Arg Glu Thr Ile Val Glu Ile Gly Ser Val Leu Thr Val Val
305 310 315 320
Asp Asp Gly Tyr Asp Val Tyr Gly Ser Met Asp Glu Leu Asp Leu Tyr
325 330 335
Thr Ser Ser Val Glu Arg Trp Ser Cys Val Lys Ile Asp Lys Leu Pro
340 345 350
Asn Thr Leu Lys Leu Ile Phe Met Ser Met Phe Asn Lys Thr Asn Glu
355 360 365
Val Gly Leu Arg Val Gln His Glu Arg Gly Tyr Asn Ser Ile Pro Thr
370 375 380
Phe Ile Lys Ala Trp Val Glu Gln Cys Lys Ser Tyr Gln Lys Glu Ala
385 390 395 400
Arg Trp Phe His Gly Gly His Thr Pro Pro Leu Glu Glu Tyr Ser Leu
405 410 415
Asn Gly Leu Val Ser Ile Gly Phe Pro Leu Leu Leu Ile Thr Gly Tyr
420 425 430
Val Ala Ile Ala Glu Asn Glu Ala Ala Leu Asp Lys Val His Pro Leu
435 440 445
Pro Asp Leu Leu His Tyr Ser Ser Leu Leu Ser Arg Leu Ile Asn Asp
450 455 460
Ile Gly Thr Ser Pro Asp Glu Met Ala Arg Gly Asp Asn Leu Lys Ser
465 470 475 480
Ile His Cys Tyr Met Asn Glu Thr Gly Ala Ser Glu Glu Val Ala Arg
485 490 495
Glu His Ile Lys Gly Val Ile Glu Glu Asn Trp Lys Ile Leu Asn Gln
500 505 510
Cys Cys Phe Asp Gln Ser Gln Phe Gln Glu Pro Phe Ile Thr Phe Asn
515 520 525
Leu Asn Ser Val Arg Gly Ser His Phe Phe Tyr Glu Phe Gly Asp Gly
530 535 540
Phe Gly Val Thr Asp Ser Trp Thr Lys Val Asp Met Lys Ser Val Leu
545 550 555 560
Ile Asp Pro Ile Pro Leu Gly Glu Glu
565
<210> 23
<211> 5361
<212> DNA
<213> 人工序列
<220>
<223> 包含CYP71AV-P2、CPRm以及在3'和5'末端包括NdeI和HindIII限制性位点的ClASS的合成操纵子的DNA 序列
<400> 23
catatggctc tgttattagc agttttttgg tcggcgctta taatcctcgt agtaacctac 60
accatatccc tcctaatcaa ccaatggcga aaaccgaaac cccaagggaa gttccccccg 120
ggcccatggc gtctgccgat tatcggtcac atgcaccatt tgatcggcac catgccgcat 180
cgtggtgtta tggaactggc ccgtaagcat ggcagcctga tgcacctgca actgggtgaa 240
gtctctacga ttgttgtcag cagcccgcgt tgggcgaaag aggtcttgac cacctatgat 300
atcaccttcg ccaatcgccc ggaaaccctg actggcgaga tcgtcgcata ccacaacacg 360
gatatcgtcc tggcgccgta tggtgagtat tggcgtcaac tgcgtaaact gtgcacgctg 420
gagctgctga gcaacaagaa agtgaagagc ttccagagcc tgcgcgaaga agagtgttgg 480
aacctggtca aggacatccg cagcaccggc caaggtagcc caatcaatct gtcggagaac 540
attttcaaga tgattgcgac gattctgagc cgtgctgcgt tcggtaaggg tattaaggat 600
caaatgaagt ttaccgaact ggtgaaagaa atcctgcgtc tgaccggcgg ttttgatgtc 660
gctgacatct tccctagcaa gaagttgctg caccacctga gcggcaagcg tgcaaaactg 720
accaatatcc ataacaagct ggataatctg atcaataaca tcatcgcaga gcacccgggc 780
aaccgtacct cgtcctccca ggaaacgctg ctggacgttc tgctgcgcct gaaagagtct 840
gcggagtttc cgctgaccgc cgacaacgtt aaagcagtga tcctggatat gttcggcgct 900
ggtacggata ccagcagcgc gacgatcgag tgggcgatta gcgagctgat tcgctgccct 960
cgcgcgatgg agaaagtgca gacggaattg cgtcaggcac tgaatggcaa agagcgtatt 1020
caggaagagg atttgcagga gctgaattat ctgaagctgg tgattaaaga aaccctgcgc 1080
ctgcatccgc cgttgccgct ggtgatgccg cgtgagtgcc gtgaaccgtg tgttttgggc 1140
ggttacgaca ttccgagcaa aacgaagctg atcgttaatg ttttcgcgat taaccgtgac 1200
ccggaatact ggaaagacgc ggaaacgttt atgccggagc gttttgagaa tagcccgatt 1260
accgttatgg gttccgagta cgaatacctg ccatttggtg ctggtcgtcg tatgtgtcct 1320
ggtgcagcgc tgggtctggc caacgtggaa ctgccgctgg cgcacattct gtactatttc 1380
aactggaaac tgccgaacgg caagaccttc gaagatttgg acatgaccga gagctttggt 1440
gccactgtgc agcgcaaaac cgagctgctg ctggttccga ccgactttca aacgctgact 1500
gcgagcacct aatgagtcga ctaactttaa gaaggagata tatccatgga acctagctct 1560
cagaaactgt ctccgttgga atttgttgct gctatcctga agggcgacta cagcagcggt 1620
caggttgaag gtggtccacc gccaggtctg gcagctatgt tgatggaaaa taaggatttg 1680
gtgatggttc tgacgacgtc cgtggcagtc ctgatcggct gtgtcgtggt cctggcatgg 1740
cgtcgtgcgg caggtagcgg taagtacaag caacctgaac tgcctaaact ggtggtcccg 1800
aaagcagccg aaccggagga ggcagaggat gataaaacca agatcagcgt gtttttcggc 1860
acccaaaccg gtacggcaga aggtttcgcg aaggcttttg ttgaagaggc caaggcgcgt 1920
tatcagcagg cccgtttcaa agttatcgac ctggacgact atgcggcaga cgatgacgag 1980
tacgaagaga aactgaagaa ggaaaacttg gcattcttct tcttggcgtc ctacggtgac 2040
ggcgagccga cggacaacgc ggcacgcttt tacaaatggt ttacggaggg taaggaccgt 2100
ggtgaatggc tgaacaatct gcagtacggc gtttttggtc tgggtaaccg tcaatatgag 2160
catttcaata agatcgccat tgtcgtcgat gatctgatct tcgagcaagg tggcaagaag 2220
ctggttccgg tgggtctggg tgacgatgac cagtgcattg aggatgattt tgcggcgtgg 2280
cgtgaactgg tctggccgga actggataaa ctgctgcgta acgaagacga cgctaccgtg 2340
gcaaccccgt acagcgccgc tgtgctgcaa taccgcgtgg ttttccacga tcacattgac 2400
ggcctgatta gcgaaaacgg tagcccgaac ggtcatgcta atggcaatac cgtgtacgat 2460
gcgcaacacc cgtgccgtag caacgtcgcg gtcaagaagg aattgcatac tccggcgagc 2520
gatcgcagct gcacccacct ggaatttaac attagcggta ccggcctgat gtacgagacg 2580
ggtgaccacg tcggtgtgta ttgcgagaac ctgttggaaa ccgtggagga ggccgagaag 2640
ttgttgaacc tgagcccgca gacgtacttc tccgttcaca ccgacaacga ggacggtacg 2700
ccgttgagcg gcagcagcct gccgccaccg tttccgccgt gcaccttgcg cacggcattg 2760
accaaatacg cagacttgac ttctgcaccg aaaaagtcgg tgctggtggc gctggccgag 2820
tacgcatctg accagggtga agcggatcgt ttgcgtttct tggcgagccc gagcggcaaa 2880
gaggaatatg cacagtacat cttggcaagc cagcgcacgc tgctggaggt catggcggag 2940
ttcccgtcgg cgaaaccgcc gctgggtgtc tttttcgcgg gtgtcgctcc gcgcctgcag 3000
ccgcgtttct attccattag ctctagcccg aagatcgcac cgttccgtat tcacgtgacc 3060
tgcgccctgg tttatgacaa atcccctacc ggtcgcgttc ataagggcat ctgtagcacg 3120
tggatgaaaa atgcggtccc gctggaagaa agcaacgatt gttcctgggc tccgatcttc 3180
gtccgcaaca gcaacttcaa gctgccgacc gacccgaagg ttccgattat catgattggt 3240
ccgggtaccg gtctggcccc ttttcgtggc tttttgcaag agcgcttggc gttgaaagag 3300
agcggtgctg aattgggtcc ggcgatcttg ttctttggtt gccgtaaccg taaaatggac 3360
tttatttacg aggatgaact gaatgatttc gtcaaagcgg gcgttgtcag cgagctgatc 3420
gtcgctttta gccgcgaagg cccgatgaaa gaatacgtgc aacacaaaat gagccaacgt 3480
gcctccgatg tgtggaacat cattagcgac ggtggttatg tttatgtttg cggtgacgcg 3540
aagggtatgg ctcgtgatgt tcaccgtacc ctgcatacca tcgcacagga gcaaggtagc 3600
atgtccagct cggaggccga aggtatggtc aaaaacctgc aaaccaccgg tcgttacctg 3660
cgtgatgtgt ggtaataaaa gcttgaagga gatatactaa tgtctaccca gcaggttagc 3720
tccgagaata tcgttcgcaa cgcggcgaac ttccacccga atatctgggg taatcatttc 3780
ttgacgtgtc caagccagac gatcgattct tggacgcaac aacaccataa agagctgaaa 3840
gaagaggtcc gcaagatgat ggtgagcgac gcaaacaaac cggcacaacg tctgcgtctg 3900
attgacaccg ttcaacgttt gggcgtggcg tatcatttcg aaaaagaaat cgatgacgct 3960
ctggaaaaga tcggtcacga tccgtttgac gataaggatg acctgtatat cgttagcctg 4020
tgttttcgcc tgctgcgtca gcatggcatc aagattagct gcgatgtttt tgagaagttc 4080
aaagacgacg atggcaagtt taaggcttcc ctgatgaatg atgtccaagg tatgctgtcg 4140
ttgtatgaag cggcccacct ggcaattcat ggcgaggaca tcctggatga ggctattgtc 4200
tttacgacca cccacctgaa gagcaccgtt tctaactccc cggtcaattc cacctttgcg 4260
gaacagattc gccacagcct gcgtgtgccg ctgcgtaagg cagtcccgcg tttggagagc 4320
cgctacttcc tggatatcta tagccgtgac gacctgcacg acaagactct gctgaacttt 4380
gccaaactgg acttcaacat cctgcaggcg atgcaccaga aagaggcaag cgagatgacc 4440
cgttggtggc gtgatttcga tttcctgaag aagctgccgt acattcgtga tcgcgtggtt 4500
gaactgtact tttggatttt ggtcggtgtg agctaccaac cgaaattcag cacgggtcgt 4560
atctttttga gcaagattat ctgtctggaa accctggtgg acgacacgtt tgatgcgtac 4620
ggtactttcg acgaactggc cattttcacc gaggccgtta cgcgttggga cctgggtcat 4680
cgcgacgcgc tgcctgagta catgaaattc attttcaaga ccctgattga tgtgtacagc 4740
gaggcggaac aagagctggc aaaagagggc cgctcctata gcattcacta tgcgatccgt 4800
agcttccagg agttggtcat gaagtacttt tgcgaggcga aatggctgaa taagggttat 4860
gttccgagcc tggatgacta caagagcgtc agcctgcgca gcatcggctt cctgccgatc 4920
gccgtggctt cttttgtttt catgggcgac attgctacga aagaggtttt tgagtgggaa 4980
atgaataacc cgaaaatcat catcgcagcc gaaaccattt tccgctttct ggatgacatt 5040
gcaggtcatc gcttcgaaca aaaacgtgag cacagcccga gcgcaatcga gtgctacaaa 5100
aaccaacatg gtgtctcgga agaagaggca gtgaaagcgc tgagcttgga ggtcgccaat 5160
tcgtggaaag acattaacga agagctgctg ctgaacccta tggcaattcc actgccgttg 5220
ctgcaggtga tcctggattt gagccgtagc gcggacttca tgtacggtaa tgcgcaggac 5280
cgtttcacgc actccaccat gatgaaagat caagttgacc tggttctgaa agatccggtg 5340
aaactggacg attaagaatt c 5361
<210> 24
<211> 5414
<212> DNA
<213> 人工序列
<220>
<223> 包含CYP71AV-P2、CPRm以及在3'和5'末端包括NdeI和HindIII限制性位点的SaSAS的合成操纵子的DNA 序列
<400> 24
catatggctc tgttattagc agttttttgg tcggcgctta taatcctcgt agtaacctac 60
accatatccc tcctaatcaa ccaatggcga aaaccgaaac cccaagggaa gttccccccg 120
ggcccatggc gtctgccgat tatcggtcac atgcaccatt tgatcggcac catgccgcat 180
cgtggtgtta tggaactggc ccgtaagcat ggcagcctga tgcacctgca actgggtgaa 240
gtctctacga ttgttgtcag cagcccgcgt tgggcgaaag aggtcttgac cacctatgat 300
atcaccttcg ccaatcgccc ggaaaccctg actggcgaga tcgtcgcata ccacaacacg 360
gatatcgtcc tggcgccgta tggtgagtat tggcgtcaac tgcgtaaact gtgcacgctg 420
gagctgctga gcaacaagaa agtgaagagc ttccagagcc tgcgcgaaga agagtgttgg 480
aacctggtca aggacatccg cagcaccggc caaggtagcc caatcaatct gtcggagaac 540
attttcaaga tgattgcgac gattctgagc cgtgctgcgt tcggtaaggg tattaaggat 600
caaatgaagt ttaccgaact ggtgaaagaa atcctgcgtc tgaccggcgg ttttgatgtc 660
gctgacatct tccctagcaa gaagttgctg caccacctga gcggcaagcg tgcaaaactg 720
accaatatcc ataacaagct ggataatctg atcaataaca tcatcgcaga gcacccgggc 780
aaccgtacct cgtcctccca ggaaacgctg ctggacgttc tgctgcgcct gaaagagtct 840
gcggagtttc cgctgaccgc cgacaacgtt aaagcagtga tcctggatat gttcggcgct 900
ggtacggata ccagcagcgc gacgatcgag tgggcgatta gcgagctgat tcgctgccct 960
cgcgcgatgg agaaagtgca gacggaattg cgtcaggcac tgaatggcaa agagcgtatt 1020
caggaagagg atttgcagga gctgaattat ctgaagctgg tgattaaaga aaccctgcgc 1080
ctgcatccgc cgttgccgct ggtgatgccg cgtgagtgcc gtgaaccgtg tgttttgggc 1140
ggttacgaca ttccgagcaa aacgaagctg atcgttaatg ttttcgcgat taaccgtgac 1200
ccggaatact ggaaagacgc ggaaacgttt atgccggagc gttttgagaa tagcccgatt 1260
accgttatgg gttccgagta cgaatacctg ccatttggtg ctggtcgtcg tatgtgtcct 1320
ggtgcagcgc tgggtctggc caacgtggaa ctgccgctgg cgcacattct gtactatttc 1380
aactggaaac tgccgaacgg caagaccttc gaagatttgg acatgaccga gagctttggt 1440
gccactgtgc agcgcaaaac cgagctgctg ctggttccga ccgactttca aaccctgact 1500
gcaagcacct aatgagtcga ctaactttaa gaaggagata tatccatgga acctagctct 1560
cagaaactgt ctccgttgga atttgttgct gctatcctga agggcgacta cagcagcggt 1620
caggttgaag gtggtccacc gccaggtctg gcagctatgt tgatggaaaa taaggatttg 1680
gtgatggttc tgacgacgtc cgtggcagtc ctgatcggct gtgtcgtggt cctggcatgg 1740
cgtcgtgcgg caggtagcgg taagtacaag caacctgaac tgcctaaact ggtggtcccg 1800
aaagcagccg aaccggagga ggcagaggat gataaaacca agatcagcgt gtttttcggc 1860
acccaaaccg gtacggcaga aggtttcgcg aaggcttttg ttgaagaggc caaggcgcgt 1920
tatcagcagg cccgtttcaa agttatcgac ctggacgact atgcggcaga cgatgacgag 1980
tacgaagaga aactgaagaa ggaaaacttg gcattcttct tcttggcgtc ctacggtgac 2040
ggcgagccga cggacaacgc ggcacgcttt tacaaatggt ttacggaggg taaggaccgt 2100
ggtgaatggc tgaacaatct gcagtacggc gtttttggtc tgggtaaccg tcaatatgag 2160
catttcaata agatcgccat tgtcgtcgat gatctgatct tcgagcaagg tggcaagaag 2220
ctggttccgg tgggtctggg tgacgatgac cagtgcattg aggatgattt tgcggcgtgg 2280
cgtgaactgg tctggccgga actggataaa ctgctgcgta acgaagacga cgctaccgtg 2340
gcaaccccgt acagcgccgc tgtgctgcaa taccgcgtgg ttttccacga tcacattgac 2400
ggcctgatta gcgaaaacgg tagcccgaac ggtcatgcta atggcaatac cgtgtacgat 2460
gcgcaacacc cgtgccgtag caacgtcgcg gtcaagaagg aattgcatac tccggcgagc 2520
gatcgcagct gcacccacct ggaatttaac attagcggta ccggcctgat gtacgagacg 2580
ggtgaccacg tcggtgtgta ttgcgagaac ctgttggaaa ccgtggagga ggccgagaag 2640
ttgttgaacc tgagcccgca gacgtacttc tccgttcaca ccgacaacga ggacggtacg 2700
ccgttgagcg gcagcagcct gccgccaccg tttccgccgt gcaccttgcg cacggcattg 2760
accaaatacg cagacttgac ttctgcaccg aaaaagtcgg tgctggtggc gctggccgag 2820
tacgcatctg accagggtga agcggatcgt ttgcgtttct tggcgagccc gagcggcaaa 2880
gaggaatatg cacagtacat cttggcaagc cagcgcacgc tgctggaggt catggcggag 2940
ttcccgtcgg cgaaaccgcc gctgggtgtc tttttcgcgg gtgtcgctcc gcgcctgcag 3000
ccgcgtttct attccattag ctctagcccg aagatcgcac cgttccgtat tcacgtgacc 3060
tgcgccctgg tttatgacaa atcccctacc ggtcgcgttc ataagggcat ctgtagcacg 3120
tggatgaaaa atgcggtccc gctggaagaa agcaacgatt gttcctgggc tccgatcttc 3180
gtccgcaaca gcaacttcaa gctgccgacc gacccgaagg ttccgattat catgattggt 3240
ccgggtaccg gtctggcccc ttttcgtggc tttttgcaag agcgcttggc gttgaaagag 3300
agcggtgctg aattgggtcc ggcgatcttg ttctttggtt gccgtaaccg taaaatggac 3360
tttatttacg aggatgaact gaatgatttc gtcaaagcgg gcgttgtcag cgagctgatc 3420
gtcgctttta gccgcgaagg cccgatgaaa gaatacgtgc aacacaaaat gagccaacgt 3480
gcctccgatg tgtggaacat cattagcgac ggtggttatg tttatgtttg cggtgacgcg 3540
aagggtatgg ctcgtgatgt tcaccgtacc ctgcatacca tcgcacagga gcaaggtagc 3600
atgtccagct cggaggccga aggtatggtc aaaaacctgc aaaccaccgg tcgttacctg 3660
cgtgatgtgt ggtaataaaa gcttaggagg taaaacatat ggacagcagc accgccaccg 3720
caatgaccgc accattcatc gacccgacgg atcatgtgaa tctgaaaacc gacacggatg 3780
cgagcgaaaa tcgtcgtatg ggtaactaca agccgagcat ttggaactac gattttctgc 3840
agtccctggc gacgcaccac aacattgttg aagagcgtca cctgaagctg gcagagaaac 3900
tgaaaggtca agtgaaattc atgttcggtg cgccgatgga gccattggct aagttggagc 3960
tggttgatgt ggtgcaacgc ttgggtctga accacctgtt cgagactgaa atcaaagaag 4020
ctctgttcag catctacaaa gatggcagca atggctggtg gtttggccat ctgcatgcta 4080
cctctttgcg cttccgtctg ttgcgccaat gtggcctgtt tatcccgcag gacgttttca 4140
aaacctttca aaacaagacc ggtgagtttg acatgaagct gtgggacaac gttaagggcc 4200
tgctgagcct gtacgaggcg agctacctgg gctggaaggg cgagaacatc ttggatgaag 4260
caaaggcgtt cacgaccaag tgcctgaaga gcgcatggga gaacattagc gagaagtggc 4320
tggcgaagcg tgttaaacat gcgttggcgc tgccgctgca ctggcgtgtt ccgcgtattg 4380
aagcacgctg gtttatcgag gtgtacgaac aagaggccaa tatgaatccg acgctgctga 4440
aactggcgaa actggacttc aacatggtcc aaagcattca ccagaaagaa atcggtgaac 4500
tggcccgctg gtgggttact accggcctgg acaagctgga tttcgcacgc aacaatctgt 4560
tgcagtctta tatgtggagc tgcgccatcg cgtccgaccc gaaattcaaa ctggcgcgtg 4620
aaaccattgt cgagatcggt tccgtgttga cggttgtcga cgacggctat gatgtgtacg 4680
gttctatgga tgagctggac ctgtacacca gctcggtgga gcgttggtcc tgtgtcaaaa 4740
ttgacaagct gcctaatacg ctgaagctga tctttatgtc tatgttcaac aaaaccaacg 4800
aggtgggtct gcgtgttcaa cacgagcgtg gttacaatag catcccgacc ttcattaagg 4860
cgtgggtgga acagtgtaag agctatcaaa aagaggcgcg ttggtttcat ggtggtcaca 4920
cgcctccgct ggaagaatac agcctgaacg gtctggtcag cattggtttt ccgctgttgc 4980
tgatcaccgg ctatgttgcg attgctgaga atgaagcagc cctggataaa gtccacccgc 5040
tgccggacct gctgcattat tccagcttgc tgagccgtct gattaatgat atcggcacta 5100
gcccggatga aatggcgcgt ggtgacaatc tgaagagcat tcactgctat atgaatgaaa 5160
ccggtgccag cgaagaggtc gcacgcgagc acatcaaagg cgtcatcgaa gagaattgga 5220
aaattctgaa ccagtgttgc tttgaccagt cccagttcca ggagccgttc atcacgttta 5280
acctgaacag cgtgcgcggc tcgcatttct tctatgaatt tggtgatggt tttggtgtta 5340
ccgacagctg gaccaaggtg gatatgaaaa gcgtcctgat tgatccgatt ccgctgggtg 5400
aagagtaagc ttgc 5414
<210> 25
<211> 5361
<212> DNA
<213> 人工序列
<220>
<223> 包含CYP71AV-P2O、CPRm以及在3'和5'末端包括NdeI和HindIII限制性位点的ClASS的合成操纵子的DNA序列
<400> 25
catatggcac tgttgctggc tgtcttttgg tctgctctga ttattttggt ggttacctac 60
accatctccc tgctgattaa ccagtggcgt aaaccgaaac cacagggtaa attcccgccg 120
ggtccgtggc gtctgccgat tatcggtcac atgcaccatt tgatcggcac catgccgcat 180
cgtggtgtta tggaactggc ccgtaagcat ggcagcctga tgcacctgca actgggtgaa 240
gtctctacga ttgttgtcag cagcccgcgt tgggcgaaag aggtcttgac cacctatgat 300
atcaccttcg ccaatcgccc ggaaaccctg actggcgaga tcgtcgcata ccacaacacg 360
gatatcgtcc tggcgccgta tggtgagtat tggcgtcaac tgcgtaaact gtgcacgctg 420
gagctgctga gcaacaagaa agtgaagagc ttccagagcc tgcgcgaaga agagtgttgg 480
aacctggtca aggacatccg cagcaccggc caaggtagcc caatcaatct gtcggagaac 540
attttcaaga tgattgcgac gattctgagc cgtgctgcgt tcggtaaggg tattaaggat 600
caaatgaagt ttaccgaact ggtgaaagaa atcctgcgtc tgaccggcgg ttttgatgtc 660
gctgacatct tccctagcaa gaagttgctg caccacctga gcggcaagcg tgcaaaactg 720
accaatatcc ataacaagct ggataatctg atcaataaca tcatcgcaga gcacccgggc 780
aaccgtacct cgtcctccca ggaaacgctg ctggacgttc tgctgcgcct gaaagagtct 840
gcggagtttc cgctgaccgc cgacaacgtt aaagcagtga tcctggatat gttcggcgct 900
ggtacggata ccagcagcgc gacgatcgag tgggcgatta gcgagctgat tcgctgccct 960
cgcgcgatgg agaaagtgca gacggaattg cgtcaggcac tgaatggcaa agagcgtatt 1020
caggaagagg atttgcagga gctgaattat ctgaagctgg tgattaaaga aaccctgcgc 1080
ctgcatccgc cgttgccgct ggtgatgccg cgtgagtgcc gtgaaccgtg tgttttgggc 1140
ggttacgaca ttccgagcaa aacgaagctg atcgttaatg ttttcgcgat taaccgtgac 1200
ccggaatact ggaaagacgc ggaaacgttt atgccggagc gttttgagaa tagcccgatt 1260
accgttatgg gttccgagta cgaatacctg ccatttggtg ctggtcgtcg tatgtgtcct 1320
ggtgcagcgc tgggtctggc caacgtggaa ctgccgctgg cgcacattct gtactatttc 1380
aactggaaac tgccgaacgg caagaccttc gaagatttgg acatgaccga gagctttggt 1440
gccactgtgc agcgcaaaac cgagctgctg ctggttccga ccgactttca aacgctgact 1500
gcgagcacct aatgagtcga ctaactttaa gaaggagata tatccatgga acctagctct 1560
cagaaactgt ctccgttgga atttgttgct gctatcctga agggcgacta cagcagcggt 1620
caggttgaag gtggtccacc gccaggtctg gcagctatgt tgatggaaaa taaggatttg 1680
gtgatggttc tgacgacgtc cgtggcagtc ctgatcggct gtgtcgtggt cctggcatgg 1740
cgtcgtgcgg caggtagcgg taagtacaag caacctgaac tgcctaaact ggtggtcccg 1800
aaagcagccg aaccggagga ggcagaggat gataaaacca agatcagcgt gtttttcggc 1860
acccaaaccg gtacggcaga aggtttcgcg aaggcttttg ttgaagaggc caaggcgcgt 1920
tatcagcagg cccgtttcaa agttatcgac ctggacgact atgcggcaga cgatgacgag 1980
tacgaagaga aactgaagaa ggaaaacttg gcattcttct tcttggcgtc ctacggtgac 2040
ggcgagccga cggacaacgc ggcacgcttt tacaaatggt ttacggaggg taaggaccgt 2100
ggtgaatggc tgaacaatct gcagtacggc gtttttggtc tgggtaaccg tcaatatgag 2160
catttcaata agatcgccat tgtcgtcgat gatctgatct tcgagcaagg tggcaagaag 2220
ctggttccgg tgggtctggg tgacgatgac cagtgcattg aggatgattt tgcggcgtgg 2280
cgtgaactgg tctggccgga actggataaa ctgctgcgta acgaagacga cgctaccgtg 2340
gcaaccccgt acagcgccgc tgtgctgcaa taccgcgtgg ttttccacga tcacattgac 2400
ggcctgatta gcgaaaacgg tagcccgaac ggtcatgcta atggcaatac cgtgtacgat 2460
gcgcaacacc cgtgccgtag caacgtcgcg gtcaagaagg aattgcatac tccggcgagc 2520
gatcgcagct gcacccacct ggaatttaac attagcggta ccggcctgat gtacgagacg 2580
ggtgaccacg tcggtgtgta ttgcgagaac ctgttggaaa ccgtggagga ggccgagaag 2640
ttgttgaacc tgagcccgca gacgtacttc tccgttcaca ccgacaacga ggacggtacg 2700
ccgttgagcg gcagcagcct gccgccaccg tttccgccgt gcaccttgcg cacggcattg 2760
accaaatacg cagacttgac ttctgcaccg aaaaagtcgg tgctggtggc gctggccgag 2820
tacgcatctg accagggtga agcggatcgt ttgcgtttct tggcgagccc gagcggcaaa 2880
gaggaatatg cacagtacat cttggcaagc cagcgcacgc tgctggaggt catggcggag 2940
ttcccgtcgg cgaaaccgcc gctgggtgtc tttttcgcgg gtgtcgctcc gcgcctgcag 3000
ccgcgtttct attccattag ctctagcccg aagatcgcac cgttccgtat tcacgtgacc 3060
tgcgccctgg tttatgacaa atcccctacc ggtcgcgttc ataagggcat ctgtagcacg 3120
tggatgaaaa atgcggtccc gctggaagaa agcaacgatt gttcctgggc tccgatcttc 3180
gtccgcaaca gcaacttcaa gctgccgacc gacccgaagg ttccgattat catgattggt 3240
ccgggtaccg gtctggcccc ttttcgtggc tttttgcaag agcgcttggc gttgaaagag 3300
agcggtgctg aattgggtcc ggcgatcttg ttctttggtt gccgtaaccg taaaatggac 3360
tttatttacg aggatgaact gaatgatttc gtcaaagcgg gcgttgtcag cgagctgatc 3420
gtcgctttta gccgcgaagg cccgatgaaa gaatacgtgc aacacaaaat gagccaacgt 3480
gcctccgatg tgtggaacat cattagcgac ggtggttatg tttatgtttg cggtgacgcg 3540
aagggtatgg ctcgtgatgt tcaccgtacc ctgcatacca tcgcacagga gcaaggtagc 3600
atgtccagct cggaggccga aggtatggtc aaaaacctgc aaaccaccgg tcgttacctg 3660
cgtgatgtgt ggtaataaaa gcttgaagga gatatactaa tgtctaccca gcaggttagc 3720
tccgagaata tcgttcgcaa cgcggcgaac ttccacccga atatctgggg taatcatttc 3780
ttgacgtgtc caagccagac gatcgattct tggacgcaac aacaccataa agagctgaaa 3840
gaagaggtcc gcaagatgat ggtgagcgac gcaaacaaac cggcacaacg tctgcgtctg 3900
attgacaccg ttcaacgttt gggcgtggcg tatcatttcg aaaaagaaat cgatgacgct 3960
ctggaaaaga tcggtcacga tccgtttgac gataaggatg acctgtatat cgttagcctg 4020
tgttttcgcc tgctgcgtca gcatggcatc aagattagct gcgatgtttt tgagaagttc 4080
aaagacgacg atggcaagtt taaggcttcc ctgatgaatg atgtccaagg tatgctgtcg 4140
ttgtatgaag cggcccacct ggcaattcat ggcgaggaca tcctggatga ggctattgtc 4200
tttacgacca cccacctgaa gagcaccgtt tctaactccc cggtcaattc cacctttgcg 4260
gaacagattc gccacagcct gcgtgtgccg ctgcgtaagg cagtcccgcg tttggagagc 4320
cgctacttcc tggatatcta tagccgtgac gacctgcacg acaagactct gctgaacttt 4380
gccaaactgg acttcaacat cctgcaggcg atgcaccaga aagaggcaag cgagatgacc 4440
cgttggtggc gtgatttcga tttcctgaag aagctgccgt acattcgtga tcgcgtggtt 4500
gaactgtact tttggatttt ggtcggtgtg agctaccaac cgaaattcag cacgggtcgt 4560
atctttttga gcaagattat ctgtctggaa accctggtgg acgacacgtt tgatgcgtac 4620
ggtactttcg acgaactggc cattttcacc gaggccgtta cgcgttggga cctgggtcat 4680
cgcgacgcgc tgcctgagta catgaaattc attttcaaga ccctgattga tgtgtacagc 4740
gaggcggaac aagagctggc aaaagagggc cgctcctata gcattcacta tgcgatccgt 4800
agcttccagg agttggtcat gaagtacttt tgcgaggcga aatggctgaa taagggttat 4860
gttccgagcc tggatgacta caagagcgtc agcctgcgca gcatcggctt cctgccgatc 4920
gccgtggctt cttttgtttt catgggcgac attgctacga aagaggtttt tgagtgggaa 4980
atgaataacc cgaaaatcat catcgcagcc gaaaccattt tccgctttct ggatgacatt 5040
gcaggtcatc gcttcgaaca aaaacgtgag cacagcccga gcgcaatcga gtgctacaaa 5100
aaccaacatg gtgtctcgga agaagaggca gtgaaagcgc tgagcttgga ggtcgccaat 5160
tcgtggaaag acattaacga agagctgctg ctgaacccta tggcaattcc actgccgttg 5220
ctgcaggtga tcctggattt gagccgtagc gcggacttca tgtacggtaa tgcgcaggac 5280
cgtttcacgc actccaccat gatgaaagat caagttgacc tggttctgaa agatccggtg 5340
aaactggacg attaagaatt c 5361
<210> 26
<211> 5414
<212> DNA
<213> 人工序列
<220>
<223> 包含CYP71AV-P2O、CPRm以及在3'和5'末端包括NdeI和HindIII限制性位点的SaSAS的合成操纵子的DNA序列
<400> 26
catatggcac tgttgctggc tgtcttttgg tctgctctga ttattttggt ggttacctac 60
accatctccc tgctgattaa ccagtggcgt aaaccgaaac cacagggtaa attcccgccg 120
ggtccgtggc gtctgccgat tatcggtcac atgcaccatt tgatcggcac catgccgcat 180
cgtggtgtta tggaactggc ccgtaagcat ggcagcctga tgcacctgca actgggtgaa 240
gtctctacga ttgttgtcag cagcccgcgt tgggcgaaag aggtcttgac cacctatgat 300
atcaccttcg ccaatcgccc ggaaaccctg actggcgaga tcgtcgcata ccacaacacg 360
gatatcgtcc tggcgccgta tggtgagtat tggcgtcaac tgcgtaaact gtgcacgctg 420
gagctgctga gcaacaagaa agtgaagagc ttccagagcc tgcgcgaaga agagtgttgg 480
aacctggtca aggacatccg cagcaccggc caaggtagcc caatcaatct gtcggagaac 540
attttcaaga tgattgcgac gattctgagc cgtgctgcgt tcggtaaggg tattaaggat 600
caaatgaagt ttaccgaact ggtgaaagaa atcctgcgtc tgaccggcgg ttttgatgtc 660
gctgacatct tccctagcaa gaagttgctg caccacctga gcggcaagcg tgcaaaactg 720
accaatatcc ataacaagct ggataatctg atcaataaca tcatcgcaga gcacccgggc 780
aaccgtacct cgtcctccca ggaaacgctg ctggacgttc tgctgcgcct gaaagagtct 840
gcggagtttc cgctgaccgc cgacaacgtt aaagcagtga tcctggatat gttcggcgct 900
ggtacggata ccagcagcgc gacgatcgag tgggcgatta gcgagctgat tcgctgccct 960
cgcgcgatgg agaaagtgca gacggaattg cgtcaggcac tgaatggcaa agagcgtatt 1020
caggaagagg atttgcagga gctgaattat ctgaagctgg tgattaaaga aaccctgcgc 1080
ctgcatccgc cgttgccgct ggtgatgccg cgtgagtgcc gtgaaccgtg tgttttgggc 1140
ggttacgaca ttccgagcaa aacgaagctg atcgttaatg ttttcgcgat taaccgtgac 1200
ccggaatact ggaaagacgc ggaaacgttt atgccggagc gttttgagaa tagcccgatt 1260
accgttatgg gttccgagta cgaatacctg ccatttggtg ctggtcgtcg tatgtgtcct 1320
ggtgcagcgc tgggtctggc caacgtggaa ctgccgctgg cgcacattct gtactatttc 1380
aactggaaac tgccgaacgg caagaccttc gaagatttgg acatgaccga gagctttggt 1440
gccactgtgc agcgcaaaac cgagctgctg ctggttccga ccgactttca aacgctgact 1500
gcgagcacct aatgagtcga ctaactttaa gaaggagata tatccatgga acctagctct 1560
cagaaactgt ctccgttgga atttgttgct gctatcctga agggcgacta cagcagcggt 1620
caggttgaag gtggtccacc gccaggtctg gcagctatgt tgatggaaaa taaggatttg 1680
gtgatggttc tgacgacgtc cgtggcagtc ctgatcggct gtgtcgtggt cctggcatgg 1740
cgtcgtgcgg caggtagcgg taagtacaag caacctgaac tgcctaaact ggtggtcccg 1800
aaagcagccg aaccggagga ggcagaggat gataaaacca agatcagcgt gtttttcggc 1860
acccaaaccg gtacggcaga aggtttcgcg aaggcttttg ttgaagaggc caaggcgcgt 1920
tatcagcagg cccgtttcaa agttatcgac ctggacgact atgcggcaga cgatgacgag 1980
tacgaagaga aactgaagaa ggaaaacttg gcattcttct tcttggcgtc ctacggtgac 2040
ggcgagccga cggacaacgc ggcacgcttt tacaaatggt ttacggaggg taaggaccgt 2100
ggtgaatggc tgaacaatct gcagtacggc gtttttggtc tgggtaaccg tcaatatgag 2160
catttcaata agatcgccat tgtcgtcgat gatctgatct tcgagcaagg tggcaagaag 2220
ctggttccgg tgggtctggg tgacgatgac cagtgcattg aggatgattt tgcggcgtgg 2280
cgtgaactgg tctggccgga actggataaa ctgctgcgta acgaagacga cgctaccgtg 2340
gcaaccccgt acagcgccgc tgtgctgcaa taccgcgtgg ttttccacga tcacattgac 2400
ggcctgatta gcgaaaacgg tagcccgaac ggtcatgcta atggcaatac cgtgtacgat 2460
gcgcaacacc cgtgccgtag caacgtcgcg gtcaagaagg aattgcatac tccggcgagc 2520
gatcgcagct gcacccacct ggaatttaac attagcggta ccggcctgat gtacgagacg 2580
ggtgaccacg tcggtgtgta ttgcgagaac ctgttggaaa ccgtggagga ggccgagaag 2640
ttgttgaacc tgagcccgca gacgtacttc tccgttcaca ccgacaacga ggacggtacg 2700
ccgttgagcg gcagcagcct gccgccaccg tttccgccgt gcaccttgcg cacggcattg 2760
accaaatacg cagacttgac ttctgcaccg aaaaagtcgg tgctggtggc gctggccgag 2820
tacgcatctg accagggtga agcggatcgt ttgcgtttct tggcgagccc gagcggcaaa 2880
gaggaatatg cacagtacat cttggcaagc cagcgcacgc tgctggaggt catggcggag 2940
ttcccgtcgg cgaaaccgcc gctgggtgtc tttttcgcgg gtgtcgctcc gcgcctgcag 3000
ccgcgtttct attccattag ctctagcccg aagatcgcac cgttccgtat tcacgtgacc 3060
tgcgccctgg tttatgacaa atcccctacc ggtcgcgttc ataagggcat ctgtagcacg 3120
tggatgaaaa atgcggtccc gctggaagaa agcaacgatt gttcctgggc tccgatcttc 3180
gtccgcaaca gcaacttcaa gctgccgacc gacccgaagg ttccgattat catgattggt 3240
ccgggtaccg gtctggcccc ttttcgtggc tttttgcaag agcgcttggc gttgaaagag 3300
agcggtgctg aattgggtcc ggcgatcttg ttctttggtt gccgtaaccg taaaatggac 3360
tttatttacg aggatgaact gaatgatttc gtcaaagcgg gcgttgtcag cgagctgatc 3420
gtcgctttta gccgcgaagg cccgatgaaa gaatacgtgc aacacaaaat gagccaacgt 3480
gcctccgatg tgtggaacat cattagcgac ggtggttatg tttatgtttg cggtgacgcg 3540
aagggtatgg ctcgtgatgt tcaccgtacc ctgcatacca tcgcacagga gcaaggtagc 3600
atgtccagct cggaggccga aggtatggtc aaaaacctgc aaaccaccgg tcgttacctg 3660
cgtgatgtgt ggtaataaaa gcttaggagg taaaacatat ggacagcagc accgccaccg 3720
caatgaccgc accattcatc gacccgacgg atcatgtgaa tctgaaaacc gacacggatg 3780
cgagcgaaaa tcgtcgtatg ggtaactaca agccgagcat ttggaactac gattttctgc 3840
agtccctggc gacgcaccac aacattgttg aagagcgtca cctgaagctg gcagagaaac 3900
tgaaaggtca agtgaaattc atgttcggtg cgccgatgga gccattggct aagttggagc 3960
tggttgatgt ggtgcaacgc ttgggtctga accacctgtt cgagactgaa atcaaagaag 4020
ctctgttcag catctacaaa gatggcagca atggctggtg gtttggccat ctgcatgcta 4080
cctctttgcg cttccgtctg ttgcgccaat gtggcctgtt tatcccgcag gacgttttca 4140
aaacctttca aaacaagacc ggtgagtttg acatgaagct gtgggacaac gttaagggcc 4200
tgctgagcct gtacgaggcg agctacctgg gctggaaggg cgagaacatc ttggatgaag 4260
caaaggcgtt cacgaccaag tgcctgaaga gcgcatggga gaacattagc gagaagtggc 4320
tggcgaagcg tgttaaacat gcgttggcgc tgccgctgca ctggcgtgtt ccgcgtattg 4380
aagcacgctg gtttatcgag gtgtacgaac aagaggccaa tatgaatccg acgctgctga 4440
aactggcgaa actggacttc aacatggtcc aaagcattca ccagaaagaa atcggtgaac 4500
tggcccgctg gtgggttact accggcctgg acaagctgga tttcgcacgc aacaatctgt 4560
tgcagtctta tatgtggagc tgcgccatcg cgtccgaccc gaaattcaaa ctggcgcgtg 4620
aaaccattgt cgagatcggt tccgtgttga cggttgtcga cgacggctat gatgtgtacg 4680
gttctatgga tgagctggac ctgtacacca gctcggtgga gcgttggtcc tgtgtcaaaa 4740
ttgacaagct gcctaatacg ctgaagctga tctttatgtc tatgttcaac aaaaccaacg 4800
aggtgggtct gcgtgttcaa cacgagcgtg gttacaatag catcccgacc ttcattaagg 4860
cgtgggtgga acagtgtaag agctatcaaa aagaggcgcg ttggtttcat ggtggtcaca 4920
cgcctccgct ggaagaatac agcctgaacg gtctggtcag cattggtttt ccgctgttgc 4980
tgatcaccgg ctatgttgcg attgctgaga atgaagcagc cctggataaa gtccacccgc 5040
tgccggacct gctgcattat tccagcttgc tgagccgtct gattaatgat atcggcacta 5100
gcccggatga aatggcgcgt ggtgacaatc tgaagagcat tcactgctat atgaatgaaa 5160
ccggtgccag cgaagaggtc gcacgcgagc acatcaaagg cgtcatcgaa gagaattgga 5220
aaattctgaa ccagtgttgc tttgaccagt cccagttcca ggagccgttc atcacgttta 5280
acctgaacag cgtgcgcggc tcgcatttct tctatgaatt tggtgatggt tttggtgtta 5340
ccgacagctg gaccaaggtg gatatgaaaa gcgtcctgat tgatccgatt ccgctgggtg 5400
aagagtaagc ttgc 5414
<210> 27
<211> 1512
<212> DNA
<213> 人工序列
<220>
<223> CYP71AV8-L358A DNA 序列
<400> 27
atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60
atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120
ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180
ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240
tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300
accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360
atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420
ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480
ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540
ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600
atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660
gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720
aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780
cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840
gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900
acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960
gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020
gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080
catccgccgg ctccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140
tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200
gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260
gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320
gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380
tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440
actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500
agcacctaat ga 1512
<210> 28
<211> 502
<212> PRT
<213> 人工序列
<220>
<223> CYP71AV8-L358A 氨基酸序列
<400> 28
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val
1 5 10 15
Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys
20 25 30
Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly
35 40 45
His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu
50 55 60
Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val
65 70 75 80
Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr
85 90 95
Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu
100 105 110
Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu
115 120 125
Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn
130 135 140
Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn
145 150 155 160
Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu
165 170 175
Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala
180 185 190
Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys
195 200 205
Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro
210 215 220
Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr
225 230 235 240
Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu
245 250 255
His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val
260 265 270
Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn
275 280 285
Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser
290 295 300
Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg
305 310 315 320
Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys
325 330 335
Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu
340 345 350
Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Ala Pro Leu Val Met
355 360 365
Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro
370 375 380
Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro
385 390 395 400
Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn
405 410 415
Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly
420 425 430
Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val
435 440 445
Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro
450 455 460
Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala
465 470 475 480
Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln
485 490 495
Thr Leu Thr Ala Ser Thr
500
<210> 29
<211> 1512
<212> DNA
<213> 人工序列
<220>
<223> CYP71AV8-L358F DNA 序列
<400> 29
atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60
atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120
ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180
ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240
tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300
accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360
atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420
ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480
ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540
ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600
atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660
gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720
aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780
cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840
gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900
acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960
gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020
gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080
catccgccgt ttccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140
tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200
gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260
gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320
gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380
tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440
actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac gctgactgcg 1500
agcacctaat ga 1512
<210> 30
<211> 502
<212> PRT
<213> 人工序列
<220>
<223> CYP71AV8-L358F 氨基酸序列
<400> 30
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val
1 5 10 15
Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys
20 25 30
Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly
35 40 45
His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu
50 55 60
Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val
65 70 75 80
Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr
85 90 95
Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu
100 105 110
Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu
115 120 125
Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn
130 135 140
Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn
145 150 155 160
Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu
165 170 175
Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala
180 185 190
Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys
195 200 205
Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro
210 215 220
Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr
225 230 235 240
Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu
245 250 255
His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val
260 265 270
Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn
275 280 285
Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser
290 295 300
Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg
305 310 315 320
Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys
325 330 335
Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu
340 345 350
Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Phe Pro Leu Val Met
355 360 365
Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro
370 375 380
Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro
385 390 395 400
Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn
405 410 415
Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly
420 425 430
Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val
435 440 445
Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro
450 455 460
Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala
465 470 475 480
Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln
485 490 495
Thr Leu Thr Ala Ser Thr
500
<210> 31
<211> 1512
<212> DNA
<213> 人工序列
<220>
<223> CYP71AV8-L358T DNA 序列
<400> 31
atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60
atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120
ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180
ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240
tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300
accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360
atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420
ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480
ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540
ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600
atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660
gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720
aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780
cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840
gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900
acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960
gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020
gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080
catccgccga ctccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140
tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200
gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260
gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320
gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380
tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440
actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500
agcacctaat ga 1512
<210> 32
<211> 502
<212> PRT
<213> 人工序列
<220>
<223> CYP71AV8-L358T 氨基酸序列
<400> 32
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val
1 5 10 15
Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys
20 25 30
Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly
35 40 45
His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu
50 55 60
Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val
65 70 75 80
Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr
85 90 95
Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu
100 105 110
Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu
115 120 125
Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn
130 135 140
Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn
145 150 155 160
Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu
165 170 175
Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala
180 185 190
Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys
195 200 205
Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro
210 215 220
Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr
225 230 235 240
Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu
245 250 255
His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val
260 265 270
Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn
275 280 285
Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser
290 295 300
Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg
305 310 315 320
Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys
325 330 335
Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu
340 345 350
Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Thr Pro Leu Val Met
355 360 365
Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro
370 375 380
Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro
385 390 395 400
Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn
405 410 415
Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly
420 425 430
Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val
435 440 445
Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro
450 455 460
Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala
465 470 475 480
Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln
485 490 495
Thr Leu Thr Ala Ser Thr
500
<210> 33
<211> 1512
<212> DNA
<213> 人工序列
<220>
<223> CYP71AV8-L358S DNA 序列
<400> 33
atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60
atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120
ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180
ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240
tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300
accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360
atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420
ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480
ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540
ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600
atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660
gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720
aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780
cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840
gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900
acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960
gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020
gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080
catccgccgt ctccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140
tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200
gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260
gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320
gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380
tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440
actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500
agcacctaat ga 1512
<210> 34
<211> 502
<212> PRT
<213> 人工序列
<220>
<223> CYP71AV8-L358S 氨基酸序列
<400> 34
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val
1 5 10 15
Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys
20 25 30
Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly
35 40 45
His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu
50 55 60
Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val
65 70 75 80
Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr
85 90 95
Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu
100 105 110
Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu
115 120 125
Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn
130 135 140
Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn
145 150 155 160
Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu
165 170 175
Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala
180 185 190
Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys
195 200 205
Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro
210 215 220
Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr
225 230 235 240
Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu
245 250 255
His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val
260 265 270
Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn
275 280 285
Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser
290 295 300
Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg
305 310 315 320
Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys
325 330 335
Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu
340 345 350
Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Ser Pro Leu Val Met
355 360 365
Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro
370 375 380
Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro
385 390 395 400
Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn
405 410 415
Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly
420 425 430
Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val
435 440 445
Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro
450 455 460
Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala
465 470 475 480
Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln
485 490 495
Thr Leu Thr Ala Ser Thr
500
<210> 35
<211> 1512
<212> DNA
<213> 人工序列
<220>
<223> CYP71AV8-L358V DNA 序列
<400> 35
atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60
atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120
ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180
ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240
tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300
accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360
atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420
ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480
ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540
ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600
atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660
gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720
aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780
cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840
gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900
acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960
gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020
gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080
catccgccgg ttccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140
tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200
gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260
gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320
gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380
tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440
actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500
agcacctaat ga 1512
<210> 36
<211> 502
<212> PRT
<213> 人工序列
<220>
<223> CYP71AV8-L358V 氨基酸序列
<400> 36
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val
1 5 10 15
Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys
20 25 30
Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly
35 40 45
His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu
50 55 60
Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val
65 70 75 80
Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr
85 90 95
Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu
100 105 110
Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu
115 120 125
Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn
130 135 140
Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn
145 150 155 160
Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu
165 170 175
Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala
180 185 190
Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys
195 200 205
Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro
210 215 220
Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr
225 230 235 240
Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu
245 250 255
His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val
260 265 270
Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn
275 280 285
Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser
290 295 300
Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg
305 310 315 320
Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys
325 330 335
Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu
340 345 350
Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Val Pro Leu Val Met
355 360 365
Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro
370 375 380
Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro
385 390 395 400
Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn
405 410 415
Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly
420 425 430
Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val
435 440 445
Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro
450 455 460
Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala
465 470 475 480
Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln
485 490 495
Thr Leu Thr Ala Ser Thr
500
<210> 37
<211> 1512
<212> DNA
<213> 人工序列
<220>
<223> CYP71AV8-L358G DNA 序列
<400> 37
atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60
atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120
ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180
ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240
tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300
accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360
atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420
ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480
ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540
ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600
atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660
gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720
aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780
cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840
gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900
acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960
gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020
gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080
catccgccgg ggccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140
tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200
gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260
gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320
gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380
tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440
actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500
agcacctaat ga 1512
<210> 38
<211> 502
<212> PRT
<213> 人工序列
<220>
<223> CYP71AV8-L358G 氨基酸序列
<400> 38
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val
1 5 10 15
Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys
20 25 30
Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly
35 40 45
His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu
50 55 60
Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val
65 70 75 80
Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr
85 90 95
Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu
100 105 110
Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu
115 120 125
Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn
130 135 140
Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn
145 150 155 160
Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu
165 170 175
Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala
180 185 190
Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys
195 200 205
Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro
210 215 220
Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr
225 230 235 240
Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu
245 250 255
His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val
260 265 270
Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn
275 280 285
Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser
290 295 300
Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg
305 310 315 320
Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys
325 330 335
Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu
340 345 350
Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Gly Pro Leu Val Met
355 360 365
Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro
370 375 380
Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro
385 390 395 400
Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn
405 410 415
Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly
420 425 430
Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val
435 440 445
Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro
450 455 460
Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala
465 470 475 480
Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln
485 490 495
Thr Leu Thr Ala Ser Thr
500
<210> 39
<211> 1512
<212> DNA
<213> 人工序列
<220>
<223> CYP71AV8-L358I DNA 序列
<400> 39
atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60
atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120
ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180
ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240
tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300
accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360
atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420
ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480
ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540
ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600
atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660
gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720
aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780
cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840
gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900
acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960
gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020
gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080
catccgccga ttccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140
tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200
gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260
gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320
gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380
tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440
actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500
agcacctaat ga 1512
<210> 40
<211> 502
<212> PRT
<213> 人工序列
<220>
<223> CYP71AV8-L358I 氨基酸序列
<400> 40
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val
1 5 10 15
Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys
20 25 30
Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly
35 40 45
His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu
50 55 60
Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val
65 70 75 80
Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr
85 90 95
Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu
100 105 110
Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu
115 120 125
Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn
130 135 140
Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn
145 150 155 160
Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu
165 170 175
Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala
180 185 190
Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys
195 200 205
Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro
210 215 220
Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr
225 230 235 240
Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu
245 250 255
His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val
260 265 270
Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn
275 280 285
Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser
290 295 300
Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg
305 310 315 320
Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys
325 330 335
Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu
340 345 350
Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Ile Pro Leu Val Met
355 360 365
Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro
370 375 380
Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro
385 390 395 400
Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn
405 410 415
Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly
420 425 430
Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val
435 440 445
Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro
450 455 460
Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala
465 470 475 480
Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln
485 490 495
Thr Leu Thr Ala Ser Thr
500
<210> 41
<211> 1512
<212> DNA
<213> 人工序列
<220>
<223> CYP71AV8-L358M DNA 序列
<400> 41
atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60
atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120
ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180
ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240
tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300
accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360
atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420
ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480
ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540
ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600
atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660
gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720
aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780
cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840
gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900
acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960
gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020
gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080
catccgccga tgccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140
tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200
gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260
gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320
gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380
tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440
actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500
agcacctaat ga 1512
<210> 42
<211> 502
<212> PRT
<213> 人工序列
<220>
<223> CYP71AV8-L358M 氨基酸序列
<400> 42
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val
1 5 10 15
Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys
20 25 30
Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly
35 40 45
His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu
50 55 60
Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val
65 70 75 80
Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr
85 90 95
Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu
100 105 110
Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu
115 120 125
Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn
130 135 140
Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn
145 150 155 160
Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu
165 170 175
Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala
180 185 190
Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys
195 200 205
Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro
210 215 220
Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr
225 230 235 240
Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu
245 250 255
His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val
260 265 270
Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn
275 280 285
Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser
290 295 300
Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg
305 310 315 320
Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys
325 330 335
Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu
340 345 350
Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Met Pro Leu Val Met
355 360 365
Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro
370 375 380
Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro
385 390 395 400
Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn
405 410 415
Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly
420 425 430
Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val
435 440 445
Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro
450 455 460
Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala
465 470 475 480
Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln
485 490 495
Thr Leu Thr Ala Ser Thr
500
<210> 43
<211> 1512
<212> DNA
<213> 人工序列
<220>
<223> CYP71AV8-L358P DNA 序列
<400> 43
atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60
atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120
ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180
ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240
tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300
accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360
atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420
ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480
ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540
ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600
atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660
gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720
aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780
cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840
gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900
acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960
gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020
gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080
catccgccgc ctccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140
tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200
gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260
gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320
gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380
tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440
actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500
agcacctaat ga 1512
<210> 44
<211> 502
<212> PRT
<213> 人工序列
<220>
<223> CYP71AV8-L358P 氨基酸序列
<400> 44
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val
1 5 10 15
Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys
20 25 30
Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly
35 40 45
His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu
50 55 60
Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val
65 70 75 80
Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr
85 90 95
Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu
100 105 110
Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu
115 120 125
Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn
130 135 140
Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn
145 150 155 160
Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu
165 170 175
Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala
180 185 190
Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys
195 200 205
Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro
210 215 220
Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr
225 230 235 240
Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu
245 250 255
His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val
260 265 270
Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn
275 280 285
Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser
290 295 300
Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg
305 310 315 320
Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys
325 330 335
Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu
340 345 350
Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Pro Pro Leu Val Met
355 360 365
Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro
370 375 380
Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro
385 390 395 400
Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn
405 410 415
Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly
420 425 430
Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val
435 440 445
Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro
450 455 460
Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala
465 470 475 480
Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln
485 490 495
Thr Leu Thr Ala Ser Thr
500
<210> 45
<211> 1512
<212> DNA
<213> 人工序列
<220>
<223> CYP71AV8-L358Y DNA 序列
<400> 45
atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60
atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120
ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180
ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240
tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300
accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360
atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420
ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480
ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540
ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600
atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660
gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720
aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780
cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840
gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900
acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960
gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020
gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080
catccgccgt atccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140
tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200
gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260
gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320
gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380
tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440
actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500
agcacctaat ga 1512
<210> 46
<211> 502
<212> PRT
<213> 人工序列
<220>
<223> CYP71AV8-L358Y 氨基酸序列
<400> 46
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val
1 5 10 15
Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys
20 25 30
Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly
35 40 45
His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu
50 55 60
Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val
65 70 75 80
Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr
85 90 95
Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu
100 105 110
Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu
115 120 125
Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn
130 135 140
Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn
145 150 155 160
Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu
165 170 175
Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala
180 185 190
Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys
195 200 205
Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro
210 215 220
Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr
225 230 235 240
Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu
245 250 255
His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val
260 265 270
Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn
275 280 285
Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser
290 295 300
Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg
305 310 315 320
Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys
325 330 335
Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu
340 345 350
Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Tyr Pro Leu Val Met
355 360 365
Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro
370 375 380
Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro
385 390 395 400
Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn
405 410 415
Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly
420 425 430
Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val
435 440 445
Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro
450 455 460
Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala
465 470 475 480
Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln
485 490 495
Thr Leu Thr Ala Ser Thr
500
<210> 47
<211> 1512
<212> DNA
<213> 人工序列
<220>
<223> CYP71AV8-L358W DNA 序列
<400> 47
atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60
atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120
ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180
ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240
tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300
accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360
atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420
ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480
ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540
ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600
atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660
gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720
aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780
cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840
gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900
acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960
gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020
gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080
catccgccgt ggccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140
tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200
gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260
gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320
gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380
tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440
actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500
agcacctaat ga 1512
<210> 48
<211> 502
<212> PRT
<213> 人工序列
<220>
<223> CYP71AV8-L358W 氨基酸序列
<400> 48
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val
1 5 10 15
Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys
20 25 30
Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly
35 40 45
His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu
50 55 60
Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val
65 70 75 80
Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr
85 90 95
Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu
100 105 110
Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu
115 120 125
Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn
130 135 140
Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn
145 150 155 160
Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu
165 170 175
Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala
180 185 190
Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys
195 200 205
Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro
210 215 220
Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr
225 230 235 240
Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu
245 250 255
His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val
260 265 270
Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn
275 280 285
Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser
290 295 300
Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg
305 310 315 320
Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys
325 330 335
Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu
340 345 350
Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Trp Pro Leu Val Met
355 360 365
Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro
370 375 380
Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro
385 390 395 400
Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn
405 410 415
Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly
420 425 430
Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val
435 440 445
Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro
450 455 460
Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala
465 470 475 480
Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln
485 490 495
Thr Leu Thr Ala Ser Thr
500
<210> 49
<211> 1512
<212> DNA
<213> 人工序列
<220>
<223> CYP71AV8-L358R DNA 序列
<400> 49
atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60
atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120
ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180
ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240
tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300
accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360
atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420
ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480
ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540
ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600
atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660
gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720
aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780
cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840
gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900
acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960
gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020
gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080
catccgccgc gtccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140
tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200
gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260
gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320
gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380
tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440
actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500
agcacctaat ga 1512
<210> 50
<211> 502
<212> PRT
<213> 人工序列
<220>
<223> CYP71AV8-L358R 氨基酸序列
<400> 50
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val
1 5 10 15
Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys
20 25 30
Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly
35 40 45
His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu
50 55 60
Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val
65 70 75 80
Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr
85 90 95
Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu
100 105 110
Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu
115 120 125
Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn
130 135 140
Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn
145 150 155 160
Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu
165 170 175
Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala
180 185 190
Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys
195 200 205
Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro
210 215 220
Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr
225 230 235 240
Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu
245 250 255
His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val
260 265 270
Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn
275 280 285
Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser
290 295 300
Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg
305 310 315 320
Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys
325 330 335
Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu
340 345 350
Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Arg Pro Leu Val Met
355 360 365
Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro
370 375 380
Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro
385 390 395 400
Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn
405 410 415
Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly
420 425 430
Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val
435 440 445
Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro
450 455 460
Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala
465 470 475 480
Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln
485 490 495
Thr Leu Thr Ala Ser Thr
500
<210> 51
<211> 1488
<212> DNA
<213> 黄花蒿(Artemisia annua)
<400> 51
atgaagagta tactaaaagc aatggcactc tcactgacca cttccattgc tcttgcaacg 60
atccttttgt tcgtttacaa gttcgctact cgttccaaat ccaccaaaaa aagccttcct 120
gagccatggc ggcttcccat tattggtcac atgcatcact tgattggtac aacgccacat 180
cgtggggtta gggatttagc cagaaagtat ggatctttga tgcatttaca gcttggtgaa 240
gttccaacaa tcgtggtgtc atctccgaaa tgggctaaag agattttgac aacgtacgac 300
attacctttg ctaacaggcc cgagacttta actggtgaga ttgttttata tcacaatacg 360
gatgttgttc ttgcacctta tggtgaatac tggaggcaat tacgtaaaat ttgcacattg 420
gagcttttga gtgttaagaa agtaaagtca tttcagtcac ttcgtgaaga ggagtgttgg 480
aatttggttc aagagattaa agcttcaggt tcagggagac cggttaacct ttcagagaat 540
gttttcaagt tgattgcaac gatacttagt agagccgcat ttgggaaagg gatcaaggac 600
cagaaagagt taacggagat tgtgaaagag atactgaggc aaactggtgg ttttgatgtg 660
gcagatatct ttccttcaaa gaaatttctt catcatcttt cgggcaagag agctcggtta 720
actagccttc gcaaaaagat cgataattta atcgataacc ttgtagctga gcatactgtt 780
aacacctcca gtaaaactaa cgagacactc ctcgatgttc ttttaaggct caaagacagt 840
gctgaattcc cattaacatc tgataacatt aaagccatca ttttggatat gtttggagca 900
ggcacagaca cttcctcatc cacaatcgaa tgggcgattt cggaactcat aaagtgtccg 960
aaagcaatgg agaaagtaca agcggaattg aggaaagcat tgaacggaaa agaaaagatc 1020
catgaggaag acattcaaga actaagctac ttgaacatgg taatcaaaga aacattgagg 1080
ttgcaccctc cactaccctt ggttctgcca agagagtgcc gccaaccagt caatttggct 1140
ggatacaaca tacccaataa gaccaaactt attgtcaacg tctttgcgat aaatagggac 1200
cctgaatatt ggaaagacgc tgaagctttc atccctgaac gatttgaaaa tagttctgca 1260
actgtcatgg gtgcagaata cgagtatctt ccgtttggag ctgggagaag gatgtgtcct 1320
ggagccgcac ttggtttagc taacgtgcag ctcccgctcg ctaatatact atatcatttc 1380
aactggaaac tccccaatgg tgtgagctat gaccagatcg acatgaccga gagctctgga 1440
gccacgatgc aaagaaagac tgagttgtta ctcgttccaa gtttctag 1488
<210> 52
<211> 495
<212> PRT
<213> 黄花蒿(Artemisia annua)
<400> 52
Met Lys Ser Ile Leu Lys Ala Met Ala Leu Ser Leu Thr Thr Ser Ile
1 5 10 15
Ala Leu Ala Thr Ile Leu Leu Phe Val Tyr Lys Phe Ala Thr Arg Ser
20 25 30
Lys Ser Thr Lys Lys Ser Leu Pro Glu Pro Trp Arg Leu Pro Ile Ile
35 40 45
Gly His Met His His Leu Ile Gly Thr Thr Pro His Arg Gly Val Arg
50 55 60
Asp Leu Ala Arg Lys Tyr Gly Ser Leu Met His Leu Gln Leu Gly Glu
65 70 75 80
Val Pro Thr Ile Val Val Ser Ser Pro Lys Trp Ala Lys Glu Ile Leu
85 90 95
Thr Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly
100 105 110
Glu Ile Val Leu Tyr His Asn Thr Asp Val Val Leu Ala Pro Tyr Gly
115 120 125
Glu Tyr Trp Arg Gln Leu Arg Lys Ile Cys Thr Leu Glu Leu Leu Ser
130 135 140
Val Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp
145 150 155 160
Asn Leu Val Gln Glu Ile Lys Ala Ser Gly Ser Gly Arg Pro Val Asn
165 170 175
Leu Ser Glu Asn Val Phe Lys Leu Ile Ala Thr Ile Leu Ser Arg Ala
180 185 190
Ala Phe Gly Lys Gly Ile Lys Asp Gln Lys Glu Leu Thr Glu Ile Val
195 200 205
Lys Glu Ile Leu Arg Gln Thr Gly Gly Phe Asp Val Ala Asp Ile Phe
210 215 220
Pro Ser Lys Lys Phe Leu His His Leu Ser Gly Lys Arg Ala Arg Leu
225 230 235 240
Thr Ser Leu Arg Lys Lys Ile Asp Asn Leu Ile Asp Asn Leu Val Ala
245 250 255
Glu His Thr Val Asn Thr Ser Ser Lys Thr Asn Glu Thr Leu Leu Asp
260 265 270
Val Leu Leu Arg Leu Lys Asp Ser Ala Glu Phe Pro Leu Thr Ser Asp
275 280 285
Asn Ile Lys Ala Ile Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr
290 295 300
Ser Ser Ser Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Lys Cys Pro
305 310 315 320
Lys Ala Met Glu Lys Val Gln Ala Glu Leu Arg Lys Ala Leu Asn Gly
325 330 335
Lys Glu Lys Ile His Glu Glu Asp Ile Gln Glu Leu Ser Tyr Leu Asn
340 345 350
Met Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Leu Pro Leu Val
355 360 365
Leu Pro Arg Glu Cys Arg Gln Pro Val Asn Leu Ala Gly Tyr Asn Ile
370 375 380
Pro Asn Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp
385 390 395 400
Pro Glu Tyr Trp Lys Asp Ala Glu Ala Phe Ile Pro Glu Arg Phe Glu
405 410 415
Asn Ser Ser Ala Thr Val Met Gly Ala Glu Tyr Glu Tyr Leu Pro Phe
420 425 430
Gly Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn
435 440 445
Val Gln Leu Pro Leu Ala Asn Ile Leu Tyr His Phe Asn Trp Lys Leu
450 455 460
Pro Asn Gly Val Ser Tyr Asp Gln Ile Asp Met Thr Glu Ser Ser Gly
465 470 475 480
Ala Thr Met Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Ser Phe
485 490 495
<210> 53
<211> 1500
<212> DNA
<213> 人工序列
<220>
<223> CYP71AV1密码子优化的DNA 序列
<400> 53
atgaccgtac acgacatcat cgcaacgtac ttcactaaat ggtacgtaat tgtgccgctg 60
gcactgattg cgtatcgcgt gctggattat ttctacgcga cccgttctaa aagcactaag 120
aaatctctgc cggaaccgtg gcgtctgcca atcatcggtc acatgcacca cctgatcggc 180
accaccccgc accgtggcgt acgcgacctg gcgcgtaagt acggctctct gatgcatctg 240
cagctgggcg aggtacctac tatcgtcgtt tcctccccga agtgggccaa agaaatcctg 300
actacctatg acatcacttt cgccaaccgc ccggaaacgc tgaccggcga aattgtcctg 360
taccataaca cggatgtggt tctggccccg tacggtgagt actggcgcca gctgcgcaaa 420
atttgtactc tggaactgct gagcgttaaa aaggttaaat ccttccagag cctgcgtgaa 480
gaggaatgct ggaacctggt gcaggagatt aaagcgtctg gcagcggtcg tccagttaac 540
ctgtctgaga atgtttttaa actgatcgct actatcctgt ctcgcgcggc attcggtaaa 600
ggtatcaaag atcagaaaga actgaccgaa atcgttaagg aaatcctgcg ccagactggt 660
ggcttcgacg ttgcggacat cttcccgtcc aaaaagttcc tgcaccatct gtctggcaaa 720
cgcgctcgtc tgacctccct gcgtaagaaa attgataacc tgattgacaa cctggtcgct 780
gagcacactg tgaacacctc ttctaaaacc aacgaaaccc tgctggacgt actgctgcgc 840
ctgaaggact ctgccgaatt tccactgact agcgacaata tcaaagcaat catcctggac 900
atgttcggcg ccggtaccga tacgtcctct tccacgattg agtgggctat ttccgaactg 960
atcaaatgcc cgaaggcgat ggaaaaagtg caggcggaac tgcgtaaagc gctgaacggt 1020
aaagagaaaa ttcatgaaga ggacatccag gaactgtcct acctgaatat ggtaatcaaa 1080
gaaactctgc gtctgcatcc gccgctgcca ctggttctgc cgcgtgaatg ccgtcagccg 1140
gttaacctgg ccggctacaa cattccgaac aaaacgaagc tgatcgtcaa cgttttcgcg 1200
atcaaccgcg atcctgaata ctggaaagac gcggaagcgt tcattccgga acgctttgag 1260
aactcctctg ccaccgttat gggcgctgaa tacgagtacc tgccgttcgg tgcgggtcgc 1320
cgtatgtgcc cgggtgctgc actgggcctg gcgaacgttc aactgccact ggcgaacatc 1380
ctgtaccact tcaactggaa actgcctaac ggcgtatctt atgatcaaat cgacatgacc 1440
gaaagctccg gcgcgaccat gcagcgtaaa accgaactgc tgctggttcc gtccttttaa 1500
<210> 54
<211> 499
<212> PRT
<213> 人工序列
<220>
<223> CYP71AV1密码子优化的氨基酸序列
<400> 54
Met Thr Val His Asp Ile Ile Ala Thr Tyr Phe Thr Lys Trp Tyr Val
1 5 10 15
Ile Val Pro Leu Ala Leu Ile Ala Tyr Arg Val Leu Asp Tyr Phe Tyr
20 25 30
Ala Thr Arg Ser Lys Ser Thr Lys Lys Ser Leu Pro Glu Pro Trp Arg
35 40 45
Leu Pro Ile Ile Gly His Met His His Leu Ile Gly Thr Thr Pro His
50 55 60
Arg Gly Val Arg Asp Leu Ala Arg Lys Tyr Gly Ser Leu Met His Leu
65 70 75 80
Gln Leu Gly Glu Val Pro Thr Ile Val Val Ser Ser Pro Lys Trp Ala
85 90 95
Lys Glu Ile Leu Thr Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu
100 105 110
Thr Leu Thr Gly Glu Ile Val Leu Tyr His Asn Thr Asp Val Val Leu
115 120 125
Ala Pro Tyr Gly Glu Tyr Trp Arg Gln Leu Arg Lys Ile Cys Thr Leu
130 135 140
Glu Leu Leu Ser Val Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu
145 150 155 160
Glu Glu Cys Trp Asn Leu Val Gln Glu Ile Lys Ala Ser Gly Ser Gly
165 170 175
Arg Pro Val Asn Leu Ser Glu Asn Val Phe Lys Leu Ile Ala Thr Ile
180 185 190
Leu Ser Arg Ala Ala Phe Gly Lys Gly Ile Lys Asp Gln Lys Glu Leu
195 200 205
Thr Glu Ile Val Lys Glu Ile Leu Arg Gln Thr Gly Gly Phe Asp Val
210 215 220
Ala Asp Ile Phe Pro Ser Lys Lys Phe Leu His His Leu Ser Gly Lys
225 230 235 240
Arg Ala Arg Leu Thr Ser Leu Arg Lys Lys Ile Asp Asn Leu Ile Asp
245 250 255
Asn Leu Val Ala Glu His Thr Val Asn Thr Ser Ser Lys Thr Asn Glu
260 265 270
Thr Leu Leu Asp Val Leu Leu Arg Leu Lys Asp Ser Ala Glu Phe Pro
275 280 285
Leu Thr Ser Asp Asn Ile Lys Ala Ile Ile Leu Asp Met Phe Gly Ala
290 295 300
Gly Thr Asp Thr Ser Ser Ser Thr Ile Glu Trp Ala Ile Ser Glu Leu
305 310 315 320
Ile Lys Cys Pro Lys Ala Met Glu Lys Val Gln Ala Glu Leu Arg Lys
325 330 335
Ala Leu Asn Gly Lys Glu Lys Ile His Glu Glu Asp Ile Gln Glu Leu
340 345 350
Ser Tyr Leu Asn Met Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro
355 360 365
Leu Pro Leu Val Leu Pro Arg Glu Cys Arg Gln Pro Val Asn Leu Ala
370 375 380
Gly Tyr Asn Ile Pro Asn Lys Thr Lys Leu Ile Val Asn Val Phe Ala
385 390 395 400
Ile Asn Arg Asp Pro Glu Tyr Trp Lys Asp Ala Glu Ala Phe Ile Pro
405 410 415
Glu Arg Phe Glu Asn Ser Ser Ala Thr Val Met Gly Ala Glu Tyr Glu
420 425 430
Tyr Leu Pro Phe Gly Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu
435 440 445
Gly Leu Ala Asn Val Gln Leu Pro Leu Ala Asn Ile Leu Tyr His Phe
450 455 460
Asn Trp Lys Leu Pro Asn Gly Val Ser Tyr Asp Gln Ile Asp Met Thr
465 470 475 480
Glu Ser Ser Gly Ala Thr Met Gln Arg Lys Thr Glu Leu Leu Leu Val
485 490 495
Pro Ser Phe
<210> 55
<211> 3150
<212> DNA
<213> 巨大芽孢杆菌(Bacillus megaterium)
<400> 55
atgacaatta aagaaatgcc tcagccaaaa acgtttggag agcttaaaaa tttaccgtta 60
ttaaacacag ataaaccggt tcaagctttg atgaaaattg cggatgaatt aggagaaatc 120
tttaaattcg aggcgcctgg tcgtgtaacg cgctacttat caagtcagcg tctaattaaa 180
gaagcatgcg atgaatcacg ctttgataaa aacttaagtc aagcgcttaa atttgtacgt 240
gattttgcag gagacgggtt atttacaagc tggacgcatg aaaaaaattg gaaaaaagcg 300
cataatatct tacttccaag cttcagtcag caggcaatga aaggctatca tgcgatgatg 360
gtcgatatcg ccgtgcagct tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt 420
gaagtaccgg aagacatgac acgtttaacg cttgatacaa ttggtctttg cggctttaac 480
tatcgcttta acagctttta ccgagatcag cctcatccat ttattacaag tatggtccgt 540
gcactggatg aagcaatgaa caagctgcag cgagcaaatc cagacgaccc agcttatgat 600
gaaaacaagc gccagtttca agaagatatc aaggtgatga acgacctagt agataaaatt 660
attgcagatc gcaaagcaag cggtgaacaa agcgatgatt tattaacgca catgctaaac 720
ggaaaagatc cagaaacggg tgagccgctt gatgacgaga acattcgcta tcaaattatt 780
acattcttaa ttgcgggaca cgaaacaaca agtggtcttt tatcatttgc gctgtatttc 840
ttagtgaaaa atccacatgt attacaaaaa gcagcagaag aagcagcacg agttctagta 900
gatcctgttc caagctacaa acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac 960
gaagcgctgc gcttatggcc aactgctcct gcgttttccc tatatgcaaa agaagatacg 1020
gtgcttggag gagaatatcc tttagaaaaa ggcgacgaac taatggttct gattcctcag 1080
cttcaccgtg ataaaacaat ttggggagac gatgtggaag agttccgtcc agagcgtttt 1140
gaaaatccaa gtgcgattcc gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg 1200
tgtatcggtc agcagttcgc tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa 1260
cactttgact ttgaagatca tacaaactac gagctggata ttaaagaaac tttaacgtta 1320
aaacctgaag gctttgtggt aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct 1380
tcacctagca ctgaacagtc tgctaaaaaa gtacgcaaaa aggcagaaaa cgctcataat 1440
acgccgctgc ttgtgctata cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat 1500
ttagcagata ttgcaatgag caaaggattt gcaccgcagg tcgcaacgct tgattcacac 1560
gccggaaatc ttccgcgcga aggagctgta ttaattgtaa cggcgtctta taacggtcat 1620
ccgcctgata acgcaaagca atttgtcgac tggttagacc aagcgtctgc tgatgaagta 1680
aaaggcgttc gctactccgt atttggatgc ggcgataaaa actgggctac tacgtatcaa 1740
aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 1800
cgcggtgaag cagatgcaag cgacgacttt gaaggcacct atgaagaatg gcgtgaacac 1860
atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 1920
tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 1980
ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 2040
agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 2100
ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 2160
ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 2220
ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 2280
acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 2340
cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 2400
atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 2460
cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 2520
aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 2580
tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 2640
tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 2700
atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 2760
ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 2820
catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 2880
cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 2940
gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 3000
ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 3060
gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 3120
cgatacgcaa aagacgtgtg ggctgggtaa 3150
<210> 56
<211> 1049
<212> PRT
<213> 巨大芽孢杆菌(Bacillus megaterium)
<400> 56
Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys
1 5 10 15
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys
20 25 30
Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg
35 40 45
Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp
50 55 60
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg
65 70 75 80
Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn
85 90 95
Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala
100 105 110
Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val
115 120 125
Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu
130 135 140
Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn
145 150 155 160
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr
165 170 175
Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala
180 185 190
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu
195 200 205
Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg
210 215 220
Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn
225 230 235 240
Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg
245 250 255
Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly
260 265 270
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu
275 280 285
Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro
290 295 300
Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn
305 310 315 320
Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala
325 330 335
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp
340 345 350
Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp
355 360 365
Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser
370 375 380
Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala
385 390 395 400
Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly
405 410 415
Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu
420 425 430
Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys
435 440 445
Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr
450 455 460
Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn
465 470 475 480
Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly
485 490 495
Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro
500 505 510
Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly
515 520 525
Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn
530 535 540
Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val
545 550 555 560
Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala
565 570 575
Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala
580 585 590
Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp
595 600 605
Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp
610 615 620
Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys
625 630 635 640
Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu
645 650 655
Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu
660 665 670
Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu
675 680 685
Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile
690 695 700
Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly
705 710 715 720
Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu
725 730 735
Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln
740 745 750
Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met
755 760 765
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu
770 775 780
Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr
785 790 795 800
Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser
805 810 815
Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile
820 825 830
Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser
835 840 845
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile
850 855 860
Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys
865 870 875 880
Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu
885 890 895
Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg
900 905 910
Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu
915 920 925
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr
930 935 940
Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr
945 950 955 960
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val
965 970 975
Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp
980 985 990
Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro
995 1000 1005
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln
1010 1015 1020
Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu
1025 1030 1035
Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
1040 1045
<210> 57
<211> 3150
<212> DNA
<213> 人工序列
<220>
<223> P450-BM3变体7 DNA 序列
<400> 57
atgacaatta aagaaatgcc tcagccaaaa acgtttggag agcttaaaaa tttaccgtta 60
ttaaacacag ataaaccggt tcaagctttg atgaaaattg cggatgaatt aggagaaatc 120
tttaaattcg aggcgcctgg tcgtgtaacg cgctacttat caagtcagcg tctaattaaa 180
gaagcatgcg atgaatcacg ctttgataaa aacttaagtc aagcgcttaa atttgtacgt 240
gattttgcag gagacgggtt aatcacaagc tggacgcatg aaaaaaattg gaaaaaagcg 300
cataatatct tacttccaag cttcagtcag caggcaatga aaggctatca tgcgatgatg 360
gtcgatatcg ccgtgcagct tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt 420
gaagtaccgg aagacatgac acgtttaacg cttgatacaa ttggtctttg cggctttaac 480
tatcgcttta acagctttta ccgagatcag cctcatccat ttattacaag tatggtccgt 540
gcactggatg aagcaatgaa caagctgcag cgagcaaatc cagacgaccc agcttatgat 600
gaaaacaagc gccagtttca agaagatatc aaggtgatga acgacctagt agataaaatt 660
attgcagatc gcaaagcaag cggtgaacaa agcgatgatt tattaacgca catgctaaac 720
ggaaaagatc cagaaacggg tgagccgctt gatgacgaga acattcgcta tcaaattatt 780
acattcttaa ttgcgggaca cgaaacaaca agtggtcttt tatcatttgc gctgtatttc 840
ttagtgaaaa atccacatgt attacaaaaa gcagcagaag aagcagcacg agttctagta 900
gatcctgttc caagctacaa acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac 960
gaagcgctgc gcttatggcc aactatccct gcgttttccc tatatgcaaa agaagatacg 1020
gtgcttggag gagaatatcc tttagaaaaa ggcgacgaac taatggttct gattcctcag 1080
cttcaccgtg ataaaacaat ttggggagac gatgtggaag agttccgtcc agagcgtttt 1140
gaaaatccaa gtgcgattcc gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg 1200
tgtatcggtc agcagttcgc tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa 1260
cactttgact ttgaagatca tacaaactac gagctggata ttaaagaaac tttaacgtta 1320
aaacctgaag gctttgtggt aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct 1380
tcacctagca ctgaacagtc tgctaaaaaa gtacgcaaaa aggcagaaaa cgctcataat 1440
acgccgctgc ttgtgctata cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat 1500
ttagcagata ttgcaatgag caaaggattt gcaccgcagg tcgcaacgct tgattcacac 1560
gccggaaatc ttccgcgcga aggagctgta ttaattgtaa cggcgtctta taacggtcat 1620
ccgcctgata acgcaaagca atttgtcgac tggttagacc aagcgtctgc tgatgaagta 1680
aaaggcgttc gctactccgt atttggatgc ggcgataaaa actgggctac tacgtatcaa 1740
aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 1800
cgcggtgaag cagatgcaag cgacgacttt gaaggcacct atgaagaatg gcgtgaacac 1860
atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 1920
tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 1980
ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 2040
agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 2100
ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 2160
ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 2220
ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 2280
acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 2340
cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 2400
atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 2460
cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 2520
aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 2580
tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 2640
tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 2700
atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 2760
ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 2820
catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 2880
cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 2940
gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 3000
ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 3060
gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 3120
cgatacgcaa aagacgtgtg ggctgggtaa 3150
<210> 58
<211> 1049
<212> PRT
<213> 人工序列
<220>
<223> P450-BM3变体7 氨基酸序列
<400> 58
Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys
1 5 10 15
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys
20 25 30
Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg
35 40 45
Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp
50 55 60
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg
65 70 75 80
Asp Phe Ala Gly Asp Gly Leu Ile Thr Ser Trp Thr His Glu Lys Asn
85 90 95
Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala
100 105 110
Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val
115 120 125
Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu
130 135 140
Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn
145 150 155 160
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr
165 170 175
Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala
180 185 190
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu
195 200 205
Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg
210 215 220
Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn
225 230 235 240
Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg
245 250 255
Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly
260 265 270
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu
275 280 285
Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro
290 295 300
Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn
305 310 315 320
Glu Ala Leu Arg Leu Trp Pro Thr Ile Pro Ala Phe Ser Leu Tyr Ala
325 330 335
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp
340 345 350
Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp
355 360 365
Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser
370 375 380
Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala
385 390 395 400
Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly
405 410 415
Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu
420 425 430
Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys
435 440 445
Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr
450 455 460
Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn
465 470 475 480
Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly
485 490 495
Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro
500 505 510
Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly
515 520 525
Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn
530 535 540
Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val
545 550 555 560
Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala
565 570 575
Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala
580 585 590
Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp
595 600 605
Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp
610 615 620
Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys
625 630 635 640
Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu
645 650 655
Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu
660 665 670
Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu
675 680 685
Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile
690 695 700
Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly
705 710 715 720
Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu
725 730 735
Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln
740 745 750
Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met
755 760 765
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu
770 775 780
Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr
785 790 795 800
Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser
805 810 815
Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile
820 825 830
Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser
835 840 845
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile
850 855 860
Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys
865 870 875 880
Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu
885 890 895
Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg
900 905 910
Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu
915 920 925
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr
930 935 940
Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr
945 950 955 960
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val
965 970 975
Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp
980 985 990
Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro
995 1000 1005
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln
1010 1015 1020
Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu
1025 1030 1035
Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
1040 1045
<210> 59
<211> 3150
<212> DNA
<213> 人工序列
<220>
<223> P450-BM3变体17 DNA 序列
<400> 59
atgacaatta aagaaatgcc tcagccaaaa acgtttggag agcttaaaaa tttaccgtta 60
ttaaacacag ataaaccggt tcaagctttg atgaaaattg cggatgaatt aggagaaatc 120
tttaaattcg aggcgcctgg tcgtgtaacg cgctacttat caagtcagcg tctaattaaa 180
gaagcatgcg atgaatcacg ctttgataaa aacttaagtc aagcgcttaa atttgtacgt 240
gattttgcag gagacgggtt agttacaagc tggacgcatg aaaaaaattg gaaaaaagcg 300
cataatatct tacttccaag cttcagtcag caggcaatga aaggctatca tgcgatgatg 360
gtcgatatcg ccgtgcagct tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt 420
gaagtaccgg aagacatgac acgtttaacg cttgatacaa ttggtctttg cggctttaac 480
tatcgcttta acagctttta ccgagatcag cctcatccat ttattacaag tatggtccgt 540
gcactggatg aagcaatgaa caagctgcag cgagcaaatc cagacgaccc agcttatgat 600
gaaaacaagc gccagtttca agaagatatc aaggtgatga acgacctagt agataaaatt 660
attgcagatc gcaaagcaag cggtgaacaa agcgatgatt tattaacgca catgctaaac 720
ggaaaagatc cagaaacggg tgagccgctt gatgacgaga acattcgcta tcaaattatt 780
acattcttaa ttgcgggaca cgaaacaaca agtggtcttt tatcatttgc gctgtatttc 840
ttagtgaaaa atccacatgt attacaaaaa gcagcagaag aagcagcacg agttctagta 900
gatcctgttc caagctacaa acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac 960
gaagcgctgc gcttatggcc aactatccct gcgttttccc tatatgcaaa agaagatacg 1020
gtgcttggag gagaatatcc tttagaaaaa ggcgacgaac taatggttct gattcctcag 1080
cttcaccgtg ataaaacaat ttggggagac gatgtggaag agttccgtcc agagcgtttt 1140
gaaaatccaa gtgcgattcc gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg 1200
tgtatcggtc agcagttcgc tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa 1260
cactttgact ttgaagatca tacaaactac gagctggata ttaaagaaac tttaacgtta 1320
aaacctgaag gctttgtggt aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct 1380
tcacctagca ctgaacagtc tgctaaaaaa gtacgcaaaa aggcagaaaa cgctcataat 1440
acgccgctgc ttgtgctata cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat 1500
ttagcagata ttgcaatgag caaaggattt gcaccgcagg tcgcaacgct tgattcacac 1560
gccggaaatc ttccgcgcga aggagctgta ttaattgtaa cggcgtctta taacggtcat 1620
ccgcctgata acgcaaagca atttgtcgac tggttagacc aagcgtctgc tgatgaagta 1680
aaaggcgttc gctactccgt atttggatgc ggcgataaaa actgggctac tacgtatcaa 1740
aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 1800
cgcggtgaag cagatgcaag cgacgacttt gaaggcacct atgaagaatg gcgtgaacac 1860
atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 1920
tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 1980
ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 2040
agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 2100
ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 2160
ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 2220
ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 2280
acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 2340
cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 2400
atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 2460
cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 2520
aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 2580
tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 2640
tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 2700
atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 2760
ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 2820
catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 2880
cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 2940
gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 3000
ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 3060
gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 3120
cgatacgcaa aagacgtgtg ggctgggtaa 3150
<210> 60
<211> 1049
<212> PRT
<213> 人工序列
<220>
<223> P450-BM3变体17 氨基酸序列
<400> 60
Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys
1 5 10 15
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys
20 25 30
Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg
35 40 45
Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp
50 55 60
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg
65 70 75 80
Asp Phe Ala Gly Asp Gly Leu Val Thr Ser Trp Thr His Glu Lys Asn
85 90 95
Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala
100 105 110
Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val
115 120 125
Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu
130 135 140
Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn
145 150 155 160
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr
165 170 175
Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala
180 185 190
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu
195 200 205
Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg
210 215 220
Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn
225 230 235 240
Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg
245 250 255
Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly
260 265 270
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu
275 280 285
Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro
290 295 300
Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn
305 310 315 320
Glu Ala Leu Arg Leu Trp Pro Thr Ile Pro Ala Phe Ser Leu Tyr Ala
325 330 335
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp
340 345 350
Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp
355 360 365
Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser
370 375 380
Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala
385 390 395 400
Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly
405 410 415
Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu
420 425 430
Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys
435 440 445
Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr
450 455 460
Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn
465 470 475 480
Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly
485 490 495
Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro
500 505 510
Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly
515 520 525
Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn
530 535 540
Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val
545 550 555 560
Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala
565 570 575
Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala
580 585 590
Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp
595 600 605
Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp
610 615 620
Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys
625 630 635 640
Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu
645 650 655
Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu
660 665 670
Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu
675 680 685
Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile
690 695 700
Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly
705 710 715 720
Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu
725 730 735
Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln
740 745 750
Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met
755 760 765
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu
770 775 780
Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr
785 790 795 800
Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser
805 810 815
Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile
820 825 830
Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser
835 840 845
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile
850 855 860
Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys
865 870 875 880
Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu
885 890 895
Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg
900 905 910
Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu
915 920 925
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr
930 935 940
Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr
945 950 955 960
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val
965 970 975
Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp
980 985 990
Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro
995 1000 1005
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln
1010 1015 1020
Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu
1025 1030 1035
Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
1040 1045
<210> 61
<211> 3150
<212> DNA
<213> 人工序列
<220>
<223> P450-BM3变体18 DNA 序列
<400> 61
atgacaatta aagaaatgcc tcagccaaaa acgtttggag agcttaaaaa tttaccgtta 60
ttaaacacag ataaaccggt tcaagctttg atgaaaattg cggatgaatt aggagaaatc 120
tttaaattcg aggcgcctgg tcgtgtaacg cgctacttat caagtcagcg tctaattaaa 180
gaagcatgcg atgaatcacg ctttgataaa aacttaagtc aagcgcttaa atttgtacgt 240
gattttgcag gagacgggtt agttacaagc tggacgcatg aaaaaaattg gaaaaaagcg 300
cataatatct tacttccaag cttcagtcag caggcaatga aaggctatca tgcgatgatg 360
gtcgatatcg ccgtgcagct tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt 420
gaagtaccgg aagacatgac acgtttaacg cttgatacaa ttggtctttg cggctttaac 480
tatcgcttta acagctttta ccgagatcag cctcatccat ttattacaag tatggtccgt 540
gcactggatg aagcaatgaa caagctgcag cgagcaaatc cagacgaccc agcttatgat 600
gaaaacaagc gccagtttca agaagatatc aaggtgatga acgacctagt agataaaatt 660
attgcagatc gcaaagcaag cggtgaacaa agcgatgatt tattaacgca catgctaaac 720
ggaaaagatc cagaaacggg tgagccgctt gatgacgaga acattcgcta tcaaattatt 780
acattcttaa ttgcgggaca cgaaacaaca agtggtcttt tatcatttgc gctgtatttc 840
ttagtgaaaa atccacatgt attacaaaaa gcagcagaag aagcagcacg agttctagta 900
gatcctgttc caagctacaa acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac 960
gaagcgctgc gcttatggcc aactctgcct gcgttttccc tatatgcaaa agaagatacg 1020
gtgcttggag gagaatatcc tttagaaaaa ggcgacgaac taatggttct gattcctcag 1080
cttcaccgtg ataaaacaat ttggggagac gatgtggaag agttccgtcc agagcgtttt 1140
gaaaatccaa gtgcgattcc gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg 1200
tgtatcggtc agcagttcgc tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa 1260
cactttgact ttgaagatca tacaaactac gagctggata ttaaagaaac tttaacgtta 1320
aaacctgaag gctttgtggt aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct 1380
tcacctagca ctgaacagtc tgctaaaaaa gtacgcaaaa aggcagaaaa cgctcataat 1440
acgccgctgc ttgtgctata cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat 1500
ttagcagata ttgcaatgag caaaggattt gcaccgcagg tcgcaacgct tgattcacac 1560
gccggaaatc ttccgcgcga aggagctgta ttaattgtaa cggcgtctta taacggtcat 1620
ccgcctgata acgcaaagca atttgtcgac tggttagacc aagcgtctgc tgatgaagta 1680
aaaggcgttc gctactccgt atttggatgc ggcgataaaa actgggctac tacgtatcaa 1740
aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 1800
cgcggtgaag cagatgcaag cgacgacttt gaaggcacct atgaagaatg gcgtgaacac 1860
atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 1920
tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 1980
ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 2040
agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 2100
ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 2160
ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 2220
ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 2280
acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 2340
cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 2400
atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 2460
cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 2520
aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 2580
tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 2640
tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 2700
atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 2760
ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 2820
catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 2880
cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 2940
gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 3000
ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 3060
gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 3120
cgatacgcaa aagacgtgtg ggctgggtaa 3150
<210> 62
<211> 1049
<212> PRT
<213> 人工序列
<220>
<223> P450-BM3变体18 氨基酸序列
<400> 62
Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys
1 5 10 15
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys
20 25 30
Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg
35 40 45
Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp
50 55 60
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg
65 70 75 80
Asp Phe Ala Gly Asp Gly Leu Val Thr Ser Trp Thr His Glu Lys Asn
85 90 95
Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala
100 105 110
Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val
115 120 125
Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu
130 135 140
Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn
145 150 155 160
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr
165 170 175
Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala
180 185 190
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu
195 200 205
Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg
210 215 220
Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn
225 230 235 240
Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg
245 250 255
Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly
260 265 270
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu
275 280 285
Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro
290 295 300
Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn
305 310 315 320
Glu Ala Leu Arg Leu Trp Pro Thr Leu Pro Ala Phe Ser Leu Tyr Ala
325 330 335
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp
340 345 350
Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp
355 360 365
Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser
370 375 380
Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala
385 390 395 400
Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly
405 410 415
Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu
420 425 430
Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys
435 440 445
Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr
450 455 460
Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn
465 470 475 480
Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly
485 490 495
Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro
500 505 510
Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly
515 520 525
Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn
530 535 540
Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val
545 550 555 560
Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala
565 570 575
Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala
580 585 590
Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp
595 600 605
Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp
610 615 620
Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys
625 630 635 640
Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu
645 650 655
Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu
660 665 670
Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu
675 680 685
Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile
690 695 700
Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly
705 710 715 720
Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu
725 730 735
Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln
740 745 750
Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met
755 760 765
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu
770 775 780
Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr
785 790 795 800
Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser
805 810 815
Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile
820 825 830
Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser
835 840 845
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile
850 855 860
Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys
865 870 875 880
Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu
885 890 895
Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg
900 905 910
Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu
915 920 925
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr
930 935 940
Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr
945 950 955 960
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val
965 970 975
Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp
980 985 990
Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro
995 1000 1005
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln
1010 1015 1020
Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu
1025 1030 1035
Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
1040 1045
<210> 63
<211> 3150
<212> DNA
<213> 人工序列
<220>
<223> P450-BM3变体19 DNA 序列
<400> 63
atgacaatta aagaaatgcc tcagccaaaa acgtttggag agcttaaaaa tttaccgtta 60
ttaaacacag ataaaccggt tcaagctttg atgaaaattg cggatgaatt aggagaaatc 120
tttaaattcg aggcgcctgg tcgtgtaacg cgctacttat caagtcagcg tctaattaaa 180
gaagcatgcg atgaatcacg ctttgataaa aacttaagtc aagcgcttaa atttgtacgt 240
gattttgcag gagacgggtt agttacaagc tggacgcatg aaaaaaattg gaaaaaagcg 300
cataatatct tacttccaag cttcagtcag caggcaatga aaggctatca tgcgatgatg 360
gtcgatatcg ccgtgcagct tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt 420
gaagtaccgg aagacatgac acgtttaacg cttgatacaa ttggtctttg cggctttaac 480
tatcgcttta acagctttta ccgagatcag cctcatccat ttattacaag tatggtccgt 540
gcactggatg aagcaatgaa caagctgcag cgagcaaatc cagacgaccc agcttatgat 600
gaaaacaagc gccagtttca agaagatatc aaggtgatga acgacctagt agataaaatt 660
attgcagatc gcaaagcaag cggtgaacaa agcgatgatt tattaacgca catgctaaac 720
ggaaaagatc cagaaacggg tgagccgctt gatgacgaga acattcgcta tcaaattatt 780
acattcttaa ttgcgggaca cgaaacaaca agtggtcttt tatcatttgc gctgtatttc 840
ttagtgaaaa atccacatgt attacaaaaa gcagcagaag aagcagcacg agttctagta 900
gatcctgttc caagctacaa acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac 960
gaagcgctgc gcttatggcc aactgttcct gcgttttccc tatatgcaaa agaagatacg 1020
gtgcttggag gagaatatcc tttagaaaaa ggcgacgaac taatggttct gattcctcag 1080
cttcaccgtg ataaaacaat ttggggagac gatgtggaag agttccgtcc agagcgtttt 1140
gaaaatccaa gtgcgattcc gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg 1200
tgtatcggtc agcagttcgc tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa 1260
cactttgact ttgaagatca tacaaactac gagctggata ttaaagaaac tttaacgtta 1320
aaacctgaag gctttgtggt aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct 1380
tcacctagca ctgaacagtc tgctaaaaaa gtacgcaaaa aggcagaaaa cgctcataat 1440
acgccgctgc ttgtgctata cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat 1500
ttagcagata ttgcaatgag caaaggattt gcaccgcagg tcgcaacgct tgattcacac 1560
gccggaaatc ttccgcgcga aggagctgta ttaattgtaa cggcgtctta taacggtcat 1620
ccgcctgata acgcaaagca atttgtcgac tggttagacc aagcgtctgc tgatgaagta 1680
aaaggcgttc gctactccgt atttggatgc ggcgataaaa actgggctac tacgtatcaa 1740
aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 1800
cgcggtgaag cagatgcaag cgacgacttt gaaggcacct atgaagaatg gcgtgaacac 1860
atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 1920
tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 1980
ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 2040
agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 2100
ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 2160
ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 2220
ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 2280
acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 2340
cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 2400
atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 2460
cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 2520
aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 2580
tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 2640
tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 2700
atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 2760
ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 2820
catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 2880
cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 2940
gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 3000
ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 3060
gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 3120
cgatacgcaa aagacgtgtg ggctgggtaa 3150
<210> 64
<211> 1049
<212> PRT
<213> 人工序列
<220>
<223> P450-BM3变体19 氨基酸序列
<400> 64
Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys
1 5 10 15
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys
20 25 30
Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg
35 40 45
Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp
50 55 60
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg
65 70 75 80
Asp Phe Ala Gly Asp Gly Leu Val Thr Ser Trp Thr His Glu Lys Asn
85 90 95
Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala
100 105 110
Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val
115 120 125
Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu
130 135 140
Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn
145 150 155 160
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr
165 170 175
Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala
180 185 190
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu
195 200 205
Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg
210 215 220
Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn
225 230 235 240
Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg
245 250 255
Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly
260 265 270
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu
275 280 285
Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro
290 295 300
Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn
305 310 315 320
Glu Ala Leu Arg Leu Trp Pro Thr Val Pro Ala Phe Ser Leu Tyr Ala
325 330 335
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp
340 345 350
Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp
355 360 365
Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser
370 375 380
Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala
385 390 395 400
Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly
405 410 415
Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu
420 425 430
Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys
435 440 445
Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr
450 455 460
Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn
465 470 475 480
Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly
485 490 495
Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro
500 505 510
Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly
515 520 525
Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn
530 535 540
Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val
545 550 555 560
Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala
565 570 575
Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala
580 585 590
Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp
595 600 605
Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp
610 615 620
Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys
625 630 635 640
Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu
645 650 655
Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu
660 665 670
Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu
675 680 685
Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile
690 695 700
Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly
705 710 715 720
Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu
725 730 735
Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln
740 745 750
Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met
755 760 765
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu
770 775 780
Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr
785 790 795 800
Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser
805 810 815
Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile
820 825 830
Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser
835 840 845
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile
850 855 860
Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys
865 870 875 880
Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu
885 890 895
Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg
900 905 910
Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu
915 920 925
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr
930 935 940
Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr
945 950 955 960
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val
965 970 975
Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp
980 985 990
Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro
995 1000 1005
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln
1010 1015 1020
Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu
1025 1030 1035
Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
1040 1045
<210> 65
<211> 3150
<212> DNA
<213> 人工序列
<220>
<223> P450-BM3变体20 DNA 序列
<400> 65
atgacaatta aagaaatgcc tcagccaaaa acgtttggag agcttaaaaa tttaccgtta 60
ttaaacacag ataaaccggt tcaagctttg atgaaaattg cggatgaatt aggagaaatc 120
tttaaattcg aggcgcctgg tcgtgtaacg cgctacttat caagtcagcg tctaattaaa 180
gaagcatgcg atgaatcacg ctttgataaa aacttaagtc aagcgcttaa atttgtacgt 240
gattttgcag gagacgggtt agttacaagc tggacgcatg aaaaaaattg gaaaaaagcg 300
cataatatct tacttccaag cttcagtcag caggcaatga aaggctatca tgcgatgatg 360
gtcgatatcg ccgtgcagct tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt 420
gaagtaccgg aagacatgac acgtttaacg cttgatacaa ttggtctttg cggctttaac 480
tatcgcttta acagctttta ccgagatcag cctcatccat ttattacaag tatggtccgt 540
gcactggatg aagcaatgaa caagctgcag cgagcaaatc cagacgaccc agcttatgat 600
gaaaacaagc gccagtttca agaagatatc aaggtgatga acgacctagt agataaaatt 660
attgcagatc gcaaagcaag cggtgaacaa agcgatgatt tattaacgca catgctaaac 720
ggaaaagatc cagaaacggg tgagccgctt gatgacgaga acattcgcta tcaaattatt 780
acattcttaa ttgcgggaca cgaaacaaca agtggtcttt tatcatttgc gctgtatttc 840
ttagtgaaaa atccacatgt attacaaaaa gcagcagaag aagcagcacg agttctagta 900
gatcctgttc caagctacaa acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac 960
gaagcgctgc gcttatggcc aacttttcct gcgttttccc tatatgcaaa agaagatacg 1020
gtgcttggag gagaatatcc tttagaaaaa ggcgacgaac taatggttct gattcctcag 1080
cttcaccgtg ataaaacaat ttggggagac gatgtggaag agttccgtcc agagcgtttt 1140
gaaaatccaa gtgcgattcc gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg 1200
tgtatcggtc agcagttcgc tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa 1260
cactttgact ttgaagatca tacaaactac gagctggata ttaaagaaac tttaacgtta 1320
aaacctgaag gctttgtggt aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct 1380
tcacctagca ctgaacagtc tgctaaaaaa gtacgcaaaa aggcagaaaa cgctcataat 1440
acgccgctgc ttgtgctata cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat 1500
ttagcagata ttgcaatgag caaaggattt gcaccgcagg tcgcaacgct tgattcacac 1560
gccggaaatc ttccgcgcga aggagctgta ttaattgtaa cggcgtctta taacggtcat 1620
ccgcctgata acgcaaagca atttgtcgac tggttagacc aagcgtctgc tgatgaagta 1680
aaaggcgttc gctactccgt atttggatgc ggcgataaaa actgggctac tacgtatcaa 1740
aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 1800
cgcggtgaag cagatgcaag cgacgacttt gaaggcacct atgaagaatg gcgtgaacac 1860
atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 1920
tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 1980
ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 2040
agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 2100
ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 2160
ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 2220
ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 2280
acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 2340
cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 2400
atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 2460
cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 2520
aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 2580
tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 2640
tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 2700
atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 2760
ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 2820
catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 2880
cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 2940
gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 3000
ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 3060
gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 3120
cgatacgcaa aagacgtgtg ggctgggtaa 3150
<210> 66
<211> 1049
<212> PRT
<213> 人工序列
<220>
<223> P450-BM3变体20 氨基酸序列
<400> 66
Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys
1 5 10 15
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys
20 25 30
Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg
35 40 45
Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp
50 55 60
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg
65 70 75 80
Asp Phe Ala Gly Asp Gly Leu Val Thr Ser Trp Thr His Glu Lys Asn
85 90 95
Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala
100 105 110
Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val
115 120 125
Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu
130 135 140
Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn
145 150 155 160
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr
165 170 175
Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala
180 185 190
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu
195 200 205
Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg
210 215 220
Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn
225 230 235 240
Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg
245 250 255
Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly
260 265 270
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu
275 280 285
Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro
290 295 300
Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn
305 310 315 320
Glu Ala Leu Arg Leu Trp Pro Thr Phe Pro Ala Phe Ser Leu Tyr Ala
325 330 335
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp
340 345 350
Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp
355 360 365
Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser
370 375 380
Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala
385 390 395 400
Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly
405 410 415
Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu
420 425 430
Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys
435 440 445
Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr
450 455 460
Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn
465 470 475 480
Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly
485 490 495
Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro
500 505 510
Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly
515 520 525
Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn
530 535 540
Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val
545 550 555 560
Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala
565 570 575
Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala
580 585 590
Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp
595 600 605
Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp
610 615 620
Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys
625 630 635 640
Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu
645 650 655
Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu
660 665 670
Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu
675 680 685
Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile
690 695 700
Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly
705 710 715 720
Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu
725 730 735
Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln
740 745 750
Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met
755 760 765
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu
770 775 780
Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr
785 790 795 800
Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser
805 810 815
Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile
820 825 830
Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser
835 840 845
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile
850 855 860
Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys
865 870 875 880
Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu
885 890 895
Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg
900 905 910
Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu
915 920 925
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr
930 935 940
Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr
945 950 955 960
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val
965 970 975
Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp
980 985 990
Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro
995 1000 1005
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln
1010 1015 1020
Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu
1025 1030 1035
Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
1040 1045
<210> 67
<211> 3150
<212> DNA
<213> 人工序列
<220>
<223> P450-BM3变体23 DNA 序列
<400> 67
atgacaatta aagaaatgcc tcagccaaaa acgtttggag agcttaaaaa tttaccgtta 60
ttaaacacag ataaaccggt tcaagctttg atgaaaattg cggatgaatt aggagaaatc 120
tttaaattcg aggcgcctgg tcgtgtaacg cgctacttat caagtcagcg tctaattaaa 180
gaagcatgcg atgaatcacg ctttgataaa aacttaagtc aagcgcttaa atttgtacgt 240
gattttgcag gagacgggtt atttacaagc tggacgcatg aaaaaaattg gaaaaaagcg 300
cataatatct tacttccaag cttcagtcag caggcaatga aaggctatca tgcgatgatg 360
gtcgatatcg ccgtgcagct tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt 420
gaagtaccgg aagacatgac acgtttaacg cttgatacaa ttggtctttg cggctttaac 480
tatcgcttta acagctttta ccgagatcag cctcatccat ttattacaag tatggtccgt 540
gcactggatg aagcaatgaa caagctgcag cgagcaaatc cagacgaccc agcttatgat 600
gaaaacaagc gccagtttca agaagatatc aaggtgatga acgacctagt agataaaatt 660
attgcagatc gcaaagcaag cggtgaacaa agcgatgatt tattaacgca catgctaaac 720
ggaaaagatc cagaaacggg tgagccgctt gatgacgaga acattcgcta tcaaattatt 780
acattcttaa ttgcgggaca cgaaacaaca agtggtcttt tatcatttgc gctgtatttc 840
ttagtgaaaa atccacatgt attacaaaaa gcagcagaag aagcagcacg agttctagta 900
gatcctgttc caagctacaa acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac 960
gaagcgctgc gcttatggcc aactgttcct gcgttttccc tatatgcaaa agaagatacg 1020
gtgcttggag gagaatatcc tttagaaaaa ggcgacgaac taatggttct gattcctcag 1080
cttcaccgtg ataaaacaat ttggggagac gatgtggaag agttccgtcc agagcgtttt 1140
gaaaatccaa gtgcgattcc gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg 1200
tgtatcggtc agcagttcgc tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa 1260
cactttgact ttgaagatca tacaaactac gagctggata ttaaagaaac tttaacgtta 1320
aaacctgaag gctttgtggt aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct 1380
tcacctagca ctgaacagtc tgctaaaaaa gtacgcaaaa aggcagaaaa cgctcataat 1440
acgccgctgc ttgtgctata cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat 1500
ttagcagata ttgcaatgag caaaggattt gcaccgcagg tcgcaacgct tgattcacac 1560
gccggaaatc ttccgcgcga aggagctgta ttaattgtaa cggcgtctta taacggtcat 1620
ccgcctgata acgcaaagca atttgtcgac tggttagacc aagcgtctgc tgatgaagta 1680
aaaggcgttc gctactccgt atttggatgc ggcgataaaa actgggctac tacgtatcaa 1740
aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 1800
cgcggtgaag cagatgcaag cgacgacttt gaaggcacct atgaagaatg gcgtgaacac 1860
atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 1920
tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 1980
ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 2040
agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 2100
ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 2160
ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 2220
ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 2280
acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 2340
cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 2400
atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 2460
cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 2520
aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 2580
tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 2640
tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 2700
atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 2760
ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 2820
catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 2880
cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 2940
gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 3000
ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 3060
gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 3120
cgatacgcaa aagacgtgtg ggctgggtaa 3150
<210> 68
<211> 1049
<212> PRT
<213> 人工序列
<220>
<223> P450-BM3变体23 氨基酸序列
<400> 68
Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys
1 5 10 15
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys
20 25 30
Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg
35 40 45
Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp
50 55 60
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg
65 70 75 80
Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn
85 90 95
Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala
100 105 110
Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val
115 120 125
Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu
130 135 140
Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn
145 150 155 160
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr
165 170 175
Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala
180 185 190
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu
195 200 205
Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg
210 215 220
Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn
225 230 235 240
Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg
245 250 255
Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly
260 265 270
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu
275 280 285
Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro
290 295 300
Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn
305 310 315 320
Glu Ala Leu Arg Leu Trp Pro Thr Val Pro Ala Phe Ser Leu Tyr Ala
325 330 335
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp
340 345 350
Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp
355 360 365
Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser
370 375 380
Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala
385 390 395 400
Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly
405 410 415
Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu
420 425 430
Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys
435 440 445
Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr
450 455 460
Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn
465 470 475 480
Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly
485 490 495
Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro
500 505 510
Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly
515 520 525
Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn
530 535 540
Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val
545 550 555 560
Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala
565 570 575
Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala
580 585 590
Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp
595 600 605
Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp
610 615 620
Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys
625 630 635 640
Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu
645 650 655
Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu
660 665 670
Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu
675 680 685
Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile
690 695 700
Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly
705 710 715 720
Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu
725 730 735
Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln
740 745 750
Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met
755 760 765
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu
770 775 780
Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr
785 790 795 800
Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser
805 810 815
Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile
820 825 830
Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser
835 840 845
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile
850 855 860
Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys
865 870 875 880
Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu
885 890 895
Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg
900 905 910
Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu
915 920 925
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr
930 935 940
Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr
945 950 955 960
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val
965 970 975
Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp
980 985 990
Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro
995 1000 1005
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln
1010 1015 1020
Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu
1025 1030 1035
Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
1040 1045
<210> 69
<211> 2051
<212> DNA
<213> 檀香树(Santalum album)
<400> 69
atgtacgtat ccatcagcaa tgatcgacct tataaaggag ccgagacact ctcaccttca 60
atccactcat ccctacattc ttttgctaac tcctttgttg ccagcaagta tatctcttac 120
gttaaacgtt ttacttcctc aacatgtctc cggcaacagc cgttatcctc actctcctcg 180
tggccctagg gctatccatc cttttgcggc ggcgccaaaa aagaaataat ctacctcccg 240
gtccacccgc tttaccgatc atcggaaaca tccacatatt ggggaccctt cctcaccaga 300
gcctctacaa cttggccaag aagtatggtc ccatcatgtc aatgaggctg gggctcgtgc 360
cggctgttgt gatatcctct ccggaggccg ccgagctcgt cctcaagacc cacgatatcg 420
ttttcgccag ccggcccaga ctccaagttg cggactactt ccattacggg acaaagggcg 480
tcatcctgac ggagtatggt acatattggc gcaacatgcg aaggctgtgc accgtgaagc 540
ttctcaacac ggtgaaaatc gattctttcg cagggacaag gaagaaggag gtggcatcgt 600
tcgtgcagtc ccttaaggag gcttcggtgg cacacaaaat ggtgaatttg agcgcgaggg 660
tggcgaacgt cattgaaaac atggtgtgcc ttatggtgat cgggcgaagt agcgatgaga 720
ggtttaagct aaaggaggtc atccaggagg cagcgcagtt ggcgggagct ttcaatatag 780
gggattatgt tccattcctt atgccccttg acctacaggg attaactcgg cgcataaagt 840
caggaagtaa agctttcgac gacatcttgg aagtcataat cgacgagcac gtgcaagaca 900
ttaaggacca tgatgatgaa caacatggag acttcattga tgtgttgctg gcaatgatga 960
acaagcccat ggattcgcgg gagggtctta gtatcattga ccgaacaaac atcaaagcga 1020
tcctagtgga catgattgga gctgcaatgg acacttcaac aagtggcgtc gagtgggcga 1080
tttcagagct catcaagcat ccgcgggtaa tgaaaaagct ccaagacgag gtcaaaactg 1140
tcatcggaat gaataggatg gtcgaggagg ccgacttgcc taagctacca tacctcgaca 1200
tggtagtgaa agagaccatg aggttacacc ctcctggacc attgctcgtg ccccgagagt 1260
ccatggaaga catcacaatc aacggatact acatacctaa gaaatcgcga atcattgtca 1320
acgcctgggc aattgggcgt gatacaaacg cctggtctaa taacgcgcac gagttcttcc 1380
cagagaggtt tatgagtagc aatgtggact tacagggaca agatttccaa cttatcccat 1440
tcgggtcagg tcggagaggg tgccccggga tgcgcctagg cctcacaacc gttcgattag 1500
tgttagcgca gctcattcat tgtttcgact tggagcttcc taagggaacc gtggcgaccg 1560
acttggacat gagtgagaaa ttcgggttgg caatgcccag agcccagcac ttgcttgcat 1620
ttccaaccta tcgcttggag tcctaaacca ttgaggaaga tgcgtttata tttcatattg 1680
cagtgttaca ataagtagca gtcgttttca tggtgaagag gcaattcccc ctacactacc 1740
tgtcttatgc tatgcccctc cccaactttc accgtatgtg tcttgtcatc atgtatcatg 1800
tccacatcaa taagatatta tatagaaatt gtcggtacgc caagatcgga ctcaatatgt 1860
atcagctttg agctctgtac acaaaatttg atacacgaac agagaaggtc gcgaattttg 1920
ggccactcgt ctcagatata tacccttcaa gtggctaatg gggagatccc tctcctttgc 1980
atttaaagcc tctgcttccc gaaccctagc ccacaaaatt ttggccgaaa ccggataggc 2040
atacacgaca g 2051
<210> 70
<211> 1503
<212> DNA
<213> 檀香树(Santalum album)
<400> 70
atgtctccgg caacagccgt tatcctcact ctcctcgtgg ccctagggct atccatcctt 60
ttgcggcggc gccaaaaaag aaataatcta cctcccggtc cacccgcttt accgatcatc 120
ggaaacatcc acatattggg gacccttcct caccagagcc tctacaactt ggccaagaag 180
tatggtccca tcatgtcaat gaggctgggg ctcgtgccgg ctgttgtgat atcctctccg 240
gaggccgccg agctcgtcct caagacccac gatatcgttt tcgccagccg gcccagactc 300
caagttgcgg actacttcca ttacgggaca aagggcgtca tcctgacgga gtatggtaca 360
tattggcgca acatgcgaag gctgtgcacc gtgaagcttc tcaacacggt gaaaatcgat 420
tctttcgcag ggacaaggaa gaaggaggtg gcatcgttcg tgcagtccct taaggaggct 480
tcggtggcac acaaaatggt gaatttgagc gcgagggtgg cgaacgtcat tgaaaacatg 540
gtgtgcctta tggtgatcgg gcgaagtagc gatgagaggt ttaagctaaa ggaggtcatc 600
caggaggcag cgcagttggc gggagctttc aatatagggg attatgttcc attccttatg 660
ccccttgacc tacagggatt aactcggcgc ataaagtcag gaagtaaagc tttcgacgac 720
atcttggaag tcataatcga cgagcacgtg caagacatta aggaccatga tgatgaacaa 780
catggagact tcattgatgt gttgctggca atgatgaaca agcccatgga ttcgcgggag 840
ggtcttagta tcattgaccg aacaaacatc aaagcgatcc tagtggacat gattggagct 900
gcaatggaca cttcaacaag tggcgtcgag tgggcgattt cagagctcat caagcatccg 960
cgggtaatga aaaagctcca agacgaggtc aaaactgtca tcggaatgaa taggatggtc 1020
gaggaggccg acttgcctaa gctaccatac ctcgacatgg tagtgaaaga gaccatgagg 1080
ttacaccctc ctggaccatt gctcgtgccc cgagagtcca tggaagacat cacaatcaac 1140
ggatactaca tacctaagaa atcgcgaatc attgtcaacg cctgggcaat tgggcgtgat 1200
acaaacgcct ggtctaataa cgcgcacgag ttcttcccag agaggtttat gagtagcaat 1260
gtggacttac agggacaaga tttccaactt atcccattcg ggtcaggtcg gagagggtgc 1320
cccgggatgc gcctaggcct cacaaccgtt cgattagtgt tagcgcagct cattcattgt 1380
ttcgacttgg agcttcctaa gggaaccgtg gcgaccgact tggacatgag tgagaaattc 1440
gggttggcaa tgcccagagc ccagcacttg cttgcatttc caacctatcg cttggagtcc 1500
taa 1503
<210> 71
<211> 500
<212> PRT
<213> 檀香树(Santalum album)
<400> 71
Met Ser Pro Ala Thr Ala Val Ile Leu Thr Leu Leu Val Ala Leu Gly
1 5 10 15
Leu Ser Ile Leu Leu Arg Arg Arg Gln Lys Arg Asn Asn Leu Pro Pro
20 25 30
Gly Pro Pro Ala Leu Pro Ile Ile Gly Asn Ile His Ile Leu Gly Thr
35 40 45
Leu Pro His Gln Ser Leu Tyr Asn Leu Ala Lys Lys Tyr Gly Pro Ile
50 55 60
Met Ser Met Arg Leu Gly Leu Val Pro Ala Val Val Ile Ser Ser Pro
65 70 75 80
Glu Ala Ala Glu Leu Val Leu Lys Thr His Asp Ile Val Phe Ala Ser
85 90 95
Arg Pro Arg Leu Gln Val Ala Asp Tyr Phe His Tyr Gly Thr Lys Gly
100 105 110
Val Ile Leu Thr Glu Tyr Gly Thr Tyr Trp Arg Asn Met Arg Arg Leu
115 120 125
Cys Thr Val Lys Leu Leu Asn Thr Val Lys Ile Asp Ser Phe Ala Gly
130 135 140
Thr Arg Lys Lys Glu Val Ala Ser Phe Val Gln Ser Leu Lys Glu Ala
145 150 155 160
Ser Val Ala His Lys Met Val Asn Leu Ser Ala Arg Val Ala Asn Val
165 170 175
Ile Glu Asn Met Val Cys Leu Met Val Ile Gly Arg Ser Ser Asp Glu
180 185 190
Arg Phe Lys Leu Lys Glu Val Ile Gln Glu Ala Ala Gln Leu Ala Gly
195 200 205
Ala Phe Asn Ile Gly Asp Tyr Val Pro Phe Leu Met Pro Leu Asp Leu
210 215 220
Gln Gly Leu Thr Arg Arg Ile Lys Ser Gly Ser Lys Ala Phe Asp Asp
225 230 235 240
Ile Leu Glu Val Ile Ile Asp Glu His Val Gln Asp Ile Lys Asp His
245 250 255
Asp Asp Glu Gln His Gly Asp Phe Ile Asp Val Leu Leu Ala Met Met
260 265 270
Asn Lys Pro Met Asp Ser Arg Glu Gly Leu Ser Ile Ile Asp Arg Thr
275 280 285
Asn Ile Lys Ala Ile Leu Val Asp Met Ile Gly Ala Ala Met Asp Thr
290 295 300
Ser Thr Ser Gly Val Glu Trp Ala Ile Ser Glu Leu Ile Lys His Pro
305 310 315 320
Arg Val Met Lys Lys Leu Gln Asp Glu Val Lys Thr Val Ile Gly Met
325 330 335
Asn Arg Met Val Glu Glu Ala Asp Leu Pro Lys Leu Pro Tyr Leu Asp
340 345 350
Met Val Val Lys Glu Thr Met Arg Leu His Pro Pro Gly Pro Leu Leu
355 360 365
Val Pro Arg Glu Ser Met Glu Asp Ile Thr Ile Asn Gly Tyr Tyr Ile
370 375 380
Pro Lys Lys Ser Arg Ile Ile Val Asn Ala Trp Ala Ile Gly Arg Asp
385 390 395 400
Thr Asn Ala Trp Ser Asn Asn Ala His Glu Phe Phe Pro Glu Arg Phe
405 410 415
Met Ser Ser Asn Val Asp Leu Gln Gly Gln Asp Phe Gln Leu Ile Pro
420 425 430
Phe Gly Ser Gly Arg Arg Gly Cys Pro Gly Met Arg Leu Gly Leu Thr
435 440 445
Thr Val Arg Leu Val Leu Ala Gln Leu Ile His Cys Phe Asp Leu Glu
450 455 460
Leu Pro Lys Gly Thr Val Ala Thr Asp Leu Asp Met Ser Glu Lys Phe
465 470 475 480
Gly Leu Ala Met Pro Arg Ala Gln His Leu Leu Ala Phe Pro Thr Tyr
485 490 495
Arg Leu Glu Ser
500
<210> 72
<211> 1534
<212> DNA
<213> 人工序列
<220>
<223> SaCP120293, 用作SaCP816优化的DNA序列
<400> 72
aggaggtaaa acatatggca ctgttgttgg cggttttctg gagcgctttg attattctgg 60
ttagcatctt attgcgtcgt cgtcaaaaac gcaacaattt gccaccgggc ccaccggccc 120
tgccgatcat cggtaacatt cacattctgg gcaccctgcc gcaccagagc ctgtacaatc 180
tggcgaagaa gtacggtccg atcatgtcca tgcgtttggg cttggttccg gcggtggtca 240
tcagcagccc ggaagcggcc gagctggtcc tgaaaaccca cgacatcgtt tttgcttctc 300
gccctcgtct gcaagttgca gattactttc actatggcac caaaggcgtg attctgaccg 360
aatatggtac ctactggcgt aacatgcgtc gcctgtgcac ggtcaaactg ctgaacaccg 420
ttaagattga tagctttgca ggcacccgca agaaagaagt cgctagcttc gttcagagcc 480
tgaaagaagc aagcgtggcg cacaaaatgg ttaacctgtc cgcacgcgtc gctaatgtta 540
ttgagaatat ggtttgtctg atggttattg gtagatcgtc tgacgagcgt ttcaagctga 600
aagaagtgat ccaagaagcg gcacagctgg cgggtgcctt caatattggt gactatgtcc 660
cgtttctgat gccgctggat ctgcagggcc tgactcgccg tatcaagagc ggtagcaagg 720
cattcgatga catcctcgag gtcattatcg acgagcatgt gcaagacatt aaagatcatg 780
acgatgagca gcatggtgac ttcatcgacg tgctgctggc gatgatgaat aagccgatgg 840
attctcgtga gggtctgtcc atcattgatc gcacgaacat taaagcgatc ctggtggata 900
tgatcggtgc cgcgatggac acgagcacca gcggtgtgga gtgggcgatt tcggagctga 960
ttaagcatcc tcgtgtcatg aagaaactgc aagacgaagt gaaaaccgta atcggtatga 1020
accgcatggt ggaagaagcg gatctgccga aactgccgta cctggacatg gttgtcaagg 1080
aaacgatgcg tctgcatccg ccaggcccgc tgctggtgcc gcgtgaaagc atggaagata 1140
ttacgatcaa cggttactat atcccgaaga aatcccgcat tattgtgaat gcatgggcga 1200
tcggccgtga caccaacgcc tggagcaata atgcgcacga gtttttccct gagcgtttta 1260
tgagctctaa cgttgatctg caaggccagg acttccagct gatcccgttc ggtagcggtc 1320
gtcgcggttg tccgggcatg cgtctgggtc tgacgacggt ccgcttggtg ctggcccaac 1380
tgattcactg cttcgacctg gagcttccga agggcaccgt cgcgactgac ctggatatga 1440
gcgagaagtt tggtctggca atgccgcgtg cgcagcactt actggccttt ccgacctacc 1500
gtctggagag ctaagtcgac accatggaaa gctt 1534
<210> 73
<211> 499
<212> PRT
<213> 人工序列
<220>
<223> SaCP120293 氨基酸序列, N-末端经修饰的SaCP816
<400> 73
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val
1 5 10 15
Ser Ile Leu Leu Arg Arg Arg Gln Lys Arg Asn Asn Leu Pro Pro Gly
20 25 30
Pro Pro Ala Leu Pro Ile Ile Gly Asn Ile His Ile Leu Gly Thr Leu
35 40 45
Pro His Gln Ser Leu Tyr Asn Leu Ala Lys Lys Tyr Gly Pro Ile Met
50 55 60
Ser Met Arg Leu Gly Leu Val Pro Ala Val Val Ile Ser Ser Pro Glu
65 70 75 80
Ala Ala Glu Leu Val Leu Lys Thr His Asp Ile Val Phe Ala Ser Arg
85 90 95
Pro Arg Leu Gln Val Ala Asp Tyr Phe His Tyr Gly Thr Lys Gly Val
100 105 110
Ile Leu Thr Glu Tyr Gly Thr Tyr Trp Arg Asn Met Arg Arg Leu Cys
115 120 125
Thr Val Lys Leu Leu Asn Thr Val Lys Ile Asp Ser Phe Ala Gly Thr
130 135 140
Arg Lys Lys Glu Val Ala Ser Phe Val Gln Ser Leu Lys Glu Ala Ser
145 150 155 160
Val Ala His Lys Met Val Asn Leu Ser Ala Arg Val Ala Asn Val Ile
165 170 175
Glu Asn Met Val Cys Leu Met Val Ile Gly Arg Ser Ser Asp Glu Arg
180 185 190
Phe Lys Leu Lys Glu Val Ile Gln Glu Ala Ala Gln Leu Ala Gly Ala
195 200 205
Phe Asn Ile Gly Asp Tyr Val Pro Phe Leu Met Pro Leu Asp Leu Gln
210 215 220
Gly Leu Thr Arg Arg Ile Lys Ser Gly Ser Lys Ala Phe Asp Asp Ile
225 230 235 240
Leu Glu Val Ile Ile Asp Glu His Val Gln Asp Ile Lys Asp His Asp
245 250 255
Asp Glu Gln His Gly Asp Phe Ile Asp Val Leu Leu Ala Met Met Asn
260 265 270
Lys Pro Met Asp Ser Arg Glu Gly Leu Ser Ile Ile Asp Arg Thr Asn
275 280 285
Ile Lys Ala Ile Leu Val Asp Met Ile Gly Ala Ala Met Asp Thr Ser
290 295 300
Thr Ser Gly Val Glu Trp Ala Ile Ser Glu Leu Ile Lys His Pro Arg
305 310 315 320
Val Met Lys Lys Leu Gln Asp Glu Val Lys Thr Val Ile Gly Met Asn
325 330 335
Arg Met Val Glu Glu Ala Asp Leu Pro Lys Leu Pro Tyr Leu Asp Met
340 345 350
Val Val Lys Glu Thr Met Arg Leu His Pro Pro Gly Pro Leu Leu Val
355 360 365
Pro Arg Glu Ser Met Glu Asp Ile Thr Ile Asn Gly Tyr Tyr Ile Pro
370 375 380
Lys Lys Ser Arg Ile Ile Val Asn Ala Trp Ala Ile Gly Arg Asp Thr
385 390 395 400
Asn Ala Trp Ser Asn Asn Ala His Glu Phe Phe Pro Glu Arg Phe Met
405 410 415
Ser Ser Asn Val Asp Leu Gln Gly Gln Asp Phe Gln Leu Ile Pro Phe
420 425 430
Gly Ser Gly Arg Arg Gly Cys Pro Gly Met Arg Leu Gly Leu Thr Thr
435 440 445
Val Arg Leu Val Leu Ala Gln Leu Ile His Cys Phe Asp Leu Glu Leu
450 455 460
Pro Lys Gly Thr Val Ala Thr Asp Leu Asp Met Ser Glu Lys Phe Gly
465 470 475 480
Leu Ala Met Pro Arg Ala Gln His Leu Leu Ala Phe Pro Thr Tyr Arg
485 490 495
Leu Glu Ser
<210> 74
<211> 3672
<212> DNA
<213> 人工序列
<220>
<223> 用作编码SaCP816和CPRm的合成操纵子
<400> 74
catatggcac tgttgttggc ggttttctgg agcgctttga ttattctggt tagcatctta 60
ttgcgtcgtc gtcaaaaacg caacaatttg ccaccgggcc caccggccct gccgatcatc 120
ggtaacattc acattctggg caccctgccg caccagagcc tgtacaatct ggcgaagaag 180
tacggtccga tcatgtccat gcgtttgggc ttggttccgg cggtggtcat cagcagcccg 240
gaagcggccg agctggtcct gaaaacccac gacatcgttt ttgcttctcg ccctcgtctg 300
caagttgcag attactttca ctatggcacc aaaggcgtga ttctgaccga atatggtacc 360
tactggcgta acatgcgtcg cctgtgcacg gtcaaactgc tgaacaccgt taagattgat 420
agctttgcag gcacccgcaa gaaagaagtc gctagcttcg ttcagagcct gaaagaagca 480
agcgtggcgc acaaaatggt taacctgtcc gcacgcgtcg ctaatgttat tgagaatatg 540
gtttgtctga tggttattgg tagatcgtct gacgagcgtt tcaagctgaa agaagtgatc 600
caagaagcgg cacagctggc gggtgccttc aatattggtg actatgtccc gtttctgatg 660
ccgctggatc tgcagggcct gactcgccgt atcaagagcg gtagcaaggc attcgatgac 720
atcctcgagg tcattatcga cgagcatgtg caagacatta aagatcatga cgatgagcag 780
catggtgact tcatcgacgt gctgctggcg atgatgaata agccgatgga ttctcgtgag 840
ggtctgtcca tcattgatcg cacgaacatt aaagcgatcc tggtggatat gatcggtgcc 900
gcgatggaca cgagcaccag cggtgtggag tgggcgattt cggagctgat taagcatcct 960
cgtgtcatga agaaactgca agacgaagtg aaaaccgtaa tcggtatgaa ccgcatggtg 1020
gaagaagcgg atctgccgaa actgccgtac ctggacatgg ttgtcaagga aacgatgcgt 1080
ctgcatccgc caggcccgct gctggtgccg cgtgaaagca tggaagatat tacgatcaac 1140
ggttactata tcccgaagaa atcccgcatt attgtgaatg catgggcgat cggccgtgac 1200
accaacgcct ggagcaataa tgcgcacgag tttttccctg agcgttttat gagctctaac 1260
gttgatctgc aaggccagga cttccagctg atcccgttcg gtagcggtcg tcgcggttgt 1320
ccgggcatgc gtctgggtct gacgacggtc cgcttggtgc tggcccaact gattcactgc 1380
ttcgacctgg agcttccgaa gggcaccgtc gcgactgacc tggatatgag cgagaagttt 1440
ggtctggcaa tgccgcgtgc gcagcactta ctggcctttc cgacctaccg tctggagagc 1500
taagtcgact aactttaaga aggagatata tccatggaac ctagctctca gaaactgtct 1560
ccgttggaat ttgttgctgc tatcctgaag ggcgactaca gcagcggtca ggttgaaggt 1620
ggtccaccgc caggtctggc agctatgttg atggaaaata aggatttggt gatggttctg 1680
acgacgtccg tggcagtcct gatcggctgt gtcgtggtcc tggcatggcg tcgtgcggca 1740
ggtagcggta agtacaagca acctgaactg cctaaactgg tggtcccgaa agcagccgaa 1800
ccggaggagg cagaggatga taaaaccaag atcagcgtgt ttttcggcac ccaaaccggt 1860
acggcagaag gtttcgcgaa ggcttttgtt gaagaggcca aggcgcgtta tcagcaggcc 1920
cgtttcaaag ttatcgacct ggacgactat gcggcagacg atgacgagta cgaagagaaa 1980
ctgaagaagg aaaacttggc attcttcttc ttggcgtcct acggtgacgg cgagccgacg 2040
gacaacgcgg cacgctttta caaatggttt acggagggta aggaccgtgg tgaatggctg 2100
aacaatctgc agtacggcgt ttttggtctg ggtaaccgtc aatatgagca tttcaataag 2160
atcgccattg tcgtcgatga tctgatcttc gagcaaggtg gcaagaagct ggttccggtg 2220
ggtctgggtg acgatgacca gtgcattgag gatgattttg cggcgtggcg tgaactggtc 2280
tggccggaac tggataaact gctgcgtaac gaagacgacg ctaccgtggc aaccccgtac 2340
agcgccgctg tgctgcaata ccgcgtggtt ttccacgatc acattgacgg cctgattagc 2400
gaaaacggta gcccgaacgg tcatgctaat ggcaataccg tgtacgatgc gcaacacccg 2460
tgccgtagca acgtcgcggt caagaaggaa ttgcatactc cggcgagcga tcgcagctgc 2520
acccacctgg aatttaacat tagcggtacc ggcctgatgt acgagacggg tgaccacgtc 2580
ggtgtgtatt gcgagaacct gttggaaacc gtggaggagg ccgagaagtt gttgaacctg 2640
agcccgcaga cgtacttctc cgttcacacc gacaacgagg acggtacgcc gttgagcggc 2700
agcagcctgc cgccaccgtt tccgccgtgc accttgcgca cggcattgac caaatacgca 2760
gacttgactt ctgcaccgaa aaagtcggtg ctggtggcgc tggccgagta cgcatctgac 2820
cagggtgaag cggatcgttt gcgtttcttg gcgagcccga gcggcaaaga ggaatatgca 2880
cagtacatct tggcaagcca gcgcacgctg ctggaggtca tggcggagtt cccgtcggcg 2940
aaaccgccgc tgggtgtctt tttcgcgggt gtcgctccgc gcctgcagcc gcgtttctat 3000
tccattagct ctagcccgaa gatcgcaccg ttccgtattc acgtgacctg cgccctggtt 3060
tatgacaaat cccctaccgg tcgcgttcat aagggcatct gtagcacgtg gatgaaaaat 3120
gcggtcccgc tggaagaaag caacgattgt tcctgggctc cgatcttcgt ccgcaacagc 3180
aacttcaagc tgccgaccga cccgaaggtt ccgattatca tgattggtcc gggtaccggt 3240
ctggcccctt ttcgtggctt tttgcaagag cgcttggcgt tgaaagagag cggtgctgaa 3300
ttgggtccgg cgatcttgtt ctttggttgc cgtaaccgta aaatggactt tatttacgag 3360
gatgaactga atgatttcgt caaagcgggc gttgtcagcg agctgatcgt cgcttttagc 3420
cgcgaaggcc cgatgaaaga atacgtgcaa cacaaaatga gccaacgtgc ctccgatgtg 3480
tggaacatca ttagcgacgg tggttatgtt tatgtttgcg gtgacgcgaa gggtatggct 3540
cgtgatgttc accgtaccct gcataccatc gcacaggagc aaggtagcat gtccagctcg 3600
gaggccgaag gtatggtcaa aaacctgcaa accaccggtc gttacctgcg tgatgtgtgg 3660
taataaaagc tt 3672
<210> 75
<211> 5349
<212> DNA
<213> 人工序列
<220>
<223> 用作编码SaCP816、CPRm和ClASS的合成操纵子
<400> 75
catatggcac tgttgttggc ggttttctgg agcgctttga ttattctggt tagcatctta 60
ttgcgtcgtc gtcaaaaacg caacaatttg ccaccgggcc caccggccct gccgatcatc 120
ggtaacattc acattctggg caccctgccg caccagagcc tgtacaatct ggcgaagaag 180
tacggtccga tcatgtccat gcgtttgggc ttggttccgg cggtggtcat cagcagcccg 240
gaagcggccg agctggtcct gaaaacccac gacatcgttt ttgcttctcg ccctcgtctg 300
caagttgcag attactttca ctatggcacc aaaggcgtga ttctgaccga atatggtacc 360
tactggcgta acatgcgtcg cctgtgcacg gtcaaactgc tgaacaccgt taagattgat 420
agctttgcag gcacccgcaa gaaagaagtc gctagcttcg ttcagagcct gaaagaagca 480
agcgtggcgc acaaaatggt taacctgtcc gcacgcgtcg ctaatgttat tgagaatatg 540
gtttgtctga tggttattgg tagatcgtct gacgagcgtt tcaagctgaa agaagtgatc 600
caagaagcgg cacagctggc gggtgccttc aatattggtg actatgtccc gtttctgatg 660
ccgctggatc tgcagggcct gactcgccgt atcaagagcg gtagcaaggc attcgatgac 720
atcctcgagg tcattatcga cgagcatgtg caagacatta aagatcatga cgatgagcag 780
catggtgact tcatcgacgt gctgctggcg atgatgaata agccgatgga ttctcgtgag 840
ggtctgtcca tcattgatcg cacgaacatt aaagcgatcc tggtggatat gatcggtgcc 900
gcgatggaca cgagcaccag cggtgtggag tgggcgattt cggagctgat taagcatcct 960
cgtgtcatga agaaactgca agacgaagtg aaaaccgtaa tcggtatgaa ccgcatggtg 1020
gaagaagcgg atctgccgaa actgccgtac ctggacatgg ttgtcaagga aacgatgcgt 1080
ctgcatccgc caggcccgct gctggtgccg cgtgaaagca tggaagatat tacgatcaac 1140
ggttactata tcccgaagaa atcccgcatt attgtgaatg catgggcgat cggccgtgac 1200
accaacgcct ggagcaataa tgcgcacgag tttttccctg agcgttttat gagctctaac 1260
gttgatctgc aaggccagga cttccagctg atcccgttcg gtagcggtcg tcgcggttgt 1320
ccgggcatgc gtctgggtct gacgacggtc cgcttggtgc tggcccaact gattcactgc 1380
ttcgacctgg agcttccgaa gggcaccgtc gcgactgacc tggatatgag cgagaagttt 1440
ggtctggcaa tgccgcgtgc gcagcactta ctggcctttc cgacctaccg tctggagagc 1500
taagtcgact aactttaaga aggagatata tccatggaac ctagctctca gaaactgtct 1560
ccgttggaat ttgttgctgc tatcctgaag ggcgactaca gcagcggtca ggttgaaggt 1620
ggtccaccgc caggtctggc agctatgttg atggaaaata aggatttggt gatggttctg 1680
acgacgtccg tggcagtcct gatcggctgt gtcgtggtcc tggcatggcg tcgtgcggca 1740
ggtagcggta agtacaagca acctgaactg cctaaactgg tggtcccgaa agcagccgaa 1800
ccggaggagg cagaggatga taaaaccaag atcagcgtgt ttttcggcac ccaaaccggt 1860
acggcagaag gtttcgcgaa ggcttttgtt gaagaggcca aggcgcgtta tcagcaggcc 1920
cgtttcaaag ttatcgacct ggacgactat gcggcagacg atgacgagta cgaagagaaa 1980
ctgaagaagg aaaacttggc attcttcttc ttggcgtcct acggtgacgg cgagccgacg 2040
gacaacgcgg cacgctttta caaatggttt acggagggta aggaccgtgg tgaatggctg 2100
aacaatctgc agtacggcgt ttttggtctg ggtaaccgtc aatatgagca tttcaataag 2160
atcgccattg tcgtcgatga tctgatcttc gagcaaggtg gcaagaagct ggttccggtg 2220
ggtctgggtg acgatgacca gtgcattgag gatgattttg cggcgtggcg tgaactggtc 2280
tggccggaac tggataaact gctgcgtaac gaagacgacg ctaccgtggc aaccccgtac 2340
agcgccgctg tgctgcaata ccgcgtggtt ttccacgatc acattgacgg cctgattagc 2400
gaaaacggta gcccgaacgg tcatgctaat ggcaataccg tgtacgatgc gcaacacccg 2460
tgccgtagca acgtcgcggt caagaaggaa ttgcatactc cggcgagcga tcgcagctgc 2520
acccacctgg aatttaacat tagcggtacc ggcctgatgt acgagacggg tgaccacgtc 2580
ggtgtgtatt gcgagaacct gttggaaacc gtggaggagg ccgagaagtt gttgaacctg 2640
agcccgcaga cgtacttctc cgttcacacc gacaacgagg acggtacgcc gttgagcggc 2700
agcagcctgc cgccaccgtt tccgccgtgc accttgcgca cggcattgac caaatacgca 2760
gacttgactt ctgcaccgaa aaagtcggtg ctggtggcgc tggccgagta cgcatctgac 2820
cagggtgaag cggatcgttt gcgtttcttg gcgagcccga gcggcaaaga ggaatatgca 2880
cagtacatct tggcaagcca gcgcacgctg ctggaggtca tggcggagtt cccgtcggcg 2940
aaaccgccgc tgggtgtctt tttcgcgggt gtcgctccgc gcctgcagcc gcgtttctat 3000
tccattagct ctagcccgaa gatcgcaccg ttccgtattc acgtgacctg cgccctggtt 3060
tatgacaaat cccctaccgg tcgcgttcat aagggcatct gtagcacgtg gatgaaaaat 3120
gcggtcccgc tggaagaaag caacgattgt tcctgggctc cgatcttcgt ccgcaacagc 3180
aacttcaagc tgccgaccga cccgaaggtt ccgattatca tgattggtcc gggtaccggt 3240
ctggcccctt ttcgtggctt tttgcaagag cgcttggcgt tgaaagagag cggtgctgaa 3300
ttgggtccgg cgatcttgtt ctttggttgc cgtaaccgta aaatggactt tatttacgag 3360
gatgaactga atgatttcgt caaagcgggc gttgtcagcg agctgatcgt cgcttttagc 3420
cgcgaaggcc cgatgaaaga atacgtgcaa cacaaaatga gccaacgtgc ctccgatgtg 3480
tggaacatca ttagcgacgg tggttatgtt tatgtttgcg gtgacgcgaa gggtatggct 3540
cgtgatgttc accgtaccct gcataccatc gcacaggagc aaggtagcat gtccagctcg 3600
gaggccgaag gtatggtcaa aaacctgcaa accaccggtc gttacctgcg tgatgtgtgg 3660
taataaaagc ttgaaggaga tatactaatg tctacccagc aggttagctc cgagaatatc 3720
gttcgcaacg cggcgaactt ccacccgaat atctggggta atcatttctt gacgtgtcca 3780
agccagacga tcgattcttg gacgcaacaa caccataaag agctgaaaga agaggtccgc 3840
aagatgatgg tgagcgacgc aaacaaaccg gcacaacgtc tgcgtctgat tgacaccgtt 3900
caacgtttgg gcgtggcgta tcatttcgaa aaagaaatcg atgacgctct ggaaaagatc 3960
ggtcacgatc cgtttgacga taaggatgac ctgtatatcg ttagcctgtg ttttcgcctg 4020
ctgcgtcagc atggcatcaa gattagctgc gatgtttttg agaagttcaa agacgacgat 4080
ggcaagttta aggcttccct gatgaatgat gtccaaggta tgctgtcgtt gtatgaagcg 4140
gcccacctgg caattcatgg cgaggacatc ctggatgagg ctattgtctt tacgaccacc 4200
cacctgaaga gcaccgtttc taactccccg gtcaattcca cctttgcgga acagattcgc 4260
cacagcctgc gtgtgccgct gcgtaaggca gtcccgcgtt tggagagccg ctacttcctg 4320
gatatctata gccgtgacga cctgcacgac aagactctgc tgaactttgc caaactggac 4380
ttcaacatcc tgcaggcgat gcaccagaaa gaggcaagcg agatgacccg ttggtggcgt 4440
gatttcgatt tcctgaagaa gctgccgtac attcgtgatc gcgtggttga actgtacttt 4500
tggattttgg tcggtgtgag ctaccaaccg aaattcagca cgggtcgtat ctttttgagc 4560
aagattatct gtctggaaac cctggtggac gacacgtttg atgcgtacgg tactttcgac 4620
gaactggcca ttttcaccga ggccgttacg cgttgggacc tgggtcatcg cgacgcgctg 4680
cctgagtaca tgaaattcat tttcaagacc ctgattgatg tgtacagcga ggcggaacaa 4740
gagctggcaa aagagggccg ctcctatagc attcactatg cgatccgtag cttccaggag 4800
ttggtcatga agtacttttg cgaggcgaaa tggctgaata agggttatgt tccgagcctg 4860
gatgactaca agagcgtcag cctgcgcagc atcggcttcc tgccgatcgc cgtggcttct 4920
tttgttttca tgggcgacat tgctacgaaa gaggtttttg agtgggaaat gaataacccg 4980
aaaatcatca tcgcagccga aaccattttc cgctttctgg atgacattgc aggtcatcgc 5040
ttcgaacaaa aacgtgagca cagcccgagc gcaatcgagt gctacaaaaa ccaacatggt 5100
gtctcggaag aagaggcagt gaaagcgctg agcttggagg tcgccaattc gtggaaagac 5160
attaacgaag agctgctgct gaaccctatg gcaattccac tgccgttgct gcaggtgatc 5220
ctggatttga gccgtagcgc ggacttcatg tacggtaatg cgcaggaccg tttcacgcac 5280
tccaccatga tgaaagatca agttgacctg gttctgaaag atccggtgaa actggacgat 5340
taagaattc 5349
<210> 76
<211> 5402
<212> DNA
<213> 人工序列
<220>
<223> 用作编码SaCP816、CPRm和SaSAS的合成操纵子
<400> 76
catatggcac tgttgttggc ggttttctgg agcgctttga ttattctggt tagcatctta 60
ttgcgtcgtc gtcaaaaacg caacaatttg ccaccgggcc caccggccct gccgatcatc 120
ggtaacattc acattctggg caccctgccg caccagagcc tgtacaatct ggcgaagaag 180
tacggtccga tcatgtccat gcgtttgggc ttggttccgg cggtggtcat cagcagcccg 240
gaagcggccg agctggtcct gaaaacccac gacatcgttt ttgcttctcg ccctcgtctg 300
caagttgcag attactttca ctatggcacc aaaggcgtga ttctgaccga atatggtacc 360
tactggcgta acatgcgtcg cctgtgcacg gtcaaactgc tgaacaccgt taagattgat 420
agctttgcag gcacccgcaa gaaagaagtc gctagcttcg ttcagagcct gaaagaagca 480
agcgtggcgc acaaaatggt taacctgtcc gcacgcgtcg ctaatgttat tgagaatatg 540
gtttgtctga tggttattgg tagatcgtct gacgagcgtt tcaagctgaa agaagtgatc 600
caagaagcgg cacagctggc gggtgccttc aatattggtg actatgtccc gtttctgatg 660
ccgctggatc tgcagggcct gactcgccgt atcaagagcg gtagcaaggc attcgatgac 720
atcctcgagg tcattatcga cgagcatgtg caagacatta aagatcatga cgatgagcag 780
catggtgact tcatcgacgt gctgctggcg atgatgaata agccgatgga ttctcgtgag 840
ggtctgtcca tcattgatcg cacgaacatt aaagcgatcc tggtggatat gatcggtgcc 900
gcgatggaca cgagcaccag cggtgtggag tgggcgattt cggagctgat taagcatcct 960
cgtgtcatga agaaactgca agacgaagtg aaaaccgtaa tcggtatgaa ccgcatggtg 1020
gaagaagcgg atctgccgaa actgccgtac ctggacatgg ttgtcaagga aacgatgcgt 1080
ctgcatccgc caggcccgct gctggtgccg cgtgaaagca tggaagatat tacgatcaac 1140
ggttactata tcccgaagaa atcccgcatt attgtgaatg catgggcgat cggccgtgac 1200
accaacgcct ggagcaataa tgcgcacgag tttttccctg agcgttttat gagctctaac 1260
gttgatctgc aaggccagga cttccagctg atcccgttcg gtagcggtcg tcgcggttgt 1320
ccgggcatgc gtctgggtct gacgacggtc cgcttggtgc tggcccaact gattcactgc 1380
ttcgacctgg agcttccgaa gggcaccgtc gcgactgacc tggatatgag cgagaagttt 1440
ggtctggcaa tgccgcgtgc gcagcactta ctggcctttc cgacctaccg tctggagagc 1500
taagtcgact aactttaaga aggagatata tccatggaac ctagctctca gaaactgtct 1560
ccgttggaat ttgttgctgc tatcctgaag ggcgactaca gcagcggtca ggttgaaggt 1620
ggtccaccgc caggtctggc agctatgttg atggaaaata aggatttggt gatggttctg 1680
acgacgtccg tggcagtcct gatcggctgt gtcgtggtcc tggcatggcg tcgtgcggca 1740
ggtagcggta agtacaagca acctgaactg cctaaactgg tggtcccgaa agcagccgaa 1800
ccggaggagg cagaggatga taaaaccaag atcagcgtgt ttttcggcac ccaaaccggt 1860
acggcagaag gtttcgcgaa ggcttttgtt gaagaggcca aggcgcgtta tcagcaggcc 1920
cgtttcaaag ttatcgacct ggacgactat gcggcagacg atgacgagta cgaagagaaa 1980
ctgaagaagg aaaacttggc attcttcttc ttggcgtcct acggtgacgg cgagccgacg 2040
gacaacgcgg cacgctttta caaatggttt acggagggta aggaccgtgg tgaatggctg 2100
aacaatctgc agtacggcgt ttttggtctg ggtaaccgtc aatatgagca tttcaataag 2160
atcgccattg tcgtcgatga tctgatcttc gagcaaggtg gcaagaagct ggttccggtg 2220
ggtctgggtg acgatgacca gtgcattgag gatgattttg cggcgtggcg tgaactggtc 2280
tggccggaac tggataaact gctgcgtaac gaagacgacg ctaccgtggc aaccccgtac 2340
agcgccgctg tgctgcaata ccgcgtggtt ttccacgatc acattgacgg cctgattagc 2400
gaaaacggta gcccgaacgg tcatgctaat ggcaataccg tgtacgatgc gcaacacccg 2460
tgccgtagca acgtcgcggt caagaaggaa ttgcatactc cggcgagcga tcgcagctgc 2520
acccacctgg aatttaacat tagcggtacc ggcctgatgt acgagacggg tgaccacgtc 2580
ggtgtgtatt gcgagaacct gttggaaacc gtggaggagg ccgagaagtt gttgaacctg 2640
agcccgcaga cgtacttctc cgttcacacc gacaacgagg acggtacgcc gttgagcggc 2700
agcagcctgc cgccaccgtt tccgccgtgc accttgcgca cggcattgac caaatacgca 2760
gacttgactt ctgcaccgaa aaagtcggtg ctggtggcgc tggccgagta cgcatctgac 2820
cagggtgaag cggatcgttt gcgtttcttg gcgagcccga gcggcaaaga ggaatatgca 2880
cagtacatct tggcaagcca gcgcacgctg ctggaggtca tggcggagtt cccgtcggcg 2940
aaaccgccgc tgggtgtctt tttcgcgggt gtcgctccgc gcctgcagcc gcgtttctat 3000
tccattagct ctagcccgaa gatcgcaccg ttccgtattc acgtgacctg cgccctggtt 3060
tatgacaaat cccctaccgg tcgcgttcat aagggcatct gtagcacgtg gatgaaaaat 3120
gcggtcccgc tggaagaaag caacgattgt tcctgggctc cgatcttcgt ccgcaacagc 3180
aacttcaagc tgccgaccga cccgaaggtt ccgattatca tgattggtcc gggtaccggt 3240
ctggcccctt ttcgtggctt tttgcaagag cgcttggcgt tgaaagagag cggtgctgaa 3300
ttgggtccgg cgatcttgtt ctttggttgc cgtaaccgta aaatggactt tatttacgag 3360
gatgaactga atgatttcgt caaagcgggc gttgtcagcg agctgatcgt cgcttttagc 3420
cgcgaaggcc cgatgaaaga atacgtgcaa cacaaaatga gccaacgtgc ctccgatgtg 3480
tggaacatca ttagcgacgg tggttatgtt tatgtttgcg gtgacgcgaa gggtatggct 3540
cgtgatgttc accgtaccct gcataccatc gcacaggagc aaggtagcat gtccagctcg 3600
gaggccgaag gtatggtcaa aaacctgcaa accaccggtc gttacctgcg tgatgtgtgg 3660
taataaaagc ttaggaggta aaacatatgg acagcagcac cgccaccgca atgaccgcac 3720
cattcatcga cccgacggat catgtgaatc tgaaaaccga cacggatgcg agcgaaaatc 3780
gtcgtatggg taactacaag ccgagcattt ggaactacga ttttctgcag tccctggcga 3840
cgcaccacaa cattgttgaa gagcgtcacc tgaagctggc agagaaactg aaaggtcaag 3900
tgaaattcat gttcggtgcg ccgatggagc cattggctaa gttggagctg gttgatgtgg 3960
tgcaacgctt gggtctgaac cacctgttcg agactgaaat caaagaagct ctgttcagca 4020
tctacaaaga tggcagcaat ggctggtggt ttggccatct gcatgctacc tctttgcgct 4080
tccgtctgtt gcgccaatgt ggcctgttta tcccgcagga cgttttcaaa acctttcaaa 4140
acaagaccgg tgagtttgac atgaagctgt gggacaacgt taagggcctg ctgagcctgt 4200
acgaggcgag ctacctgggc tggaagggcg agaacatctt ggatgaagca aaggcgttca 4260
cgaccaagtg cctgaagagc gcatgggaga acattagcga gaagtggctg gcgaagcgtg 4320
ttaaacatgc gttggcgctg ccgctgcact ggcgtgttcc gcgtattgaa gcacgctggt 4380
ttatcgaggt gtacgaacaa gaggccaata tgaatccgac gctgctgaaa ctggcgaaac 4440
tggacttcaa catggtccaa agcattcacc agaaagaaat cggtgaactg gcccgctggt 4500
gggttactac cggcctggac aagctggatt tcgcacgcaa caatctgttg cagtcttata 4560
tgtggagctg cgccatcgcg tccgacccga aattcaaact ggcgcgtgaa accattgtcg 4620
agatcggttc cgtgttgacg gttgtcgacg acggctatga tgtgtacggt tctatggatg 4680
agctggacct gtacaccagc tcggtggagc gttggtcctg tgtcaaaatt gacaagctgc 4740
ctaatacgct gaagctgatc tttatgtcta tgttcaacaa aaccaacgag gtgggtctgc 4800
gtgttcaaca cgagcgtggt tacaatagca tcccgacctt cattaaggcg tgggtggaac 4860
agtgtaagag ctatcaaaaa gaggcgcgtt ggtttcatgg tggtcacacg cctccgctgg 4920
aagaatacag cctgaacggt ctggtcagca ttggttttcc gctgttgctg atcaccggct 4980
atgttgcgat tgctgagaat gaagcagccc tggataaagt ccacccgctg ccggacctgc 5040
tgcattattc cagcttgctg agccgtctga ttaatgatat cggcactagc ccggatgaaa 5100
tggcgcgtgg tgacaatctg aagagcattc actgctatat gaatgaaacc ggtgccagcg 5160
aagaggtcgc acgcgagcac atcaaaggcg tcatcgaaga gaattggaaa attctgaacc 5220
agtgttgctt tgaccagtcc cagttccagg agccgttcat cacgtttaac ctgaacagcg 5280
tgcgcggctc gcatttcttc tatgaatttg gtgatggttt tggtgttacc gacagctgga 5340
ccaaggtgga tatgaaaagc gtcctgattg atccgattcc gctgggtgaa gagtaagctt 5400
gc 5402
<210> 77
<211> 1880
<212> DNA
<213> 檀香树(Santalum album)
<400> 77
atataaaagc aatagagaaa cgcactttcc cacaccatcc caccagtaag tcactttgcc 60
caagtcccta atacggtgga aagggcaaaa aaaaataacg gaaagggtaa aatatcccgc 120
aaatgtctcc gaccactgtc gccgtcgccg tcgccatcat cggagcactc tggctcctca 180
cgcgaaagcg ccggaagggg ccgggcctcc cgccaggccc acgggcctac ccgatcatcg 240
ggaacctcca catgatgggc cagctcccgc accacaacct ccgcgagctg gcccgggagt 300
acggccccat catgtcgatg cggctcggcc tcgtccccgc catcgtggtc tcctccccgg 360
aggcggcgca gctcttcctg aagacgcatg atacggtgtt cgcgagccgg ccgaagacgg 420
agacggcgaa gtacttccac tacgggatca agggtctcat cctgaccgag tacgggccgt 480
actggcgcaa catccggcgg ctgagcacgg tcaagctgct gaacgcggcg aagatcgatt 540
cgttcgcggc gatgaggcgg agcgaggtgg agaggctggt ggcgtcggtg agggggtcgg 600
cggtgcggcg ggaggtggtg gacgtgagct cgaaggtggc ggaggcaatg gagaacatgg 660
tgtgtcagat ggtgattggg aggagtgggg acgataggtt taagctgaag gagacgtttc 720
aggaggggac tcagttggcc ggagctttca attttgggga gttcgttccc tttctcctgc 780
cacttgacct tcagggaata acacggcgca taaaagaagt aagcacgagg ttcaacaaaa 840
tcttggattt aatcgtcgac gagcacatca gagacgccgc tggaaccaaa aattccggcg 900
gtcgagacag cgacaacttc ctcgacgtcc tcctttccct aatgaacacc tccatcagcg 960
actccaacga caccggcgac aacaaccgca acaacgtcat tgaacgagac aacatcaaag 1020
cgatcctcac cgatatgctc ggcgccgcca tggacacctc cgccagcacc gtcgagtgga 1080
ccatctccga gctcttccgc cacccaaaaa caatgcaaaa gctccaggcc gagattcggg 1140
gtgtcgtggg cccgacccgg aacgtgtctg aagacgacct cccaaagctc acttacctgg 1200
acatggtggt gaaggagggg atgcggcttc acccggcggt gccgctgctc ctcccccacg 1260
agtccctgga ggaggcgaca atcgatggtt attacattcc gaaggggtct cggatcctga 1320
tcaatgtgtg ggccatcggg cgcgacccga aggcctggcc tgatcgcccg gaggagttca 1380
tcccggagag gtttgagaaa agcaatgtgg atgtgctggg gagggatttc caactccttc 1440
cgttcggctc gggccgtaga gggtgcgccg ggattcggtt agggttgatt ttcgtgcgat 1500
tggtgctagc tcagctggtg cattgtttcg attgggagct cgcccgcaac atggcttcgt 1560
caccggagaa gttggacatg gaagagaagt tcgggctagc tgtgcataga gttaaccatt 1620
tgaaagcact gccgacttat cgcttggaat gctaaaagtt gctttctacc tatatatata 1680
cactcgctag gaaataaatg atgttttcaa atggaataat tttctttttt aatgaaatag 1740
cataagtatt gttggttgtt atttaccaaa aaaaaagaag tattgtcggt tgtttacgat 1800
ggtggtatta atgtgttttg atgcatgggt atatccatca ttttatttta acttagctaa 1860
tttttgagtt attgatgtat 1880
<210> 78
<211> 1533
<212> DNA
<213> 檀香树(Santalum album)
<400> 78
atgtctccga ccactgtcgc cgtcgccgtc gccatcatcg gagcactctg gctcctcacg 60
cgaaagcgcc ggaaggggcc gggcctcccg ccaggcccac gggcctaccc gatcatcggg 120
aacctccaca tgatgggcca gctcccgcac cacaacctcc gcgagctggc ccgggagtac 180
ggccccatca tgtcgatgcg gctcggcctc gtccccgcca tcgtggtctc ctccccggag 240
gcggcgcagc tcttcctgaa gacgcatgat acggtgttcg cgagccggcc gaagacggag 300
acggcgaagt acttccacta cgggatcaag ggtctcatcc tgaccgagta cgggccgtac 360
tggcgcaaca tccggcggct gagcacggtc aagctgctga acgcggcgaa gatcgattcg 420
ttcgcggcga tgaggcggag cgaggtggag aggctggtgg cgtcggtgag ggggtcggcg 480
gtgcggcggg aggtggtgga cgtgagctcg aaggtggcgg aggcaatgga gaacatggtg 540
tgtcagatgg tgattgggag gagtggggac gataggttta agctgaagga gacgtttcag 600
gaggggactc agttggccgg agctttcaat tttggggagt tcgttccctt tctcctgcca 660
cttgaccttc agggaataac acggcgcata aaagaagtaa gcacgaggtt caacaaaatc 720
ttggatttaa tcgtcgacga gcacatcaga gacgccgctg gaaccaaaaa ttccggcggt 780
cgagacagcg acaacttcct cgacgtcctc ctttccctaa tgaacacctc catcagcgac 840
tccaacgaca ccggcgacaa caaccgcaac aacgtcattg aacgagacaa catcaaagcg 900
atcctcaccg atatgctcgg cgccgccatg gacacctccg ccagcaccgt cgagtggacc 960
atctccgagc tcttccgcca cccaaaaaca atgcaaaagc tccaggccga gattcggggt 1020
gtcgtgggcc cgacccggaa cgtgtctgaa gacgacctcc caaagctcac ttacctggac 1080
atggtggtga aggaggggat gcggcttcac ccggcggtgc cgctgctcct cccccacgag 1140
tccctggagg aggcgacaat cgatggttat tacattccga aggggtctcg gatcctgatc 1200
aatgtgtggg ccatcgggcg cgacccgaag gcctggcctg atcgcccgga ggagttcatc 1260
ccggagaggt ttgagaaaag caatgtggat gtgctgggga gggatttcca actccttccg 1320
ttcggctcgg gccgtagagg gtgcgccggg attcggttag ggttgatttt cgtgcgattg 1380
gtgctagctc agctggtgca ttgtttcgat tgggagctcg cccgcaacat ggcttcgtca 1440
ccggagaagt tggacatgga agagaagttc gggctagctg tgcatagagt taaccatttg 1500
aaagcactgc cgacttatcg cttggaatgc taa 1533
<210> 79
<211> 510
<212> PRT
<213> 檀香树(Santalum album)
<400> 79
Met Ser Pro Thr Thr Val Ala Val Ala Val Ala Ile Ile Gly Ala Leu
1 5 10 15
Trp Leu Leu Thr Arg Lys Arg Arg Lys Gly Pro Gly Leu Pro Pro Gly
20 25 30
Pro Arg Ala Tyr Pro Ile Ile Gly Asn Leu His Met Met Gly Gln Leu
35 40 45
Pro His His Asn Leu Arg Glu Leu Ala Arg Glu Tyr Gly Pro Ile Met
50 55 60
Ser Met Arg Leu Gly Leu Val Pro Ala Ile Val Val Ser Ser Pro Glu
65 70 75 80
Ala Ala Gln Leu Phe Leu Lys Thr His Asp Thr Val Phe Ala Ser Arg
85 90 95
Pro Lys Thr Glu Thr Ala Lys Tyr Phe His Tyr Gly Ile Lys Gly Leu
100 105 110
Ile Leu Thr Glu Tyr Gly Pro Tyr Trp Arg Asn Ile Arg Arg Leu Ser
115 120 125
Thr Val Lys Leu Leu Asn Ala Ala Lys Ile Asp Ser Phe Ala Ala Met
130 135 140
Arg Arg Ser Glu Val Glu Arg Leu Val Ala Ser Val Arg Gly Ser Ala
145 150 155 160
Val Arg Arg Glu Val Val Asp Val Ser Ser Lys Val Ala Glu Ala Met
165 170 175
Glu Asn Met Val Cys Gln Met Val Ile Gly Arg Ser Gly Asp Asp Arg
180 185 190
Phe Lys Leu Lys Glu Thr Phe Gln Glu Gly Thr Gln Leu Ala Gly Ala
195 200 205
Phe Asn Phe Gly Glu Phe Val Pro Phe Leu Leu Pro Leu Asp Leu Gln
210 215 220
Gly Ile Thr Arg Arg Ile Lys Glu Val Ser Thr Arg Phe Asn Lys Ile
225 230 235 240
Leu Asp Leu Ile Val Asp Glu His Ile Arg Asp Ala Ala Gly Thr Lys
245 250 255
Asn Ser Gly Gly Arg Asp Ser Asp Asn Phe Leu Asp Val Leu Leu Ser
260 265 270
Leu Met Asn Thr Ser Ile Ser Asp Ser Asn Asp Thr Gly Asp Asn Asn
275 280 285
Arg Asn Asn Val Ile Glu Arg Asp Asn Ile Lys Ala Ile Leu Thr Asp
290 295 300
Met Leu Gly Ala Ala Met Asp Thr Ser Ala Ser Thr Val Glu Trp Thr
305 310 315 320
Ile Ser Glu Leu Phe Arg His Pro Lys Thr Met Gln Lys Leu Gln Ala
325 330 335
Glu Ile Arg Gly Val Val Gly Pro Thr Arg Asn Val Ser Glu Asp Asp
340 345 350
Leu Pro Lys Leu Thr Tyr Leu Asp Met Val Val Lys Glu Gly Met Arg
355 360 365
Leu His Pro Ala Val Pro Leu Leu Leu Pro His Glu Ser Leu Glu Glu
370 375 380
Ala Thr Ile Asp Gly Tyr Tyr Ile Pro Lys Gly Ser Arg Ile Leu Ile
385 390 395 400
Asn Val Trp Ala Ile Gly Arg Asp Pro Lys Ala Trp Pro Asp Arg Pro
405 410 415
Glu Glu Phe Ile Pro Glu Arg Phe Glu Lys Ser Asn Val Asp Val Leu
420 425 430
Gly Arg Asp Phe Gln Leu Leu Pro Phe Gly Ser Gly Arg Arg Gly Cys
435 440 445
Ala Gly Ile Arg Leu Gly Leu Ile Phe Val Arg Leu Val Leu Ala Gln
450 455 460
Leu Val His Cys Phe Asp Trp Glu Leu Ala Arg Asn Met Ala Ser Ser
465 470 475 480
Pro Glu Lys Leu Asp Met Glu Glu Lys Phe Gly Leu Ala Val His Arg
485 490 495
Val Asn His Leu Lys Ala Leu Pro Thr Tyr Arg Leu Glu Cys
500 505 510
<210> 80
<211> 1555
<212> DNA
<213> 人工序列
<220>
<223> SaCP120292, 用作编码N-末端经修饰的SaCP10374的优化的cDNA
<400> 80
aggaggtaaa acatatggca ctgctgctgg ctgtcttttg gagcgcactg attattctga 60
cccgcaaacg ccgcaaaggt ccgggtctgc caccgggtcc gcgtgcgtac ccgattattg 120
gcaatctgca catgatgggc cagctgccac accacaattt gcgtgagctg gcacgtgagt 180
atggtccgat tatgagcatg cgcctgggtc tggtgccggc aatcgtggtt agctctcctg 240
aggctgcgca gctgttcctc aagacgcatg ataccgtttt cgcgagccgt ccaaagaccg 300
agactgccaa atacttccat tacggtatca aaggtctgat cctgaccgag tatggcccgt 360
actggcgcaa tattcgtcgt ttgagcaccg ttaagctgtt gaatgccgcg aaaatcgata 420
gcttcgcggc tatgcgtaga agcgaagttg aacgcctggt cgcgtccgtt cgtggttcgg 480
cggttcgtcg tgaggttgtg gacgtcagca gcaaagtggc ggaagctatg gagaatatgg 540
tctgccagat ggttatcggc cgttcaggtg acgatcgttt taagctgaaa gaaacctttc 600
aagagggcac ccaactggca ggcgcgttca attttggtga gtttgtgccg tttctgctgc 660
cgctggactt gcaaggtatt acccgtcgca tcaaagaagt cagcactcgt ttcaataaga 720
ttttggacct gatcgttgac gagcacattc gcgatgccgc tggtaccaaa aacagcggcg 780
gtcgtgatag cgacaatttt ctggatgttc tgctgtcctt gatgaacacc tctattagcg 840
atagcaatga cacgggtgac aacaaccgta acaacgtgat cgagcgtgat aacattaaag 900
cgatcctgac ggacatgctg ggtgcagcga tggacacgag cgcgagcacg gtcgagtgga 960
cgatctccga actgtttcgc cacccgaaaa ccatgcagaa gctgcaagca gaaatccgtg 1020
gtgtcgtggg cccgacccgc aatgtgagcg aagatgactt gccgaagctg acctatctgg 1080
acatggtcgt taaggaaggc atgcgtttgc atccggccgt gccgctgctt ctgccgcatg 1140
agtctctgga agaagccacg atcgatggct actacattcc gaagggttcc cgcattctga 1200
tcaacgtctg ggcgattggt cgcgacccga aggcctggcc ggatcgtcct gaagagttca 1260
tcccggagcg tttcgagaaa agcaacgtgg atgtgctggg ccgtgacttc cagctgctgc 1320
cgtttggttc gggtcgtcgc ggttgtgcag gcattcgcct gggcctgatc ttcgtacgtc 1380
tggttctggc acagttagtt cactgtttcg actgggaact ggcgcgcaac atggcgagca 1440
gcccggagaa gttggatatg gaagagaagt tcggcctggc ggtgcatcgt gtcaaccacc 1500
tgaaagccct gccgacgtat cgtctggagt gctaagtcga caccatggaa agctt 1555
<210> 81
<211> 506
<212> PRT
<213> 人工序列
<220>
<223> SaCP10374opt, N-末端经修饰的氨基酸序列
<400> 81
Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Thr
1 5 10 15
Arg Lys Arg Arg Lys Gly Pro Gly Leu Pro Pro Gly Pro Arg Ala Tyr
20 25 30
Pro Ile Ile Gly Asn Leu His Met Met Gly Gln Leu Pro His His Asn
35 40 45
Leu Arg Glu Leu Ala Arg Glu Tyr Gly Pro Ile Met Ser Met Arg Leu
50 55 60
Gly Leu Val Pro Ala Ile Val Val Ser Ser Pro Glu Ala Ala Gln Leu
65 70 75 80
Phe Leu Lys Thr His Asp Thr Val Phe Ala Ser Arg Pro Lys Thr Glu
85 90 95
Thr Ala Lys Tyr Phe His Tyr Gly Ile Lys Gly Leu Ile Leu Thr Glu
100 105 110
Tyr Gly Pro Tyr Trp Arg Asn Ile Arg Arg Leu Ser Thr Val Lys Leu
115 120 125
Leu Asn Ala Ala Lys Ile Asp Ser Phe Ala Ala Met Arg Arg Ser Glu
130 135 140
Val Glu Arg Leu Val Ala Ser Val Arg Gly Ser Ala Val Arg Arg Glu
145 150 155 160
Val Val Asp Val Ser Ser Lys Val Ala Glu Ala Met Glu Asn Met Val
165 170 175
Cys Gln Met Val Ile Gly Arg Ser Gly Asp Asp Arg Phe Lys Leu Lys
180 185 190
Glu Thr Phe Gln Glu Gly Thr Gln Leu Ala Gly Ala Phe Asn Phe Gly
195 200 205
Glu Phe Val Pro Phe Leu Leu Pro Leu Asp Leu Gln Gly Ile Thr Arg
210 215 220
Arg Ile Lys Glu Val Ser Thr Arg Phe Asn Lys Ile Leu Asp Leu Ile
225 230 235 240
Val Asp Glu His Ile Arg Asp Ala Ala Gly Thr Lys Asn Ser Gly Gly
245 250 255
Arg Asp Ser Asp Asn Phe Leu Asp Val Leu Leu Ser Leu Met Asn Thr
260 265 270
Ser Ile Ser Asp Ser Asn Asp Thr Gly Asp Asn Asn Arg Asn Asn Val
275 280 285
Ile Glu Arg Asp Asn Ile Lys Ala Ile Leu Thr Asp Met Leu Gly Ala
290 295 300
Ala Met Asp Thr Ser Ala Ser Thr Val Glu Trp Thr Ile Ser Glu Leu
305 310 315 320
Phe Arg His Pro Lys Thr Met Gln Lys Leu Gln Ala Glu Ile Arg Gly
325 330 335
Val Val Gly Pro Thr Arg Asn Val Ser Glu Asp Asp Leu Pro Lys Leu
340 345 350
Thr Tyr Leu Asp Met Val Val Lys Glu Gly Met Arg Leu His Pro Ala
355 360 365
Val Pro Leu Leu Leu Pro His Glu Ser Leu Glu Glu Ala Thr Ile Asp
370 375 380
Gly Tyr Tyr Ile Pro Lys Gly Ser Arg Ile Leu Ile Asn Val Trp Ala
385 390 395 400
Ile Gly Arg Asp Pro Lys Ala Trp Pro Asp Arg Pro Glu Glu Phe Ile
405 410 415
Pro Glu Arg Phe Glu Lys Ser Asn Val Asp Val Leu Gly Arg Asp Phe
420 425 430
Gln Leu Leu Pro Phe Gly Ser Gly Arg Arg Gly Cys Ala Gly Ile Arg
435 440 445
Leu Gly Leu Ile Phe Val Arg Leu Val Leu Ala Gln Leu Val His Cys
450 455 460
Phe Asp Trp Glu Leu Ala Arg Asn Met Ala Ser Ser Pro Glu Lys Leu
465 470 475 480
Asp Met Glu Glu Lys Phe Gly Leu Ala Val His Arg Val Asn His Leu
485 490 495
Lys Ala Leu Pro Thr Tyr Arg Leu Glu Cys
500 505
<210> 82
<211> 3693
<212> DNA
<213> 人工序列
<220>
<223> SaCP10374-CPRm, 用作编码SaCP10374和CPRm的合成操纵子
<400> 82
catatggcac tgctgctggc tgtcttttgg agcgcactga ttattctgac ccgcaaacgc 60
cgcaaaggtc cgggtctgcc accgggtccg cgtgcgtacc cgattattgg caatctgcac 120
atgatgggcc agctgccaca ccacaatttg cgtgagctgg cacgtgagta tggtccgatt 180
atgagcatgc gcctgggtct ggtgccggca atcgtggtta gctctcctga ggctgcgcag 240
ctgttcctca agacgcatga taccgttttc gcgagccgtc caaagaccga gactgccaaa 300
tacttccatt acggtatcaa aggtctgatc ctgaccgagt atggcccgta ctggcgcaat 360
attcgtcgtt tgagcaccgt taagctgttg aatgccgcga aaatcgatag cttcgcggct 420
atgcgtagaa gcgaagttga acgcctggtc gcgtccgttc gtggttcggc ggttcgtcgt 480
gaggttgtgg acgtcagcag caaagtggcg gaagctatgg agaatatggt ctgccagatg 540
gttatcggcc gttcaggtga cgatcgtttt aagctgaaag aaacctttca agagggcacc 600
caactggcag gcgcgttcaa ttttggtgag tttgtgccgt ttctgctgcc gctggacttg 660
caaggtatta cccgtcgcat caaagaagtc agcactcgtt tcaataagat tttggacctg 720
atcgttgacg agcacattcg cgatgccgct ggtaccaaaa acagcggcgg tcgtgatagc 780
gacaattttc tggatgttct gctgtccttg atgaacacct ctattagcga tagcaatgac 840
acgggtgaca acaaccgtaa caacgtgatc gagcgtgata acattaaagc gatcctgacg 900
gacatgctgg gtgcagcgat ggacacgagc gcgagcacgg tcgagtggac gatctccgaa 960
ctgtttcgcc acccgaaaac catgcagaag ctgcaagcag aaatccgtgg tgtcgtgggc 1020
ccgacccgca atgtgagcga agatgacttg ccgaagctga cctatctgga catggtcgtt 1080
aaggaaggca tgcgtttgca tccggccgtg ccgctgcttc tgccgcatga gtctctggaa 1140
gaagccacga tcgatggcta ctacattccg aagggttccc gcattctgat caacgtctgg 1200
gcgattggtc gcgacccgaa ggcctggccg gatcgtcctg aagagttcat cccggagcgt 1260
ttcgagaaaa gcaacgtgga tgtgctgggc cgtgacttcc agctgctgcc gtttggttcg 1320
ggtcgtcgcg gttgtgcagg cattcgcctg ggcctgatct tcgtacgtct ggttctggca 1380
cagttagttc actgtttcga ctgggaactg gcgcgcaaca tggcgagcag cccggagaag 1440
ttggatatgg aagagaagtt cggcctggcg gtgcatcgtg tcaaccacct gaaagccctg 1500
ccgacgtatc gtctggagtg ctaagtcgac taactttaag aaggagatat atccatggaa 1560
cctagctctc agaaactgtc tccgttggaa tttgttgctg ctatcctgaa gggcgactac 1620
agcagcggtc aggttgaagg tggtccaccg ccaggtctgg cagctatgtt gatggaaaat 1680
aaggatttgg tgatggttct gacgacgtcc gtggcagtcc tgatcggctg tgtcgtggtc 1740
ctggcatggc gtcgtgcggc aggtagcggt aagtacaagc aacctgaact gcctaaactg 1800
gtggtcccga aagcagccga accggaggag gcagaggatg ataaaaccaa gatcagcgtg 1860
tttttcggca cccaaaccgg tacggcagaa ggtttcgcga aggcttttgt tgaagaggcc 1920
aaggcgcgtt atcagcaggc ccgtttcaaa gttatcgacc tggacgacta tgcggcagac 1980
gatgacgagt acgaagagaa actgaagaag gaaaacttgg cattcttctt cttggcgtcc 2040
tacggtgacg gcgagccgac ggacaacgcg gcacgctttt acaaatggtt tacggagggt 2100
aaggaccgtg gtgaatggct gaacaatctg cagtacggcg tttttggtct gggtaaccgt 2160
caatatgagc atttcaataa gatcgccatt gtcgtcgatg atctgatctt cgagcaaggt 2220
ggcaagaagc tggttccggt gggtctgggt gacgatgacc agtgcattga ggatgatttt 2280
gcggcgtggc gtgaactggt ctggccggaa ctggataaac tgctgcgtaa cgaagacgac 2340
gctaccgtgg caaccccgta cagcgccgct gtgctgcaat accgcgtggt tttccacgat 2400
cacattgacg gcctgattag cgaaaacggt agcccgaacg gtcatgctaa tggcaatacc 2460
gtgtacgatg cgcaacaccc gtgccgtagc aacgtcgcgg tcaagaagga attgcatact 2520
ccggcgagcg atcgcagctg cacccacctg gaatttaaca ttagcggtac cggcctgatg 2580
tacgagacgg gtgaccacgt cggtgtgtat tgcgagaacc tgttggaaac cgtggaggag 2640
gccgagaagt tgttgaacct gagcccgcag acgtacttct ccgttcacac cgacaacgag 2700
gacggtacgc cgttgagcgg cagcagcctg ccgccaccgt ttccgccgtg caccttgcgc 2760
acggcattga ccaaatacgc agacttgact tctgcaccga aaaagtcggt gctggtggcg 2820
ctggccgagt acgcatctga ccagggtgaa gcggatcgtt tgcgtttctt ggcgagcccg 2880
agcggcaaag aggaatatgc acagtacatc ttggcaagcc agcgcacgct gctggaggtc 2940
atggcggagt tcccgtcggc gaaaccgccg ctgggtgtct ttttcgcggg tgtcgctccg 3000
cgcctgcagc cgcgtttcta ttccattagc tctagcccga agatcgcacc gttccgtatt 3060
cacgtgacct gcgccctggt ttatgacaaa tcccctaccg gtcgcgttca taagggcatc 3120
tgtagcacgt ggatgaaaaa tgcggtcccg ctggaagaaa gcaacgattg ttcctgggct 3180
ccgatcttcg tccgcaacag caacttcaag ctgccgaccg acccgaaggt tccgattatc 3240
atgattggtc cgggtaccgg tctggcccct tttcgtggct ttttgcaaga gcgcttggcg 3300
ttgaaagaga gcggtgctga attgggtccg gcgatcttgt tctttggttg ccgtaaccgt 3360
aaaatggact ttatttacga ggatgaactg aatgatttcg tcaaagcggg cgttgtcagc 3420
gagctgatcg tcgcttttag ccgcgaaggc ccgatgaaag aatacgtgca acacaaaatg 3480
agccaacgtg cctccgatgt gtggaacatc attagcgacg gtggttatgt ttatgtttgc 3540
ggtgacgcga agggtatggc tcgtgatgtt caccgtaccc tgcataccat cgcacaggag 3600
caaggtagca tgtccagctc ggaggccgaa ggtatggtca aaaacctgca aaccaccggt 3660
cgttacctgc gtgatgtgtg gtaataaaag ctt 3693
<210> 83
<211> 5339
<212> DNA
<213> 人工序列
<220>
<223> SaCP816-CPRm-SaTPs647, 用作编码SaCP816、CPRm和倍半香桧烯B合酶的合成
操纵子
<400> 83
catatggcac tgttgttggc ggttttctgg agcgctttga ttattctggt tagcatctta 60
ttgcgtcgtc gtcaaaaacg caacaatttg ccaccgggcc caccggccct gccgatcatc 120
ggtaacattc acattctggg caccctgccg caccagagcc tgtacaatct ggcgaagaag 180
tacggtccga tcatgtccat gcgtttgggc ttggttccgg cggtggtcat cagcagcccg 240
gaagcggccg agctggtcct gaaaacccac gacatcgttt ttgcttctcg ccctcgtctg 300
caagttgcag attactttca ctatggcacc aaaggcgtga ttctgaccga atatggtacc 360
tactggcgta acatgcgtcg cctgtgcacg gtcaaactgc tgaacaccgt taagattgat 420
agctttgcag gcacccgcaa gaaagaagtc gctagcttcg ttcagagcct gaaagaagca 480
agcgtggcgc acaaaatggt taacctgtcc gcacgcgtcg ctaatgttat tgagaatatg 540
gtttgtctga tggttattgg tagatcgtct gacgagcgtt tcaagctgaa agaagtgatc 600
caagaagcgg cacagctggc gggtgccttc aatattggtg actatgtccc gtttctgatg 660
ccgctggatc tgcagggcct gactcgccgt atcaagagcg gtagcaaggc attcgatgac 720
atcctcgagg tcattatcga cgagcatgtg caagacatta aagatcatga cgatgagcag 780
catggtgact tcatcgacgt gctgctggcg atgatgaata agccgatgga ttctcgtgag 840
ggtctgtcca tcattgatcg cacgaacatt aaagcgatcc tggtggatat gatcggtgcc 900
gcgatggaca cgagcaccag cggtgtggag tgggcgattt cggagctgat taagcatcct 960
cgtgtcatga agaaactgca agacgaagtg aaaaccgtaa tcggtatgaa ccgcatggtg 1020
gaagaagcgg atctgccgaa actgccgtac ctggacatgg ttgtcaagga aacgatgcgt 1080
ctgcatccgc caggcccgct gctggtgccg cgtgaaagca tggaagatat tacgatcaac 1140
ggttactata tcccgaagaa atcccgcatt attgtgaatg catgggcgat cggccgtgac 1200
accaacgcct ggagcaataa tgcgcacgag tttttccctg agcgttttat gagctctaac 1260
gttgatctgc aaggccagga cttccagctg atcccgttcg gtagcggtcg tcgcggttgt 1320
ccgggcatgc gtctgggtct gacgacggtc cgcttggtgc tggcccaact gattcactgc 1380
ttcgacctgg agcttccgaa gggcaccgtc gcgactgacc tggatatgag cgagaagttt 1440
ggtctggcaa tgccgcgtgc gcagcactta ctggcctttc cgacctaccg tctggagagc 1500
taagtcgact aactttaaga aggagatata tccatggaac ctagctctca gaaactgtct 1560
ccgttggaat ttgttgctgc tatcctgaag ggcgactaca gcagcggtca ggttgaaggt 1620
ggtccaccgc caggtctggc agctatgttg atggaaaata aggatttggt gatggttctg 1680
acgacgtccg tggcagtcct gatcggctgt gtcgtggtcc tggcatggcg tcgtgcggca 1740
ggtagcggta agtacaagca acctgaactg cctaaactgg tggtcccgaa agcagccgaa 1800
ccggaggagg cagaggatga taaaaccaag atcagcgtgt ttttcggcac ccaaaccggt 1860
acggcagaag gtttcgcgaa ggcttttgtt gaagaggcca aggcgcgtta tcagcaggcc 1920
cgtttcaaag ttatcgacct ggacgactat gcggcagacg atgacgagta cgaagagaaa 1980
ctgaagaagg aaaacttggc attcttcttc ttggcgtcct acggtgacgg cgagccgacg 2040
gacaacgcgg cacgctttta caaatggttt acggagggta aggaccgtgg tgaatggctg 2100
aacaatctgc agtacggcgt ttttggtctg ggtaaccgtc aatatgagca tttcaataag 2160
atcgccattg tcgtcgatga tctgatcttc gagcaaggtg gcaagaagct ggttccggtg 2220
ggtctgggtg acgatgacca gtgcattgag gatgattttg cggcgtggcg tgaactggtc 2280
tggccggaac tggataaact gctgcgtaac gaagacgacg ctaccgtggc aaccccgtac 2340
agcgccgctg tgctgcaata ccgcgtggtt ttccacgatc acattgacgg cctgattagc 2400
gaaaacggta gcccgaacgg tcatgctaat ggcaataccg tgtacgatgc gcaacacccg 2460
tgccgtagca acgtcgcggt caagaaggaa ttgcatactc cggcgagcga tcgcagctgc 2520
acccacctgg aatttaacat tagcggtacc ggcctgatgt acgagacggg tgaccacgtc 2580
ggtgtgtatt gcgagaacct gttggaaacc gtggaggagg ccgagaagtt gttgaacctg 2640
agcccgcaga cgtacttctc cgttcacacc gacaacgagg acggtacgcc gttgagcggc 2700
agcagcctgc cgccaccgtt tccgccgtgc accttgcgca cggcattgac caaatacgca 2760
gacttgactt ctgcaccgaa aaagtcggtg ctggtggcgc tggccgagta cgcatctgac 2820
cagggtgaag cggatcgttt gcgtttcttg gcgagcccga gcggcaaaga ggaatatgca 2880
cagtacatct tggcaagcca gcgcacgctg ctggaggtca tggcggagtt cccgtcggcg 2940
aaaccgccgc tgggtgtctt tttcgcgggt gtcgctccgc gcctgcagcc gcgtttctat 3000
tccattagct ctagcccgaa gatcgcaccg ttccgtattc acgtgacctg cgccctggtt 3060
tatgacaaat cccctaccgg tcgcgttcat aagggcatct gtagcacgtg gatgaaaaat 3120
gcggtcccgc tggaagaaag caacgattgt tcctgggctc cgatcttcgt ccgcaacagc 3180
aacttcaagc tgccgaccga cccgaaggtt ccgattatca tgattggtcc gggtaccggt 3240
ctggcccctt ttcgtggctt tttgcaagag cgcttggcgt tgaaagagag cggtgctgaa 3300
ttgggtccgg cgatcttgtt ctttggttgc cgtaaccgta aaatggactt tatttacgag 3360
gatgaactga atgatttcgt caaagcgggc gttgtcagcg agctgatcgt cgcttttagc 3420
cgcgaaggcc cgatgaaaga atacgtgcaa cacaaaatga gccaacgtgc ctccgatgtg 3480
tggaacatca ttagcgacgg tggttatgtt tatgtttgcg gtgacgcgaa gggtatggct 3540
cgtgatgttc accgtaccct gcataccatc gcacaggagc aaggtagcat gtccagctcg 3600
gaggccgaag gtatggtcaa aaacctgcaa accaccggtc gttacctgcg tgatgtgtgg 3660
taataaaagc ttaggaggta aaaatggcga ccgttgtgga tgattctagc gtcgttcgtc 3720
gttctgcaaa ctacccgccg aatttgtggg actatgagtt cctgcaatcc ctgggtgacc 3780
agtgtacggt cgaagaaaaa cacctgaagc tggccgacaa gttgaaagaa gaagttaaat 3840
ccctgattaa acagacgatg gagccgctgg caaaactgga gttcatcgat accgtgcgtc 3900
gtttgggttt gaaatatcag tttgagaccg aggtgaagga ggccgttgtt atggttagca 3960
aatatgagaa tgatgcgtgg tggattgata atctgcacgc taccagcctg cgtttccgca 4020
tcatgcgtga gaatggtatc ttcgtgccgc aagatgtgtt tgaacgtttc aaagataccg 4080
acggctttaa aaaccaactg tgcgaagacg tgaagggtct gttgtctctg tatgaggcga 4140
gctttctggg ttgggagggc gaggatatct tggatgaggc acgcaccttt gcgaccagca 4200
agctgaagag cattgaaggc aaaattccga gcccgagcct ggctaagaaa gtgagccacg 4260
cgctggactt gcctctgcac tggcgtacca ttcgctacga agcgcgctgg ttcatcgaca 4320
cctacggtga agaagaggac gtgaatctga cgttgctgcg ttacgccaaa ctggacttca 4380
acattgttca atctttttac caaaaagaga tcggccgtct gtcccgctgg tgggtgggta 4440
ctggcctgga taaaatgccg tttgctcgta atggtctgat tcagagctat atgtacgcaa 4500
ttggtatgct gttcgagcct aacctgggcg aggtgcgtga gatggaggcg aaggtcggcg 4560
ccttgattac cacgatcgac gacgtgtatg acgtttacgg cacgatggag gagttggagc 4620
tgttcaccga tattaccaat cgttgggaca tcagcaaagc ggatcaactg ccgcgtaaca 4680
tccgcatgcc gctgctgacg atgttcaaca ccagcaatga tatcggttat tgggctctga 4740
aagagcgtgg tttcaatggc attccgtgta ccgcaaaagt ctggtccgac caactgaaga 4800
gctacaccaa ggaggctaaa tggttccacg aaggccataa accgactctg gaggagtatc 4860
tggacaatgc gctggtcagc atcggcttcc cgaacctgct ggtcacgtct tatctgttga 4920
ccgttgagaa tccgaccaaa gaaaagctgg actatgtgaa cagcctgccg ttgttcgttc 4980
gcgcgagctg catcctgtgt cgtatcatta acgatctggg tacgagcccg gatgaaatgg 5040
agcgtggtga caatctgaaa agcatccagt gctatatgaa cgaaaccggt gcgagccaag 5100
aggttgcgcg tgagcacatc gaaggcctgg ttcgtatgtg gtggaaacgt ctgaacaagt 5160
gcctgtttga gccgagcccg ttcactgagc cgttcctgag ctttacgatt aacgtggtcc 5220
gtggtagcca ctttttctat cagtacggcg atggctacgg caacgcagag agctggacca 5280
agaaccaggg tatgtcggtg ctgatccacc cgattaccct ggatgaagag taagaattc 5339
<210> 84
<211> 5360
<212> DNA
<213> 人工序列
<220>
<223> SaCP10374-CPRm-saTPs647, 用作编码SaCP10374、CPRm和倍半香桧烯B合酶的
合成操纵子
<400> 84
catatggcac tgctgctggc tgtcttttgg agcgcactga ttattctgac ccgcaaacgc 60
cgcaaaggtc cgggtctgcc accgggtccg cgtgcgtacc cgattattgg caatctgcac 120
atgatgggcc agctgccaca ccacaatttg cgtgagctgg cacgtgagta tggtccgatt 180
atgagcatgc gcctgggtct ggtgccggca atcgtggtta gctctcctga ggctgcgcag 240
ctgttcctca agacgcatga taccgttttc gcgagccgtc caaagaccga gactgccaaa 300
tacttccatt acggtatcaa aggtctgatc ctgaccgagt atggcccgta ctggcgcaat 360
attcgtcgtt tgagcaccgt taagctgttg aatgccgcga aaatcgatag cttcgcggct 420
atgcgtagaa gcgaagttga acgcctggtc gcgtccgttc gtggttcggc ggttcgtcgt 480
gaggttgtgg acgtcagcag caaagtggcg gaagctatgg agaatatggt ctgccagatg 540
gttatcggcc gttcaggtga cgatcgtttt aagctgaaag aaacctttca agagggcacc 600
caactggcag gcgcgttcaa ttttggtgag tttgtgccgt ttctgctgcc gctggacttg 660
caaggtatta cccgtcgcat caaagaagtc agcactcgtt tcaataagat tttggacctg 720
atcgttgacg agcacattcg cgatgccgct ggtaccaaaa acagcggcgg tcgtgatagc 780
gacaattttc tggatgttct gctgtccttg atgaacacct ctattagcga tagcaatgac 840
acgggtgaca acaaccgtaa caacgtgatc gagcgtgata acattaaagc gatcctgacg 900
gacatgctgg gtgcagcgat ggacacgagc gcgagcacgg tcgagtggac gatctccgaa 960
ctgtttcgcc acccgaaaac catgcagaag ctgcaagcag aaatccgtgg tgtcgtgggc 1020
ccgacccgca atgtgagcga agatgacttg ccgaagctga cctatctgga catggtcgtt 1080
aaggaaggca tgcgtttgca tccggccgtg ccgctgcttc tgccgcatga gtctctggaa 1140
gaagccacga tcgatggcta ctacattccg aagggttccc gcattctgat caacgtctgg 1200
gcgattggtc gcgacccgaa ggcctggccg gatcgtcctg aagagttcat cccggagcgt 1260
ttcgagaaaa gcaacgtgga tgtgctgggc cgtgacttcc agctgctgcc gtttggttcg 1320
ggtcgtcgcg gttgtgcagg cattcgcctg ggcctgatct tcgtacgtct ggttctggca 1380
cagttagttc actgtttcga ctgggaactg gcgcgcaaca tggcgagcag cccggagaag 1440
ttggatatgg aagagaagtt cggcctggcg gtgcatcgtg tcaaccacct gaaagccctg 1500
ccgacgtatc gtctggagag ctaagtcgac taactttaag aaggagatat atccatggaa 1560
cctagctctc agaaactgtc tccgttggaa tttgttgctg ctatcctgaa gggcgactac 1620
agcagcggtc aggttgaagg tggtccaccg ccaggtctgg cagctatgtt gatggaaaat 1680
aaggatttgg tgatggttct gacgacgtcc gtggcagtcc tgatcggctg tgtcgtggtc 1740
ctggcatggc gtcgtgcggc aggtagcggt aagtacaagc aacctgaact gcctaaactg 1800
gtggtcccga aagcagccga accggaggag gcagaggatg ataaaaccaa gatcagcgtg 1860
tttttcggca cccaaaccgg tacggcagaa ggtttcgcga aggcttttgt tgaagaggcc 1920
aaggcgcgtt atcagcaggc ccgtttcaaa gttatcgacc tggacgacta tgcggcagac 1980
gatgacgagt acgaagagaa actgaagaag gaaaacttgg cattcttctt cttggcgtcc 2040
tacggtgacg gcgagccgac ggacaacgcg gcacgctttt acaaatggtt tacggagggt 2100
aaggaccgtg gtgaatggct gaacaatctg cagtacggcg tttttggtct gggtaaccgt 2160
caatatgagc atttcaataa gatcgccatt gtcgtcgatg atctgatctt cgagcaaggt 2220
ggcaagaagc tggttccggt gggtctgggt gacgatgacc agtgcattga ggatgatttt 2280
gcggcgtggc gtgaactggt ctggccggaa ctggataaac tgctgcgtaa cgaagacgac 2340
gctaccgtgg caaccccgta cagcgccgct gtgctgcaat accgcgtggt tttccacgat 2400
cacattgacg gcctgattag cgaaaacggt agcccgaacg gtcatgctaa tggcaatacc 2460
gtgtacgatg cgcaacaccc gtgccgtagc aacgtcgcgg tcaagaagga attgcatact 2520
ccggcgagcg atcgcagctg cacccacctg gaatttaaca ttagcggtac cggcctgatg 2580
tacgagacgg gtgaccacgt cggtgtgtat tgcgagaacc tgttggaaac cgtggaggag 2640
gccgagaagt tgttgaacct gagcccgcag acgtacttct ccgttcacac cgacaacgag 2700
gacggtacgc cgttgagcgg cagcagcctg ccgccaccgt ttccgccgtg caccttgcgc 2760
acggcattga ccaaatacgc agacttgact tctgcaccga aaaagtcggt gctggtggcg 2820
ctggccgagt acgcatctga ccagggtgaa gcggatcgtt tgcgtttctt ggcgagcccg 2880
agcggcaaag aggaatatgc acagtacatc ttggcaagcc agcgcacgct gctggaggtc 2940
atggcggagt tcccgtcggc gaaaccgccg ctgggtgtct ttttcgcggg tgtcgctccg 3000
cgcctgcagc cgcgtttcta ttccattagc tctagcccga agatcgcacc gttccgtatt 3060
cacgtgacct gcgccctggt ttatgacaaa tcccctaccg gtcgcgttca taagggcatc 3120
tgtagcacgt ggatgaaaaa tgcggtcccg ctggaagaaa gcaacgattg ttcctgggct 3180
ccgatcttcg tccgcaacag caacttcaag ctgccgaccg acccgaaggt tccgattatc 3240
atgattggtc cgggtaccgg tctggcccct tttcgtggct ttttgcaaga gcgcttggcg 3300
ttgaaagaga gcggtgctga attgggtccg gcgatcttgt tctttggttg ccgtaaccgt 3360
aaaatggact ttatttacga ggatgaactg aatgatttcg tcaaagcggg cgttgtcagc 3420
gagctgatcg tcgcttttag ccgcgaaggc ccgatgaaag aatacgtgca acacaaaatg 3480
agccaacgtg cctccgatgt gtggaacatc attagcgacg gtggttatgt ttatgtttgc 3540
ggtgacgcga agggtatggc tcgtgatgtt caccgtaccc tgcataccat cgcacaggag 3600
caaggtagca tgtccagctc ggaggccgaa ggtatggtca aaaacctgca aaccaccggt 3660
cgttacctgc gtgatgtgtg gtaataaaag cttaggaggt aaaaatggcg accgttgtgg 3720
atgattctag cgtcgttcgt cgttctgcaa actacccgcc gaatttgtgg gactatgagt 3780
tcctgcaatc cctgggtgac cagtgtacgg tcgaagaaaa acacctgaag ctggccgaca 3840
agttgaaaga agaagttaaa tccctgatta aacagacgat ggagccgctg gcaaaactgg 3900
agttcatcga taccgtgcgt cgtttgggtt tgaaatatca gtttgagacc gaggtgaagg 3960
aggccgttgt tatggttagc aaatatgaga atgatgcgtg gtggattgat aatctgcacg 4020
ctaccagcct gcgtttccgc atcatgcgtg agaatggtat cttcgtgccg caagatgtgt 4080
ttgaacgttt caaagatacc gacggcttta aaaaccaact gtgcgaagac gtgaagggtc 4140
tgttgtctct gtatgaggcg agctttctgg gttgggaggg cgaggatatc ttggatgagg 4200
cacgcacctt tgcgaccagc aagctgaaga gcattgaagg caaaattccg agcccgagcc 4260
tggctaagaa agtgagccac gcgctggact tgcctctgca ctggcgtacc attcgctacg 4320
aagcgcgctg gttcatcgac acctacggtg aagaagagga cgtgaatctg acgttgctgc 4380
gttacgccaa actggacttc aacattgttc aatcttttta ccaaaaagag atcggccgtc 4440
tgtcccgctg gtgggtgggt actggcctgg ataaaatgcc gtttgctcgt aatggtctga 4500
ttcagagcta tatgtacgca attggtatgc tgttcgagcc taacctgggc gaggtgcgtg 4560
agatggaggc gaaggtcggc gccttgatta ccacgatcga cgacgtgtat gacgtttacg 4620
gcacgatgga ggagttggag ctgttcaccg atattaccaa tcgttgggac atcagcaaag 4680
cggatcaact gccgcgtaac atccgcatgc cgctgctgac gatgttcaac accagcaatg 4740
atatcggtta ttgggctctg aaagagcgtg gtttcaatgg cattccgtgt accgcaaaag 4800
tctggtccga ccaactgaag agctacacca aggaggctaa atggttccac gaaggccata 4860
aaccgactct ggaggagtat ctggacaatg cgctggtcag catcggcttc ccgaacctgc 4920
tggtcacgtc ttatctgttg accgttgaga atccgaccaa agaaaagctg gactatgtga 4980
acagcctgcc gttgttcgtt cgcgcgagct gcatcctgtg tcgtatcatt aacgatctgg 5040
gtacgagccc ggatgaaatg gagcgtggtg acaatctgaa aagcatccag tgctatatga 5100
acgaaaccgg tgcgagccaa gaggttgcgc gtgagcacat cgaaggcctg gttcgtatgt 5160
ggtggaaacg tctgaacaag tgcctgtttg agccgagccc gttcactgag ccgttcctga 5220
gctttacgat taacgtggtc cgtggtagcc actttttcta tcagtacggc gatggctacg 5280
gcaacgcaga gagctggacc aagaaccagg gtatgtcggt gctgatccac ccgattaccc 5340
tggatgaaga gtaagaattc 5360
<210> 85
<211> 5420
<212> DNA
<213> 人工序列
<220>
<223> SaCP816-CPRm-SaTPs30, 用作编码SaCP816、CPRm和β-甜没药烯合酶的合成
操纵子
<400> 85
catatggcac tgttgttggc ggttttctgg agcgctttga ttattctggt tagcatctta 60
ttgcgtcgtc gtcaaaaacg caacaatttg ccaccgggcc caccggccct gccgatcatc 120
ggtaacattc acattctggg caccctgccg caccagagcc tgtacaatct ggcgaagaag 180
tacggtccga tcatgtccat gcgtttgggc ttggttccgg cggtggtcat cagcagcccg 240
gaagcggccg agctggtcct gaaaacccac gacatcgttt ttgcttctcg ccctcgtctg 300
caagttgcag attactttca ctatggcacc aaaggcgtga ttctgaccga atatggtacc 360
tactggcgta acatgcgtcg cctgtgcacg gtcaaactgc tgaacaccgt taagattgat 420
agctttgcag gcacccgcaa gaaagaagtc gctagcttcg ttcagagcct gaaagaagca 480
agcgtggcgc acaaaatggt taacctgtcc gcacgcgtcg ctaatgttat tgagaatatg 540
gtttgtctga tggttattgg tagatcgtct gacgagcgtt tcaagctgaa agaagtgatc 600
caagaagcgg cacagctggc gggtgccttc aatattggtg actatgtccc gtttctgatg 660
ccgctggatc tgcagggcct gactcgccgt atcaagagcg gtagcaaggc attcgatgac 720
atcctcgagg tcattatcga cgagcatgtg caagacatta aagatcatga cgatgagcag 780
catggtgact tcatcgacgt gctgctggcg atgatgaata agccgatgga ttctcgtgag 840
ggtctgtcca tcattgatcg cacgaacatt aaagcgatcc tggtggatat gatcggtgcc 900
gcgatggaca cgagcaccag cggtgtggag tgggcgattt cggagctgat taagcatcct 960
cgtgtcatga agaaactgca agacgaagtg aaaaccgtaa tcggtatgaa ccgcatggtg 1020
gaagaagcgg atctgccgaa actgccgtac ctggacatgg ttgtcaagga aacgatgcgt 1080
ctgcatccgc caggcccgct gctggtgccg cgtgaaagca tggaagatat tacgatcaac 1140
ggttactata tcccgaagaa atcccgcatt attgtgaatg catgggcgat cggccgtgac 1200
accaacgcct ggagcaataa tgcgcacgag tttttccctg agcgttttat gagctctaac 1260
gttgatctgc aaggccagga cttccagctg atcccgttcg gtagcggtcg tcgcggttgt 1320
ccgggcatgc gtctgggtct gacgacggtc cgcttggtgc tggcccaact gattcactgc 1380
ttcgacctgg agcttccgaa gggcaccgtc gcgactgacc tggatatgag cgagaagttt 1440
ggtctggcaa tgccgcgtgc gcagcactta ctggcctttc cgacctaccg tctggagagc 1500
taagtcgact aactttaaga aggagatata tccatggaac ctagctctca gaaactgtct 1560
ccgttggaat ttgttgctgc tatcctgaag ggcgactaca gcagcggtca ggttgaaggt 1620
ggtccaccgc caggtctggc agctatgttg atggaaaata aggatttggt gatggttctg 1680
acgacgtccg tggcagtcct gatcggctgt gtcgtggtcc tggcatggcg tcgtgcggca 1740
ggtagcggta agtacaagca acctgaactg cctaaactgg tggtcccgaa agcagccgaa 1800
ccggaggagg cagaggatga taaaaccaag atcagcgtgt ttttcggcac ccaaaccggt 1860
acggcagaag gtttcgcgaa ggcttttgtt gaagaggcca aggcgcgtta tcagcaggcc 1920
cgtttcaaag ttatcgacct ggacgactat gcggcagacg atgacgagta cgaagagaaa 1980
ctgaagaagg aaaacttggc attcttcttc ttggcgtcct acggtgacgg cgagccgacg 2040
gacaacgcgg cacgctttta caaatggttt acggagggta aggaccgtgg tgaatggctg 2100
aacaatctgc agtacggcgt ttttggtctg ggtaaccgtc aatatgagca tttcaataag 2160
atcgccattg tcgtcgatga tctgatcttc gagcaaggtg gcaagaagct ggttccggtg 2220
ggtctgggtg acgatgacca gtgcattgag gatgattttg cggcgtggcg tgaactggtc 2280
tggccggaac tggataaact gctgcgtaac gaagacgacg ctaccgtggc aaccccgtac 2340
agcgccgctg tgctgcaata ccgcgtggtt ttccacgatc acattgacgg cctgattagc 2400
gaaaacggta gcccgaacgg tcatgctaat ggcaataccg tgtacgatgc gcaacacccg 2460
tgccgtagca acgtcgcggt caagaaggaa ttgcatactc cggcgagcga tcgcagctgc 2520
acccacctgg aatttaacat tagcggtacc ggcctgatgt acgagacggg tgaccacgtc 2580
ggtgtgtatt gcgagaacct gttggaaacc gtggaggagg ccgagaagtt gttgaacctg 2640
agcccgcaga cgtacttctc cgttcacacc gacaacgagg acggtacgcc gttgagcggc 2700
agcagcctgc cgccaccgtt tccgccgtgc accttgcgca cggcattgac caaatacgca 2760
gacttgactt ctgcaccgaa aaagtcggtg ctggtggcgc tggccgagta cgcatctgac 2820
cagggtgaag cggatcgttt gcgtttcttg gcgagcccga gcggcaaaga ggaatatgca 2880
cagtacatct tggcaagcca gcgcacgctg ctggaggtca tggcggagtt cccgtcggcg 2940
aaaccgccgc tgggtgtctt tttcgcgggt gtcgctccgc gcctgcagcc gcgtttctat 3000
tccattagct ctagcccgaa gatcgcaccg ttccgtattc acgtgacctg cgccctggtt 3060
tatgacaaat cccctaccgg tcgcgttcat aagggcatct gtagcacgtg gatgaaaaat 3120
gcggtcccgc tggaagaaag caacgattgt tcctgggctc cgatcttcgt ccgcaacagc 3180
aacttcaagc tgccgaccga cccgaaggtt ccgattatca tgattggtcc gggtaccggt 3240
ctggcccctt ttcgtggctt tttgcaagag cgcttggcgt tgaaagagag cggtgctgaa 3300
ttgggtccgg cgatcttgtt ctttggttgc cgtaaccgta aaatggactt tatttacgag 3360
gatgaactga atgatttcgt caaagcgggc gttgtcagcg agctgatcgt cgcttttagc 3420
cgcgaaggcc cgatgaaaga atacgtgcaa cacaaaatga gccaacgtgc ctccgatgtg 3480
tggaacatca ttagcgacgg tggttatgtt tatgtttgcg gtgacgcgaa gggtatggct 3540
cgtgatgttc accgtaccct gcataccatc gcacaggagc aaggtagcat gtccagctcg 3600
gaggccgaag gtatggtcaa aaacctgcaa accaccggtc gttacctgcg tgatgtgtgg 3660
taataaaagc ttaggaggta aaaatggacg cattcgcaac gagcccgacc agcgcactga 3720
ttaaggcggt taactgcatc gcgcacgtga ccccgatggc aggtgaagat tcctccgaaa 3780
accgccgtgc atcgaactac aaaccgagca cctgggacta tgaatttctg caaagcctgg 3840
ccacgagcca taacaccgtc caggaaaagc acatgaagat ggctgagaaa ttgaaggaag 3900
aggtgaagag catgatcaag ggtcagatgg agccggtggc gaagttggaa ctgatcaaca 3960
tcctgcagcg tctgggtttg aaatatcgct ttgaatccga gatcaaggaa gagctgtttt 4020
ccctgtacaa ggacggtact gatgcgtggt gggttgataa tctgcatgca acggcgctgc 4080
gttttagact gctgcgcgag aatggtattt tcgtgccgca agaagtattc gaaactttaa 4140
aggataagag cggtaagttt aagagccagc tgtgcaagga cgttcgtggt ctgctgagct 4200
tgtacgaggc gtcctacctg ggttgggagg gtgaggactt gctggacgag gccaagaagt 4260
tcagcaccac caacctgaac aatgtgaaag aaagcatcag cagcaacact ctgggtcgct 4320
tggtcaagca cgccctgaac ctgccgctgc actggtctgc ggcacgttac gaggcgagat 4380
ggtttattga cgagtacgaa aaagaagaaa acgttaaccc gaacctgctg aagtacgcga 4440
agtttgactt taacatcgtt cagagcattc accaacgtga gctcggtaac ctcgcgcgtt 4500
ggtgggtaga aaccggcctg gataaactga gcttcgtgcg caatacgttg atgcagaatt 4560
tcatgtgggg ctgtgcgatg gtgttcgaac cgcagtacgg caaggttcgc gatgcggccg 4620
tcaagcaggc cagcctgatt gcgatggtcg acgacgtgta tgacgtttat ggcagcctgg 4680
aagaactgga aatctttacc gatatcgtgg accgttggga tatcaccggt atcgacaagc 4740
tgccgcgtaa catctctatg attctgctga cgatgttcaa taccgcgaat cagattggtt 4800
acgacttgct gcgtgaccgc ggttttaacg gcatcccgca cattgctcag gcgtgggcca 4860
ccctgtgtaa gaaatatctg aaagaggcga agtggtatca tagcggttac aagccaactc 4920
tggaggagta cctggaaaac ggtcttgttt ctattagctt tgtgctgagc cttgttaccg 4980
catatctgca gaccgaaacc ctggagaatc tgacgtatga gtccgctgcg tacgtgaata 5040
gcgtaccgcc actggtccgc tacagcggcc tgctgaatcg tctgtacaac gatctcggta 5100
cgtcaagcgc agaaattgca cgtggtgaca ccctgaaaag catccagtgt tatatgaccc 5160
aaaccggtgc aaccgaggaa gcagcgcgcg agcacattaa aggtctggtt cacgaagcgt 5220
ggaagggcat gaacaaatgc ttgttcgagc agacgccatt cgcggagccg tttgtcggtt 5280
tcaacgtcaa taccgtccgc ggttcccaat tcttctacca gcatggcgac ggctacgcgg 5340
ttacggaaag ctggacgaag gacctgagcc tgtcggtgct gattcacccg atcccgctga 5400
atgaagagga ctaagaattc 5420
<210> 86
<211> 5441
<212> DNA
<213> 人工序列
<220>
<223> SaCP10374-CPRm-SaTPs30, 用作编码SaCP10374、CPRm和β-甜没药烯合酶的
合成操纵子
<400> 86
catatggcac tgctgctggc tgtcttttgg agcgcactga ttattctgac ccgcaaacgc 60
cgcaaaggtc cgggtctgcc accgggtccg cgtgcgtacc cgattattgg caatctgcac 120
atgatgggcc agctgccaca ccacaatttg cgtgagctgg cacgtgagta tggtccgatt 180
atgagcatgc gcctgggtct ggtgccggca atcgtggtta gctctcctga ggctgcgcag 240
ctgttcctca agacgcatga taccgttttc gcgagccgtc caaagaccga gactgccaaa 300
tacttccatt acggtatcaa aggtctgatc ctgaccgagt atggcccgta ctggcgcaat 360
attcgtcgtt tgagcaccgt taagctgttg aatgccgcga aaatcgatag cttcgcggct 420
atgcgtagaa gcgaagttga acgcctggtc gcgtccgttc gtggttcggc ggttcgtcgt 480
gaggttgtgg acgtcagcag caaagtggcg gaagctatgg agaatatggt ctgccagatg 540
gttatcggcc gttcaggtga cgatcgtttt aagctgaaag aaacctttca agagggcacc 600
caactggcag gcgcgttcaa ttttggtgag tttgtgccgt ttctgctgcc gctggacttg 660
caaggtatta cccgtcgcat caaagaagtc agcactcgtt tcaataagat tttggacctg 720
atcgttgacg agcacattcg cgatgccgct ggtaccaaaa acagcggcgg tcgtgatagc 780
gacaattttc tggatgttct gctgtccttg atgaacacct ctattagcga tagcaatgac 840
acgggtgaca acaaccgtaa caacgtgatc gagcgtgata acattaaagc gatcctgacg 900
gacatgctgg gtgcagcgat ggacacgagc gcgagcacgg tcgagtggac gatctccgaa 960
ctgtttcgcc acccgaaaac catgcagaag ctgcaagcag aaatccgtgg tgtcgtgggc 1020
ccgacccgca atgtgagcga agatgacttg ccgaagctga cctatctgga catggtcgtt 1080
aaggaaggca tgcgtttgca tccggccgtg ccgctgcttc tgccgcatga gtctctggaa 1140
gaagccacga tcgatggcta ctacattccg aagggttccc gcattctgat caacgtctgg 1200
gcgattggtc gcgacccgaa ggcctggccg gatcgtcctg aagagttcat cccggagcgt 1260
ttcgagaaaa gcaacgtgga tgtgctgggc cgtgacttcc agctgctgcc gtttggttcg 1320
ggtcgtcgcg gttgtgcagg cattcgcctg ggcctgatct tcgtacgtct ggttctggca 1380
cagttagttc actgtttcga ctgggaactg gcgcgcaaca tggcgagcag cccggagaag 1440
ttggatatgg aagagaagtt cggcctggcg gtgcatcgtg tcaaccacct gaaagccctg 1500
ccgacgtatc gtctggagag ctaagtcgac taactttaag aaggagatat atccatggaa 1560
cctagctctc agaaactgtc tccgttggaa tttgttgctg ctatcctgaa gggcgactac 1620
agcagcggtc aggttgaagg tggtccaccg ccaggtctgg cagctatgtt gatggaaaat 1680
aaggatttgg tgatggttct gacgacgtcc gtggcagtcc tgatcggctg tgtcgtggtc 1740
ctggcatggc gtcgtgcggc aggtagcggt aagtacaagc aacctgaact gcctaaactg 1800
gtggtcccga aagcagccga accggaggag gcagaggatg ataaaaccaa gatcagcgtg 1860
tttttcggca cccaaaccgg tacggcagaa ggtttcgcga aggcttttgt tgaagaggcc 1920
aaggcgcgtt atcagcaggc ccgtttcaaa gttatcgacc tggacgacta tgcggcagac 1980
gatgacgagt acgaagagaa actgaagaag gaaaacttgg cattcttctt cttggcgtcc 2040
tacggtgacg gcgagccgac ggacaacgcg gcacgctttt acaaatggtt tacggagggt 2100
aaggaccgtg gtgaatggct gaacaatctg cagtacggcg tttttggtct gggtaaccgt 2160
caatatgagc atttcaataa gatcgccatt gtcgtcgatg atctgatctt cgagcaaggt 2220
ggcaagaagc tggttccggt gggtctgggt gacgatgacc agtgcattga ggatgatttt 2280
gcggcgtggc gtgaactggt ctggccggaa ctggataaac tgctgcgtaa cgaagacgac 2340
gctaccgtgg caaccccgta cagcgccgct gtgctgcaat accgcgtggt tttccacgat 2400
cacattgacg gcctgattag cgaaaacggt agcccgaacg gtcatgctaa tggcaatacc 2460
gtgtacgatg cgcaacaccc gtgccgtagc aacgtcgcgg tcaagaagga attgcatact 2520
ccggcgagcg atcgcagctg cacccacctg gaatttaaca ttagcggtac cggcctgatg 2580
tacgagacgg gtgaccacgt cggtgtgtat tgcgagaacc tgttggaaac cgtggaggag 2640
gccgagaagt tgttgaacct gagcccgcag acgtacttct ccgttcacac cgacaacgag 2700
gacggtacgc cgttgagcgg cagcagcctg ccgccaccgt ttccgccgtg caccttgcgc 2760
acggcattga ccaaatacgc agacttgact tctgcaccga aaaagtcggt gctggtggcg 2820
ctggccgagt acgcatctga ccagggtgaa gcggatcgtt tgcgtttctt ggcgagcccg 2880
agcggcaaag aggaatatgc acagtacatc ttggcaagcc agcgcacgct gctggaggtc 2940
atggcggagt tcccgtcggc gaaaccgccg ctgggtgtct ttttcgcggg tgtcgctccg 3000
cgcctgcagc cgcgtttcta ttccattagc tctagcccga agatcgcacc gttccgtatt 3060
cacgtgacct gcgccctggt ttatgacaaa tcccctaccg gtcgcgttca taagggcatc 3120
tgtagcacgt ggatgaaaaa tgcggtcccg ctggaagaaa gcaacgattg ttcctgggct 3180
ccgatcttcg tccgcaacag caacttcaag ctgccgaccg acccgaaggt tccgattatc 3240
atgattggtc cgggtaccgg tctggcccct tttcgtggct ttttgcaaga gcgcttggcg 3300
ttgaaagaga gcggtgctga attgggtccg gcgatcttgt tctttggttg ccgtaaccgt 3360
aaaatggact ttatttacga ggatgaactg aatgatttcg tcaaagcggg cgttgtcagc 3420
gagctgatcg tcgcttttag ccgcgaaggc ccgatgaaag aatacgtgca acacaaaatg 3480
agccaacgtg cctccgatgt gtggaacatc attagcgacg gtggttatgt ttatgtttgc 3540
ggtgacgcga agggtatggc tcgtgatgtt caccgtaccc tgcataccat cgcacaggag 3600
caaggtagca tgtccagctc ggaggccgaa ggtatggtca aaaacctgca aaccaccggt 3660
cgttacctgc gtgatgtgtg gtaataaaag cttaggaggt aaaaatggac gcattcgcaa 3720
cgagcccgac cagcgcactg attaaggcgg ttaactgcat cgcgcacgtg accccgatgg 3780
caggtgaaga ttcctccgaa aaccgccgtg catcgaacta caaaccgagc acctgggact 3840
atgaatttct gcaaagcctg gccacgagcc ataacaccgt ccaggaaaag cacatgaaga 3900
tggctgagaa attgaaggaa gaggtgaaga gcatgatcaa gggtcagatg gagccggtgg 3960
cgaagttgga actgatcaac atcctgcagc gtctgggttt gaaatatcgc tttgaatccg 4020
agatcaagga agagctgttt tccctgtaca aggacggtac tgatgcgtgg tgggttgata 4080
atctgcatgc aacggcgctg cgttttagac tgctgcgcga gaatggtatt ttcgtgccgc 4140
aagaagtatt cgaaacttta aaggataaga gcggtaagtt taagagccag ctgtgcaagg 4200
acgttcgtgg tctgctgagc ttgtacgagg cgtcctacct gggttgggag ggtgaggact 4260
tgctggacga ggccaagaag ttcagcacca ccaacctgaa caatgtgaaa gaaagcatca 4320
gcagcaacac tctgggtcgc ttggtcaagc acgccctgaa cctgccgctg cactggtctg 4380
cggcacgtta cgaggcgaga tggtttattg acgagtacga aaaagaagaa aacgttaacc 4440
cgaacctgct gaagtacgcg aagtttgact ttaacatcgt tcagagcatt caccaacgtg 4500
agctcggtaa cctcgcgcgt tggtgggtag aaaccggcct ggataaactg agcttcgtgc 4560
gcaatacgtt gatgcagaat ttcatgtggg gctgtgcgat ggtgttcgaa ccgcagtacg 4620
gcaaggttcg cgatgcggcc gtcaagcagg ccagcctgat tgcgatggtc gacgacgtgt 4680
atgacgttta tggcagcctg gaagaactgg aaatctttac cgatatcgtg gaccgttggg 4740
atatcaccgg tatcgacaag ctgccgcgta acatctctat gattctgctg acgatgttca 4800
ataccgcgaa tcagattggt tacgacttgc tgcgtgaccg cggttttaac ggcatcccgc 4860
acattgctca ggcgtgggcc accctgtgta agaaatatct gaaagaggcg aagtggtatc 4920
atagcggtta caagccaact ctggaggagt acctggaaaa cggtcttgtt tctattagct 4980
ttgtgctgag ccttgttacc gcatatctgc agaccgaaac cctggagaat ctgacgtatg 5040
agtccgctgc gtacgtgaat agcgtaccgc cactggtccg ctacagcggc ctgctgaatc 5100
gtctgtacaa cgatctcggt acgtcaagcg cagaaattgc acgtggtgac accctgaaaa 5160
gcatccagtg ttatatgacc caaaccggtg caaccgagga agcagcgcgc gagcacatta 5220
aaggtctggt tcacgaagcg tggaagggca tgaacaaatg cttgttcgag cagacgccat 5280
tcgcggagcc gtttgtcggt ttcaacgtca ataccgtccg cggttcccaa ttcttctacc 5340
agcatggcga cggctacgcg gttacggaaa gctggacgaa ggacctgagc ctgtcggtgc 5400
tgattcaccc gatcccgctg aatgaagagg actaagaatt c 5441
<210> 87
<211> 5414
<212> DNA
<213> 人工序列
<220>
<223> SaCP816-CPRm-AaBFS, 用作编码SaCP816、CPRm和β-法呢烯合酶的合成操纵子
<400> 87
catatggcac tgttgttggc ggttttctgg agcgctttga ttattctggt tagcatctta 60
ttgcgtcgtc gtcaaaaacg caacaatttg ccaccgggcc caccggccct gccgatcatc 120
ggtaacattc acattctggg caccctgccg caccagagcc tgtacaatct ggcgaagaag 180
tacggtccga tcatgtccat gcgtttgggc ttggttccgg cggtggtcat cagcagcccg 240
gaagcggccg agctggtcct gaaaacccac gacatcgttt ttgcttctcg ccctcgtctg 300
caagttgcag attactttca ctatggcacc aaaggcgtga ttctgaccga atatggtacc 360
tactggcgta acatgcgtcg cctgtgcacg gtcaaactgc tgaacaccgt taagattgat 420
agctttgcag gcacccgcaa gaaagaagtc gctagcttcg ttcagagcct gaaagaagca 480
agcgtggcgc acaaaatggt taacctgtcc gcacgcgtcg ctaatgttat tgagaatatg 540
gtttgtctga tggttattgg tagatcgtct gacgagcgtt tcaagctgaa agaagtgatc 600
caagaagcgg cacagctggc gggtgccttc aatattggtg actatgtccc gtttctgatg 660
ccgctggatc tgcagggcct gactcgccgt atcaagagcg gtagcaaggc attcgatgac 720
atcctcgagg tcattatcga cgagcatgtg caagacatta aagatcatga cgatgagcag 780
catggtgact tcatcgacgt gctgctggcg atgatgaata agccgatgga ttctcgtgag 840
ggtctgtcca tcattgatcg cacgaacatt aaagcgatcc tggtggatat gatcggtgcc 900
gcgatggaca cgagcaccag cggtgtggag tgggcgattt cggagctgat taagcatcct 960
cgtgtcatga agaaactgca agacgaagtg aaaaccgtaa tcggtatgaa ccgcatggtg 1020
gaagaagcgg atctgccgaa actgccgtac ctggacatgg ttgtcaagga aacgatgcgt 1080
ctgcatccgc caggcccgct gctggtgccg cgtgaaagca tggaagatat tacgatcaac 1140
ggttactata tcccgaagaa atcccgcatt attgtgaatg catgggcgat cggccgtgac 1200
accaacgcct ggagcaataa tgcgcacgag tttttccctg agcgttttat gagctctaac 1260
gttgatctgc aaggccagga cttccagctg atcccgttcg gtagcggtcg tcgcggttgt 1320
ccgggcatgc gtctgggtct gacgacggtc cgcttggtgc tggcccaact gattcactgc 1380
ttcgacctgg agcttccgaa gggcaccgtc gcgactgacc tggatatgag cgagaagttt 1440
ggtctggcaa tgccgcgtgc gcagcactta ctggcctttc cgacctaccg tctggagagc 1500
taagtcgact aactttaaga aggagatata tccatggaac ctagctctca gaaactgtct 1560
ccgttggaat ttgttgctgc tatcctgaag ggcgactaca gcagcggtca ggttgaaggt 1620
ggtccaccgc caggtctggc agctatgttg atggaaaata aggatttggt gatggttctg 1680
acgacgtccg tggcagtcct gatcggctgt gtcgtggtcc tggcatggcg tcgtgcggca 1740
ggtagcggta agtacaagca acctgaactg cctaaactgg tggtcccgaa agcagccgaa 1800
ccggaggagg cagaggatga taaaaccaag atcagcgtgt ttttcggcac ccaaaccggt 1860
acggcagaag gtttcgcgaa ggcttttgtt gaagaggcca aggcgcgtta tcagcaggcc 1920
cgtttcaaag ttatcgacct ggacgactat gcggcagacg atgacgagta cgaagagaaa 1980
ctgaagaagg aaaacttggc attcttcttc ttggcgtcct acggtgacgg cgagccgacg 2040
gacaacgcgg cacgctttta caaatggttt acggagggta aggaccgtgg tgaatggctg 2100
aacaatctgc agtacggcgt ttttggtctg ggtaaccgtc aatatgagca tttcaataag 2160
atcgccattg tcgtcgatga tctgatcttc gagcaaggtg gcaagaagct ggttccggtg 2220
ggtctgggtg acgatgacca gtgcattgag gatgattttg cggcgtggcg tgaactggtc 2280
tggccggaac tggataaact gctgcgtaac gaagacgacg ctaccgtggc aaccccgtac 2340
agcgccgctg tgctgcaata ccgcgtggtt ttccacgatc acattgacgg cctgattagc 2400
gaaaacggta gcccgaacgg tcatgctaat ggcaataccg tgtacgatgc gcaacacccg 2460
tgccgtagca acgtcgcggt caagaaggaa ttgcatactc cggcgagcga tcgcagctgc 2520
acccacctgg aatttaacat tagcggtacc ggcctgatgt acgagacggg tgaccacgtc 2580
ggtgtgtatt gcgagaacct gttggaaacc gtggaggagg ccgagaagtt gttgaacctg 2640
agcccgcaga cgtacttctc cgttcacacc gacaacgagg acggtacgcc gttgagcggc 2700
agcagcctgc cgccaccgtt tccgccgtgc accttgcgca cggcattgac caaatacgca 2760
gacttgactt ctgcaccgaa aaagtcggtg ctggtggcgc tggccgagta cgcatctgac 2820
cagggtgaag cggatcgttt gcgtttcttg gcgagcccga gcggcaaaga ggaatatgca 2880
cagtacatct tggcaagcca gcgcacgctg ctggaggtca tggcggagtt cccgtcggcg 2940
aaaccgccgc tgggtgtctt tttcgcgggt gtcgctccgc gcctgcagcc gcgtttctat 3000
tccattagct ctagcccgaa gatcgcaccg ttccgtattc acgtgacctg cgccctggtt 3060
tatgacaaat cccctaccgg tcgcgttcat aagggcatct gtagcacgtg gatgaaaaat 3120
gcggtcccgc tggaagaaag caacgattgt tcctgggctc cgatcttcgt ccgcaacagc 3180
aacttcaagc tgccgaccga cccgaaggtt ccgattatca tgattggtcc gggtaccggt 3240
ctggcccctt ttcgtggctt tttgcaagag cgcttggcgt tgaaagagag cggtgctgaa 3300
ttgggtccgg cgatcttgtt ctttggttgc cgtaaccgta aaatggactt tatttacgag 3360
gatgaactga atgatttcgt caaagcgggc gttgtcagcg agctgatcgt cgcttttagc 3420
cgcgaaggcc cgatgaaaga atacgtgcaa cacaaaatga gccaacgtgc ctccgatgtg 3480
tggaacatca ttagcgacgg tggttatgtt tatgtttgcg gtgacgcgaa gggtatggct 3540
cgtgatgttc accgtaccct gcataccatc gcacaggagc aaggtagcat gtccagctcg 3600
gaggccgaag gtatggtcaa aaacctgcaa accaccggtc gttacctgcg tgatgtgtgg 3660
taataaaagc ttaggaggta aaaatgtcta ccctgccaat ttcttctgtg tcctttagct 3720
ccagcacttc gccactggtt gtcgatgaca aggtgagcac gaaaccggat gtgatccgtc 3780
acacgatgaa cttcaacgcg agcatttggg gcgatcaatt cctgacctat gacgagccgg 3840
aagatctggt aatgaagaaa caactggttg aggaacttaa agaagaagtg aagaaagaat 3900
tgatcaccat caagggtagc aacgagccga tgcaacatgt caagctgatc gagttgatcg 3960
acgcagttca acgcctgggc attgcctacc actttgaaga agagattgaa gaggccctgc 4020
agcacattca tgtcacctac ggtgagcagt gggtggacaa agagaatttg caatccatca 4080
gcctgtggtt tcgtctgctg cgtcaacagg gcttcaacgt gagcagcggt gtgtttaaag 4140
atttcatgga cgaaaagggt aagtttaaag agtccctgtg caatgatgca cagggtattt 4200
tggcgctgta tgaggccgca ttcatgcgcg ttgaagatga aaccattctg gacaacgctc 4260
tggagttcac caaggtgcat ctggacatca tcgctaagga cccgagctgt gattctagcc 4320
tgcgcacgca gattcaccag gctctgaagc agccgctgcg ccgtcgcctg gcacgtattg 4380
aggcgttaca ctatatgccg atctatcagc aagagactag ccatgacgaa gttctgctga 4440
aactggcaaa gctggacttt agcgttctgc agagcatgca caagaaagaa ctcagccata 4500
tttgcaagtg gtggaaagat ctggatctgc agaataagct gccgtacgtt cgtgaccgtg 4560
tcgttgaggg ctatttctgg atcttgagca tttactacga gccgcaacat gcgcgtaccc 4620
gtatgttcct gatgaaaacc tgtatgtggt tggttgtgct ggacgacacg tttgataact 4680
acggcacgta cgaagagttg gagattttca cccaagcggt agaacgttgg agcatctcgt 4740
gtctggacat gctgcctgag tatatgaagc tgatctacca ggaactggtc aatttacacg 4800
tcgagatgga agagagcctg gagaaagaag gcaagaccta tcagattcac tatgtgaaag 4860
aaatggcgaa agaactggtc cgcaactacc tggttgaggc gcgctggctg aaagagggct 4920
acatgccgac cctggaagag tatatgagcg tgagcatggt cacgggtacg tacggtctga 4980
tgatcgcccg cagctacgtc ggtcgtggcg acatcgttac cgaagatacc ttcaaatggg 5040
tttctagcta cccgccgatt atcaaggcaa gctgcgttat cgtgcgtttg atggatgata 5100
ttgttagcca caaagaagaa caagagcgtg gtcatgttgc tagcagcatt gagtgctaca 5160
gcaaagagtc cggtgcaagc gaagaagaag cgtgcgagta tatcagccgt aaggtcgagg 5220
acgcgtggaa agtcattaat cgcgagtccc tgcgtccgac cgcggttccg tttccgctgc 5280
tgatgcctgc gattaatctg gcgcgtatgt gtgaggtcct gtacagcgtg aatgacggtt 5340
ttacgcacgc cgagggtgat atgaaaagct atatgaagtc attctttgtg cacccgatgg 5400
ttgtgtaaga attc 5414
<210> 88
<211> 5435
<212> DNA
<213> 人工序列
<220>
<223> SaCP10374-CPRm-AaBFS, 用作编码SaCP10374、CPRm和β-法呢烯合酶的合成
操纵子
<400> 88
catatggcac tgctgctggc tgtcttttgg agcgcactga ttattctgac ccgcaaacgc 60
cgcaaaggtc cgggtctgcc accgggtccg cgtgcgtacc cgattattgg caatctgcac 120
atgatgggcc agctgccaca ccacaatttg cgtgagctgg cacgtgagta tggtccgatt 180
atgagcatgc gcctgggtct ggtgccggca atcgtggtta gctctcctga ggctgcgcag 240
ctgttcctca agacgcatga taccgttttc gcgagccgtc caaagaccga gactgccaaa 300
tacttccatt acggtatcaa aggtctgatc ctgaccgagt atggcccgta ctggcgcaat 360
attcgtcgtt tgagcaccgt taagctgttg aatgccgcga aaatcgatag cttcgcggct 420
atgcgtagaa gcgaagttga acgcctggtc gcgtccgttc gtggttcggc ggttcgtcgt 480
gaggttgtgg acgtcagcag caaagtggcg gaagctatgg agaatatggt ctgccagatg 540
gttatcggcc gttcaggtga cgatcgtttt aagctgaaag aaacctttca agagggcacc 600
caactggcag gcgcgttcaa ttttggtgag tttgtgccgt ttctgctgcc gctggacttg 660
caaggtatta cccgtcgcat caaagaagtc agcactcgtt tcaataagat tttggacctg 720
atcgttgacg agcacattcg cgatgccgct ggtaccaaaa acagcggcgg tcgtgatagc 780
gacaattttc tggatgttct gctgtccttg atgaacacct ctattagcga tagcaatgac 840
acgggtgaca acaaccgtaa caacgtgatc gagcgtgata acattaaagc gatcctgacg 900
gacatgctgg gtgcagcgat ggacacgagc gcgagcacgg tcgagtggac gatctccgaa 960
ctgtttcgcc acccgaaaac catgcagaag ctgcaagcag aaatccgtgg tgtcgtgggc 1020
ccgacccgca atgtgagcga agatgacttg ccgaagctga cctatctgga catggtcgtt 1080
aaggaaggca tgcgtttgca tccggccgtg ccgctgcttc tgccgcatga gtctctggaa 1140
gaagccacga tcgatggcta ctacattccg aagggttccc gcattctgat caacgtctgg 1200
gcgattggtc gcgacccgaa ggcctggccg gatcgtcctg aagagttcat cccggagcgt 1260
ttcgagaaaa gcaacgtgga tgtgctgggc cgtgacttcc agctgctgcc gtttggttcg 1320
ggtcgtcgcg gttgtgcagg cattcgcctg ggcctgatct tcgtacgtct ggttctggca 1380
cagttagttc actgtttcga ctgggaactg gcgcgcaaca tggcgagcag cccggagaag 1440
ttggatatgg aagagaagtt cggcctggcg gtgcatcgtg tcaaccacct gaaagccctg 1500
ccgacgtatc gtctggagag ctaagtcgac taactttaag aaggagatat atccatggaa 1560
cctagctctc agaaactgtc tccgttggaa tttgttgctg ctatcctgaa gggcgactac 1620
agcagcggtc aggttgaagg tggtccaccg ccaggtctgg cagctatgtt gatggaaaat 1680
aaggatttgg tgatggttct gacgacgtcc gtggcagtcc tgatcggctg tgtcgtggtc 1740
ctggcatggc gtcgtgcggc aggtagcggt aagtacaagc aacctgaact gcctaaactg 1800
gtggtcccga aagcagccga accggaggag gcagaggatg ataaaaccaa gatcagcgtg 1860
tttttcggca cccaaaccgg tacggcagaa ggtttcgcga aggcttttgt tgaagaggcc 1920
aaggcgcgtt atcagcaggc ccgtttcaaa gttatcgacc tggacgacta tgcggcagac 1980
gatgacgagt acgaagagaa actgaagaag gaaaacttgg cattcttctt cttggcgtcc 2040
tacggtgacg gcgagccgac ggacaacgcg gcacgctttt acaaatggtt tacggagggt 2100
aaggaccgtg gtgaatggct gaacaatctg cagtacggcg tttttggtct gggtaaccgt 2160
caatatgagc atttcaataa gatcgccatt gtcgtcgatg atctgatctt cgagcaaggt 2220
ggcaagaagc tggttccggt gggtctgggt gacgatgacc agtgcattga ggatgatttt 2280
gcggcgtggc gtgaactggt ctggccggaa ctggataaac tgctgcgtaa cgaagacgac 2340
gctaccgtgg caaccccgta cagcgccgct gtgctgcaat accgcgtggt tttccacgat 2400
cacattgacg gcctgattag cgaaaacggt agcccgaacg gtcatgctaa tggcaatacc 2460
gtgtacgatg cgcaacaccc gtgccgtagc aacgtcgcgg tcaagaagga attgcatact 2520
ccggcgagcg atcgcagctg cacccacctg gaatttaaca ttagcggtac cggcctgatg 2580
tacgagacgg gtgaccacgt cggtgtgtat tgcgagaacc tgttggaaac cgtggaggag 2640
gccgagaagt tgttgaacct gagcccgcag acgtacttct ccgttcacac cgacaacgag 2700
gacggtacgc cgttgagcgg cagcagcctg ccgccaccgt ttccgccgtg caccttgcgc 2760
acggcattga ccaaatacgc agacttgact tctgcaccga aaaagtcggt gctggtggcg 2820
ctggccgagt acgcatctga ccagggtgaa gcggatcgtt tgcgtttctt ggcgagcccg 2880
agcggcaaag aggaatatgc acagtacatc ttggcaagcc agcgcacgct gctggaggtc 2940
atggcggagt tcccgtcggc gaaaccgccg ctgggtgtct ttttcgcggg tgtcgctccg 3000
cgcctgcagc cgcgtttcta ttccattagc tctagcccga agatcgcacc gttccgtatt 3060
cacgtgacct gcgccctggt ttatgacaaa tcccctaccg gtcgcgttca taagggcatc 3120
tgtagcacgt ggatgaaaaa tgcggtcccg ctggaagaaa gcaacgattg ttcctgggct 3180
ccgatcttcg tccgcaacag caacttcaag ctgccgaccg acccgaaggt tccgattatc 3240
atgattggtc cgggtaccgg tctggcccct tttcgtggct ttttgcaaga gcgcttggcg 3300
ttgaaagaga gcggtgctga attgggtccg gcgatcttgt tctttggttg ccgtaaccgt 3360
aaaatggact ttatttacga ggatgaactg aatgatttcg tcaaagcggg cgttgtcagc 3420
gagctgatcg tcgcttttag ccgcgaaggc ccgatgaaag aatacgtgca acacaaaatg 3480
agccaacgtg cctccgatgt gtggaacatc attagcgacg gtggttatgt ttatgtttgc 3540
ggtgacgcga agggtatggc tcgtgatgtt caccgtaccc tgcataccat cgcacaggag 3600
caaggtagca tgtccagctc ggaggccgaa ggtatggtca aaaacctgca aaccaccggt 3660
cgttacctgc gtgatgtgtg gtaataaaag cttaggaggt aaaaatgtct accctgccaa 3720
tttcttctgt gtcctttagc tccagcactt cgccactggt tgtcgatgac aaggtgagca 3780
cgaaaccgga tgtgatccgt cacacgatga acttcaacgc gagcatttgg ggcgatcaat 3840
tcctgaccta tgacgagccg gaagatctgg taatgaagaa acaactggtt gaggaactta 3900
aagaagaagt gaagaaagaa ttgatcacca tcaagggtag caacgagccg atgcaacatg 3960
tcaagctgat cgagttgatc gacgcagttc aacgcctggg cattgcctac cactttgaag 4020
aagagattga agaggccctg cagcacattc atgtcaccta cggtgagcag tgggtggaca 4080
aagagaattt gcaatccatc agcctgtggt ttcgtctgct gcgtcaacag ggcttcaacg 4140
tgagcagcgg tgtgtttaaa gatttcatgg acgaaaaggg taagtttaaa gagtccctgt 4200
gcaatgatgc acagggtatt ttggcgctgt atgaggccgc attcatgcgc gttgaagatg 4260
aaaccattct ggacaacgct ctggagttca ccaaggtgca tctggacatc atcgctaagg 4320
acccgagctg tgattctagc ctgcgcacgc agattcacca ggctctgaag cagccgctgc 4380
gccgtcgcct ggcacgtatt gaggcgttac actatatgcc gatctatcag caagagacta 4440
gccatgacga agttctgctg aaactggcaa agctggactt tagcgttctg cagagcatgc 4500
acaagaaaga actcagccat atttgcaagt ggtggaaaga tctggatctg cagaataagc 4560
tgccgtacgt tcgtgaccgt gtcgttgagg gctatttctg gatcttgagc atttactacg 4620
agccgcaaca tgcgcgtacc cgtatgttcc tgatgaaaac ctgtatgtgg ttggttgtgc 4680
tggacgacac gtttgataac tacggcacgt acgaagagtt ggagattttc acccaagcgg 4740
tagaacgttg gagcatctcg tgtctggaca tgctgcctga gtatatgaag ctgatctacc 4800
aggaactggt caatttacac gtcgagatgg aagagagcct ggagaaagaa ggcaagacct 4860
atcagattca ctatgtgaaa gaaatggcga aagaactggt ccgcaactac ctggttgagg 4920
cgcgctggct gaaagagggc tacatgccga ccctggaaga gtatatgagc gtgagcatgg 4980
tcacgggtac gtacggtctg atgatcgccc gcagctacgt cggtcgtggc gacatcgtta 5040
ccgaagatac cttcaaatgg gtttctagct acccgccgat tatcaaggca agctgcgtta 5100
tcgtgcgttt gatggatgat attgttagcc acaaagaaga acaagagcgt ggtcatgttg 5160
ctagcagcat tgagtgctac agcaaagagt ccggtgcaag cgaagaagaa gcgtgcgagt 5220
atatcagccg taaggtcgag gacgcgtgga aagtcattaa tcgcgagtcc ctgcgtccga 5280
ccgcggttcc gtttccgctg ctgatgcctg cgattaatct ggcgcgtatg tgtgaggtcc 5340
tgtacagcgt gaatgacggt tttacgcacg ccgagggtga tatgaaaagc tatatgaagt 5400
cattctttgt gcacccgatg gttgtgtaag aattc 5435
<210> 89
<211> 5432
<212> DNA
<213> 人工序列
<220>
<223> SaCP816-CPRm-PaAFS, 用作编码SaCP816、CPRm和α-法呢烯合酶的合成操纵子
<400> 89
catatggcac tgttgttggc ggttttctgg agcgctttga ttattctggt tagcatctta 60
ttgcgtcgtc gtcaaaaacg caacaatttg ccaccgggcc caccggccct gccgatcatc 120
ggtaacattc acattctggg caccctgccg caccagagcc tgtacaatct ggcgaagaag 180
tacggtccga tcatgtccat gcgtttgggc ttggttccgg cggtggtcat cagcagcccg 240
gaagcggccg agctggtcct gaaaacccac gacatcgttt ttgcttctcg ccctcgtctg 300
caagttgcag attactttca ctatggcacc aaaggcgtga ttctgaccga atatggtacc 360
tactggcgta acatgcgtcg cctgtgcacg gtcaaactgc tgaacaccgt taagattgat 420
agctttgcag gcacccgcaa gaaagaagtc gctagcttcg ttcagagcct gaaagaagca 480
agcgtggcgc acaaaatggt taacctgtcc gcacgcgtcg ctaatgttat tgagaatatg 540
gtttgtctga tggttattgg tagatcgtct gacgagcgtt tcaagctgaa agaagtgatc 600
caagaagcgg cacagctggc gggtgccttc aatattggtg actatgtccc gtttctgatg 660
ccgctggatc tgcagggcct gactcgccgt atcaagagcg gtagcaaggc attcgatgac 720
atcctcgagg tcattatcga cgagcatgtg caagacatta aagatcatga cgatgagcag 780
catggtgact tcatcgacgt gctgctggcg atgatgaata agccgatgga ttctcgtgag 840
ggtctgtcca tcattgatcg cacgaacatt aaagcgatcc tggtggatat gatcggtgcc 900
gcgatggaca cgagcaccag cggtgtggag tgggcgattt cggagctgat taagcatcct 960
cgtgtcatga agaaactgca agacgaagtg aaaaccgtaa tcggtatgaa ccgcatggtg 1020
gaagaagcgg atctgccgaa actgccgtac ctggacatgg ttgtcaagga aacgatgcgt 1080
ctgcatccgc caggcccgct gctggtgccg cgtgaaagca tggaagatat tacgatcaac 1140
ggttactata tcccgaagaa atcccgcatt attgtgaatg catgggcgat cggccgtgac 1200
accaacgcct ggagcaataa tgcgcacgag tttttccctg agcgttttat gagctctaac 1260
gttgatctgc aaggccagga cttccagctg atcccgttcg gtagcggtcg tcgcggttgt 1320
ccgggcatgc gtctgggtct gacgacggtc cgcttggtgc tggcccaact gattcactgc 1380
ttcgacctgg agcttccgaa gggcaccgtc gcgactgacc tggatatgag cgagaagttt 1440
ggtctggcaa tgccgcgtgc gcagcactta ctggcctttc cgacctaccg tctggagagc 1500
taagtcgact aactttaaga aggagatata tccatggaac ctagctctca gaaactgtct 1560
ccgttggaat ttgttgctgc tatcctgaag ggcgactaca gcagcggtca ggttgaaggt 1620
ggtccaccgc caggtctggc agctatgttg atggaaaata aggatttggt gatggttctg 1680
acgacgtccg tggcagtcct gatcggctgt gtcgtggtcc tggcatggcg tcgtgcggca 1740
ggtagcggta agtacaagca acctgaactg cctaaactgg tggtcccgaa agcagccgaa 1800
ccggaggagg cagaggatga taaaaccaag atcagcgtgt ttttcggcac ccaaaccggt 1860
acggcagaag gtttcgcgaa ggcttttgtt gaagaggcca aggcgcgtta tcagcaggcc 1920
cgtttcaaag ttatcgacct ggacgactat gcggcagacg atgacgagta cgaagagaaa 1980
ctgaagaagg aaaacttggc attcttcttc ttggcgtcct acggtgacgg cgagccgacg 2040
gacaacgcgg cacgctttta caaatggttt acggagggta aggaccgtgg tgaatggctg 2100
aacaatctgc agtacggcgt ttttggtctg ggtaaccgtc aatatgagca tttcaataag 2160
atcgccattg tcgtcgatga tctgatcttc gagcaaggtg gcaagaagct ggttccggtg 2220
ggtctgggtg acgatgacca gtgcattgag gatgattttg cggcgtggcg tgaactggtc 2280
tggccggaac tggataaact gctgcgtaac gaagacgacg ctaccgtggc aaccccgtac 2340
agcgccgctg tgctgcaata ccgcgtggtt ttccacgatc acattgacgg cctgattagc 2400
gaaaacggta gcccgaacgg tcatgctaat ggcaataccg tgtacgatgc gcaacacccg 2460
tgccgtagca acgtcgcggt caagaaggaa ttgcatactc cggcgagcga tcgcagctgc 2520
acccacctgg aatttaacat tagcggtacc ggcctgatgt acgagacggg tgaccacgtc 2580
ggtgtgtatt gcgagaacct gttggaaacc gtggaggagg ccgagaagtt gttgaacctg 2640
agcccgcaga cgtacttctc cgttcacacc gacaacgagg acggtacgcc gttgagcggc 2700
agcagcctgc cgccaccgtt tccgccgtgc accttgcgca cggcattgac caaatacgca 2760
gacttgactt ctgcaccgaa aaagtcggtg ctggtggcgc tggccgagta cgcatctgac 2820
cagggtgaag cggatcgttt gcgtttcttg gcgagcccga gcggcaaaga ggaatatgca 2880
cagtacatct tggcaagcca gcgcacgctg ctggaggtca tggcggagtt cccgtcggcg 2940
aaaccgccgc tgggtgtctt tttcgcgggt gtcgctccgc gcctgcagcc gcgtttctat 3000
tccattagct ctagcccgaa gatcgcaccg ttccgtattc acgtgacctg cgccctggtt 3060
tatgacaaat cccctaccgg tcgcgttcat aagggcatct gtagcacgtg gatgaaaaat 3120
gcggtcccgc tggaagaaag caacgattgt tcctgggctc cgatcttcgt ccgcaacagc 3180
aacttcaagc tgccgaccga cccgaaggtt ccgattatca tgattggtcc gggtaccggt 3240
ctggcccctt ttcgtggctt tttgcaagag cgcttggcgt tgaaagagag cggtgctgaa 3300
ttgggtccgg cgatcttgtt ctttggttgc cgtaaccgta aaatggactt tatttacgag 3360
gatgaactga atgatttcgt caaagcgggc gttgtcagcg agctgatcgt cgcttttagc 3420
cgcgaaggcc cgatgaaaga atacgtgcaa cacaaaatga gccaacgtgc ctccgatgtg 3480
tggaacatca ttagcgacgg tggttatgtt tatgtttgcg gtgacgcgaa gggtatggct 3540
cgtgatgttc accgtaccct gcataccatc gcacaggagc aaggtagcat gtccagctcg 3600
gaggccgaag gtatggtcaa aaacctgcaa accaccggtc gttacctgcg tgatgtgtgg 3660
taataaaagc ttaggaggta aaaatggatc tggcagtgga aatcgcaatg gacctggcag 3720
tggatgatgt tgaacgccgt gtgggtgact atcattccaa tctgtgggac gacgacttca 3780
tccaaagcct gagcaccccg tatggcgcca gctcttaccg cgagcgtgcg gagcgcttgg 3840
tcggcgaggt caaagaaatg tttacgagca tcagcatcga ggatggtgag ctgacctctg 3900
acttgcttca acgcctgtgg atggttgaca acgttgagcg cctgggcatt agccgtcact 3960
tcgagaatga aatcaaggct gcaattgatt acgtctacag ctattggtcc gacaagggta 4020
ttgtccgtgg tagagatagc gccgtgccgg atctgaacag cattgctctg ggtttccgta 4080
cgttgcgtct gcatggttac accgttagca gcgatgtttt caaggtcttt caagaccgca 4140
aaggtgagtt tgcatgtagc gcgattccga cggaaggtga catcaaaggc gtactgaatc 4200
tgctgcgtgc aagctacatc gcgttccctg gtgagaaagt gatggagaaa gcgcagacct 4260
ttgccgcaac ttatctgaaa gaagcactgc agaagatcca agtgtctagc ctgagccgcg 4320
agatcgaata cgttctggag tatggctggc tgaccaattt tccgcgcctg gaagcgcgta 4380
actacatcga cgttttcggt gaggaaattt gtccgtactt caagaaaccg tgtattatgg 4440
ttgataagct gctggaactg gcgaaactgg agttcaattt gtttcactcg ctgcaacaga 4500
ccgagctgaa acacgtttcc cgttggtgga aggatagcgg ctttagccag ctgaccttca 4560
cgcgtcatcg tcacgtggag ttttacaccc tggctagctg tattgcaatt gaaccgaaac 4620
attctgcgtt tcgtttgggt ttcgcgaagg tctgctacct gggcattgtg ctggacgata 4680
tctatgacac gttcggtaaa atgaaagaac tggagttatt cacggcggca atcaagcgtt 4740
gggacccgag cacgaccgag tgcctgcctg agtatatgaa aggtgtctac atggcgtttt 4800
acaactgcgt taatgaactg gcgctgcaag ccgagaaaac ccagggccgt gacatgttga 4860
actatgcacg taaggcgtgg gaagccctgt tcgatgcgtt cctggaagaa gcgaagtgga 4920
ttagctccgg ctatctgccg acctttgagg aatacctgga gaacggcaaa gtgtccttcg 4980
gttatcgtgc tgccactctg cagccaatcc tgaccctgga cattccgttg ccgctgcaca 5040
tcttgcagca gatcgatttc ccgagccgct ttaatgacct ggccagctca attttgcgtc 5100
tgcgcggtga tatctgcggt tatcaagccg agcgttctcg tggcgaagag gcgagcagca 5160
ttagctgcta catgaaggac aatccgggtt ccaccgagga agatgcgctg agccacatta 5220
acgcgatgat ttcggacaac atcaacgaac tgaattggga gctgctgaag ccgaacagca 5280
atgttccaat cagcagcaaa aagcacgctt tcgatatcct gcgtgcgttt taccatctct 5340
ataagtaccg tgatggtttt agcattgcga agattgaaac gaagaacctg gtgatgcgca 5400
ccgtcctgga gccggtcccg atgtaagaat tc 5432
<210> 90
<211> 5453
<212> DNA
<213> 人工序列
<220>
<223> SaCP10374-CPRm-PaAFS, 用作编码SaCP10374、CPRm和α-法呢烯合酶的合成
操纵子
<400> 90
catatggcac tgctgctggc tgtcttttgg agcgcactga ttattctgac ccgcaaacgc 60
cgcaaaggtc cgggtctgcc accgggtccg cgtgcgtacc cgattattgg caatctgcac 120
atgatgggcc agctgccaca ccacaatttg cgtgagctgg cacgtgagta tggtccgatt 180
atgagcatgc gcctgggtct ggtgccggca atcgtggtta gctctcctga ggctgcgcag 240
ctgttcctca agacgcatga taccgttttc gcgagccgtc caaagaccga gactgccaaa 300
tacttccatt acggtatcaa aggtctgatc ctgaccgagt atggcccgta ctggcgcaat 360
attcgtcgtt tgagcaccgt taagctgttg aatgccgcga aaatcgatag cttcgcggct 420
atgcgtagaa gcgaagttga acgcctggtc gcgtccgttc gtggttcggc ggttcgtcgt 480
gaggttgtgg acgtcagcag caaagtggcg gaagctatgg agaatatggt ctgccagatg 540
gttatcggcc gttcaggtga cgatcgtttt aagctgaaag aaacctttca agagggcacc 600
caactggcag gcgcgttcaa ttttggtgag tttgtgccgt ttctgctgcc gctggacttg 660
caaggtatta cccgtcgcat caaagaagtc agcactcgtt tcaataagat tttggacctg 720
atcgttgacg agcacattcg cgatgccgct ggtaccaaaa acagcggcgg tcgtgatagc 780
gacaattttc tggatgttct gctgtccttg atgaacacct ctattagcga tagcaatgac 840
acgggtgaca acaaccgtaa caacgtgatc gagcgtgata acattaaagc gatcctgacg 900
gacatgctgg gtgcagcgat ggacacgagc gcgagcacgg tcgagtggac gatctccgaa 960
ctgtttcgcc acccgaaaac catgcagaag ctgcaagcag aaatccgtgg tgtcgtgggc 1020
ccgacccgca atgtgagcga agatgacttg ccgaagctga cctatctgga catggtcgtt 1080
aaggaaggca tgcgtttgca tccggccgtg ccgctgcttc tgccgcatga gtctctggaa 1140
gaagccacga tcgatggcta ctacattccg aagggttccc gcattctgat caacgtctgg 1200
gcgattggtc gcgacccgaa ggcctggccg gatcgtcctg aagagttcat cccggagcgt 1260
ttcgagaaaa gcaacgtgga tgtgctgggc cgtgacttcc agctgctgcc gtttggttcg 1320
ggtcgtcgcg gttgtgcagg cattcgcctg ggcctgatct tcgtacgtct ggttctggca 1380
cagttagttc actgtttcga ctgggaactg gcgcgcaaca tggcgagcag cccggagaag 1440
ttggatatgg aagagaagtt cggcctggcg gtgcatcgtg tcaaccacct gaaagccctg 1500
ccgacgtatc gtctggagtg ctaagtcgac taactttaag aaggagatat atccatggaa 1560
cctagctctc agaaactgtc tccgttggaa tttgttgctg ctatcctgaa gggcgactac 1620
agcagcggtc aggttgaagg tggtccaccg ccaggtctgg cagctatgtt gatggaaaat 1680
aaggatttgg tgatggttct gacgacgtcc gtggcagtcc tgatcggctg tgtcgtggtc 1740
ctggcatggc gtcgtgcggc aggtagcggt aagtacaagc aacctgaact gcctaaactg 1800
gtggtcccga aagcagccga accggaggag gcagaggatg ataaaaccaa gatcagcgtg 1860
tttttcggca cccaaaccgg tacggcagaa ggtttcgcga aggcttttgt tgaagaggcc 1920
aaggcgcgtt atcagcaggc ccgtttcaaa gttatcgacc tggacgacta tgcggcagac 1980
gatgacgagt acgaagagaa actgaagaag gaaaacttgg cattcttctt cttggcgtcc 2040
tacggtgacg gcgagccgac ggacaacgcg gcacgctttt acaaatggtt tacggagggt 2100
aaggaccgtg gtgaatggct gaacaatctg cagtacggcg tttttggtct gggtaaccgt 2160
caatatgagc atttcaataa gatcgccatt gtcgtcgatg atctgatctt cgagcaaggt 2220
ggcaagaagc tggttccggt gggtctgggt gacgatgacc agtgcattga ggatgatttt 2280
gcggcgtggc gtgaactggt ctggccggaa ctggataaac tgctgcgtaa cgaagacgac 2340
gctaccgtgg caaccccgta cagcgccgct gtgctgcaat accgcgtggt tttccacgat 2400
cacattgacg gcctgattag cgaaaacggt agcccgaacg gtcatgctaa tggcaatacc 2460
gtgtacgatg cgcaacaccc gtgccgtagc aacgtcgcgg tcaagaagga attgcatact 2520
ccggcgagcg atcgcagctg cacccacctg gaatttaaca ttagcggtac cggcctgatg 2580
tacgagacgg gtgaccacgt cggtgtgtat tgcgagaacc tgttggaaac cgtggaggag 2640
gccgagaagt tgttgaacct gagcccgcag acgtacttct ccgttcacac cgacaacgag 2700
gacggtacgc cgttgagcgg cagcagcctg ccgccaccgt ttccgccgtg caccttgcgc 2760
acggcattga ccaaatacgc agacttgact tctgcaccga aaaagtcggt gctggtggcg 2820
ctggccgagt acgcatctga ccagggtgaa gcggatcgtt tgcgtttctt ggcgagcccg 2880
agcggcaaag aggaatatgc acagtacatc ttggcaagcc agcgcacgct gctggaggtc 2940
atggcggagt tcccgtcggc gaaaccgccg ctgggtgtct ttttcgcggg tgtcgctccg 3000
cgcctgcagc cgcgtttcta ttccattagc tctagcccga agatcgcacc gttccgtatt 3060
cacgtgacct gcgccctggt ttatgacaaa tcccctaccg gtcgcgttca taagggcatc 3120
tgtagcacgt ggatgaaaaa tgcggtcccg ctggaagaaa gcaacgattg ttcctgggct 3180
ccgatcttcg tccgcaacag caacttcaag ctgccgaccg acccgaaggt tccgattatc 3240
atgattggtc cgggtaccgg tctggcccct tttcgtggct ttttgcaaga gcgcttggcg 3300
ttgaaagaga gcggtgctga attgggtccg gcgatcttgt tctttggttg ccgtaaccgt 3360
aaaatggact ttatttacga ggatgaactg aatgatttcg tcaaagcggg cgttgtcagc 3420
gagctgatcg tcgcttttag ccgcgaaggc ccgatgaaag aatacgtgca acacaaaatg 3480
agccaacgtg cctccgatgt gtggaacatc attagcgacg gtggttatgt ttatgtttgc 3540
ggtgacgcga agggtatggc tcgtgatgtt caccgtaccc tgcataccat cgcacaggag 3600
caaggtagca tgtccagctc ggaggccgaa ggtatggtca aaaacctgca aaccaccggt 3660
cgttacctgc gtgatgtgtg gtaataaaag cttaggaggt aaaaatggat ctggcagtgg 3720
aaatcgcaat ggacctggca gtggatgatg ttgaacgccg tgtgggtgac tatcattcca 3780
atctgtggga cgacgacttc atccaaagcc tgagcacccc gtatggcgcc agctcttacc 3840
gcgagcgtgc ggagcgcttg gtcggcgagg tcaaagaaat gtttacgagc atcagcatcg 3900
aggatggtga gctgacctct gacttgcttc aacgcctgtg gatggttgac aacgttgagc 3960
gcctgggcat tagccgtcac ttcgagaatg aaatcaaggc tgcaattgat tacgtctaca 4020
gctattggtc cgacaagggt attgtccgtg gtagagatag cgccgtgccg gatctgaaca 4080
gcattgctct gggtttccgt acgttgcgtc tgcatggtta caccgttagc agcgatgttt 4140
tcaaggtctt tcaagaccgc aaaggtgagt ttgcatgtag cgcgattccg acggaaggtg 4200
acatcaaagg cgtactgaat ctgctgcgtg caagctacat cgcgttccct ggtgagaaag 4260
tgatggagaa agcgcagacc tttgccgcaa cttatctgaa agaagcactg cagaagatcc 4320
aagtgtctag cctgagccgc gagatcgaat acgttctgga gtatggctgg ctgaccaatt 4380
ttccgcgcct ggaagcgcgt aactacatcg acgttttcgg tgaggaaatt tgtccgtact 4440
tcaagaaacc gtgtattatg gttgataagc tgctggaact ggcgaaactg gagttcaatt 4500
tgtttcactc gctgcaacag accgagctga aacacgtttc ccgttggtgg aaggatagcg 4560
gctttagcca gctgaccttc acgcgtcatc gtcacgtgga gttttacacc ctggctagct 4620
gtattgcaat tgaaccgaaa cattctgcgt ttcgtttggg tttcgcgaag gtctgctacc 4680
tgggcattgt gctggacgat atctatgaca cgttcggtaa aatgaaagaa ctggagttat 4740
tcacggcggc aatcaagcgt tgggacccga gcacgaccga gtgcctgcct gagtatatga 4800
aaggtgtcta catggcgttt tacaactgcg ttaatgaact ggcgctgcaa gccgagaaaa 4860
cccagggccg tgacatgttg aactatgcac gtaaggcgtg ggaagccctg ttcgatgcgt 4920
tcctggaaga agcgaagtgg attagctccg gctatctgcc gacctttgag gaatacctgg 4980
agaacggcaa agtgtccttc ggttatcgtg ctgccactct gcagccaatc ctgaccctgg 5040
acattccgtt gccgctgcac atcttgcagc agatcgattt cccgagccgc tttaatgacc 5100
tggccagctc aattttgcgt ctgcgcggtg atatctgcgg ttatcaagcc gagcgttctc 5160
gtggcgaaga ggcgagcagc attagctgct acatgaagga caatccgggt tccaccgagg 5220
aagatgcgct gagccacatt aacgcgatga tttcggacaa catcaacgaa ctgaattggg 5280
agctgctgaa gccgaacagc aatgttccaa tcagcagcaa aaagcacgct ttcgatatcc 5340
tgcgtgcgtt ttaccatctc tataagtacc gtgatggttt tagcattgcg aagattgaaa 5400
cgaagaacct ggtgatgcgc accgtcctgg agccggtccc gatgtaagaa ttc 5453
<210> 91
<211> 5370
<212> DNA
<213> 人工序列
<220>
<223> SaCP10374-CPRm-ClTps2, 用作编码SaCP10374、CPRm和α-檀香萜
合酶的合成操纵子
<400> 91
catatggcac tgctgctggc tgtcttttgg agcgcactga ttattctgac ccgcaaacgc 60
cgcaaaggtc cgggtctgcc accgggtccg cgtgcgtacc cgattattgg caatctgcac 120
atgatgggcc agctgccaca ccacaatttg cgtgagctgg cacgtgagta tggtccgatt 180
atgagcatgc gcctgggtct ggtgccggca atcgtggtta gctctcctga ggctgcgcag 240
ctgttcctca agacgcatga taccgttttc gcgagccgtc caaagaccga gactgccaaa 300
tacttccatt acggtatcaa aggtctgatc ctgaccgagt atggcccgta ctggcgcaat 360
attcgtcgtt tgagcaccgt taagctgttg aatgccgcga aaatcgatag cttcgcggct 420
atgcgtagaa gcgaagttga acgcctggtc gcgtccgttc gtggttcggc ggttcgtcgt 480
gaggttgtgg acgtcagcag caaagtggcg gaagctatgg agaatatggt ctgccagatg 540
gttatcggcc gttcaggtga cgatcgtttt aagctgaaag aaacctttca agagggcacc 600
caactggcag gcgcgttcaa ttttggtgag tttgtgccgt ttctgctgcc gctggacttg 660
caaggtatta cccgtcgcat caaagaagtc agcactcgtt tcaataagat tttggacctg 720
atcgttgacg agcacattcg cgatgccgct ggtaccaaaa acagcggcgg tcgtgatagc 780
gacaattttc tggatgttct gctgtccttg atgaacacct ctattagcga tagcaatgac 840
acgggtgaca acaaccgtaa caacgtgatc gagcgtgata acattaaagc gatcctgacg 900
gacatgctgg gtgcagcgat ggacacgagc gcgagcacgg tcgagtggac gatctccgaa 960
ctgtttcgcc acccgaaaac catgcagaag ctgcaagcag aaatccgtgg tgtcgtgggc 1020
ccgacccgca atgtgagcga agatgacttg ccgaagctga cctatctgga catggtcgtt 1080
aaggaaggca tgcgtttgca tccggccgtg ccgctgcttc tgccgcatga gtctctggaa 1140
gaagccacga tcgatggcta ctacattccg aagggttccc gcattctgat caacgtctgg 1200
gcgattggtc gcgacccgaa ggcctggccg gatcgtcctg aagagttcat cccggagcgt 1260
ttcgagaaaa gcaacgtgga tgtgctgggc cgtgacttcc agctgctgcc gtttggttcg 1320
ggtcgtcgcg gttgtgcagg cattcgcctg ggcctgatct tcgtacgtct ggttctggca 1380
cagttagttc actgtttcga ctgggaactg gcgcgcaaca tggcgagcag cccggagaag 1440
ttggatatgg aagagaagtt cggcctggcg gtgcatcgtg tcaaccacct gaaagccctg 1500
ccgacgtatc gtctggagtg ctaagtcgac taactttaag aaggagatat atccatggaa 1560
cctagctctc agaaactgtc tccgttggaa tttgttgctg ctatcctgaa gggcgactac 1620
agcagcggtc aggttgaagg tggtccaccg ccaggtctgg cagctatgtt gatggaaaat 1680
aaggatttgg tgatggttct gacgacgtcc gtggcagtcc tgatcggctg tgtcgtggtc 1740
ctggcatggc gtcgtgcggc aggtagcggt aagtacaagc aacctgaact gcctaaactg 1800
gtggtcccga aagcagccga accggaggag gcagaggatg ataaaaccaa gatcagcgtg 1860
tttttcggca cccaaaccgg tacggcagaa ggtttcgcga aggcttttgt tgaagaggcc 1920
aaggcgcgtt atcagcaggc ccgtttcaaa gttatcgacc tggacgacta tgcggcagac 1980
gatgacgagt acgaagagaa actgaagaag gaaaacttgg cattcttctt cttggcgtcc 2040
tacggtgacg gcgagccgac ggacaacgcg gcacgctttt acaaatggtt tacggagggt 2100
aaggaccgtg gtgaatggct gaacaatctg cagtacggcg tttttggtct gggtaaccgt 2160
caatatgagc atttcaataa gatcgccatt gtcgtcgatg atctgatctt cgagcaaggt 2220
ggcaagaagc tggttccggt gggtctgggt gacgatgacc agtgcattga ggatgatttt 2280
gcggcgtggc gtgaactggt ctggccggaa ctggataaac tgctgcgtaa cgaagacgac 2340
gctaccgtgg caaccccgta cagcgccgct gtgctgcaat accgcgtggt tttccacgat 2400
cacattgacg gcctgattag cgaaaacggt agcccgaacg gtcatgctaa tggcaatacc 2460
gtgtacgatg cgcaacaccc gtgccgtagc aacgtcgcgg tcaagaagga attgcatact 2520
ccggcgagcg atcgcagctg cacccacctg gaatttaaca ttagcggtac cggcctgatg 2580
tacgagacgg gtgaccacgt cggtgtgtat tgcgagaacc tgttggaaac cgtggaggag 2640
gccgagaagt tgttgaacct gagcccgcag acgtacttct ccgttcacac cgacaacgag 2700
gacggtacgc cgttgagcgg cagcagcctg ccgccaccgt ttccgccgtg caccttgcgc 2760
acggcattga ccaaatacgc agacttgact tctgcaccga aaaagtcggt gctggtggcg 2820
ctggccgagt acgcatctga ccagggtgaa gcggatcgtt tgcgtttctt ggcgagcccg 2880
agcggcaaag aggaatatgc acagtacatc ttggcaagcc agcgcacgct gctggaggtc 2940
atggcggagt tcccgtcggc gaaaccgccg ctgggtgtct ttttcgcggg tgtcgctccg 3000
cgcctgcagc cgcgtttcta ttccattagc tctagcccga agatcgcacc gttccgtatt 3060
cacgtgacct gcgccctggt ttatgacaaa tcccctaccg gtcgcgttca taagggcatc 3120
tgtagcacgt ggatgaaaaa tgcggtcccg ctggaagaaa gcaacgattg ttcctgggct 3180
ccgatcttcg tccgcaacag caacttcaag ctgccgaccg acccgaaggt tccgattatc 3240
atgattggtc cgggtaccgg tctggcccct tttcgtggct ttttgcaaga gcgcttggcg 3300
ttgaaagaga gcggtgctga attgggtccg gcgatcttgt tctttggttg ccgtaaccgt 3360
aaaatggact ttatttacga ggatgaactg aatgatttcg tcaaagcggg cgttgtcagc 3420
gagctgatcg tcgcttttag ccgcgaaggc ccgatgaaag aatacgtgca acacaaaatg 3480
agccaacgtg cctccgatgt gtggaacatc attagcgacg gtggttatgt ttatgtttgc 3540
ggtgacgcga agggtatggc tcgtgatgtt caccgtaccc tgcataccat cgcacaggag 3600
caaggtagca tgtccagctc ggaggccgaa ggtatggtca aaaacctgca aaccaccggt 3660
cgttacctgc gtgatgtgtg gtaataaaag cttgaaggag atatactaat gtctacccag 3720
caggttagct ccgagaatat cgttcgcaac gcggcgaact tccacccgaa tatctggggt 3780
aatcatttct tgacgtgtcc aagccagacg atcgattctt ggacgcaaca acaccataaa 3840
gagctgaaag aagaggtccg caagatgatg gtgagcgacg caaacaaacc ggcacaacgt 3900
ctgcgtctga ttgacaccgt tcaacgtttg ggcgtggcgt atcatttcga aaaagaaatc 3960
gatgacgctc tggaaaagat cggtcacgat ccgtttgacg ataaggatga cctgtatatc 4020
gttagcctgt gttttcgcct gctgcgtcag catggcatca agattagctg cgatgttttt 4080
gagaagttca aagacgacga tggcaagttt aaggcttccc tgatgaatga tgtccaaggt 4140
atgctgtcgt tgtatgaagc ggcccacctg gcaattcatg gcgaggacat cctggatgag 4200
gctattgtct ttacgaccac ccacctgaag agcaccgttt ctaactcccc ggtcaattcc 4260
acctttgcgg aacagattcg ccacagcctg cgtgtgccgc tgcgtaaggc agtcccgcgt 4320
ttggagagcc gctacttcct ggatatctat agccgtgacg acctgcacga caagactctg 4380
ctgaactttg ccaaactgga cttcaacatc ctgcaggcga tgcaccagaa agaggcaagc 4440
gagatgaccc gttggtggcg tgatttcgat ttcctgaaga agctgccgta cattcgtgat 4500
cgcgtggttg aactgtactt ttggattttg gtcggtgtga gctaccaacc gaaattcagc 4560
acgggtcgta tctttttgag caagattatc tgtctggaaa ccctggtgga cgacacgttt 4620
gatgcgtacg gtactttcga cgaactggcc attttcaccg aggccgttac gcgttgggac 4680
ctgggtcatc gcgacgcgct gcctgagtac atgaaattca ttttcaagac cctgattgat 4740
gtgtacagcg aggcggaaca agagctggca aaagagggcc gctcctatag cattcactat 4800
gcgatccgta gcttccagga gttggtcatg aagtactttt gcgaggcgaa atggctgaat 4860
aagggttatg ttccgagcct ggatgactac aagagcgtca gcctgcgcag catcggcttc 4920
ctgccgatcg ccgtggcttc ttttgttttc atgggcgaca ttgctacgaa agaggttttt 4980
gagtgggaaa tgaataaccc gaaaatcatc atcgcagccg aaaccatttt ccgctttctg 5040
gatgacattg caggtcatcg cttcgaacaa aaacgtgagc acagcccgag cgcaatcgag 5100
tgctacaaaa accaacatgg tgtctcggaa gaagaggcag tgaaagcgct gagcttggag 5160
gtcgccaatt cgtggaaaga cattaacgaa gagctgctgc tgaaccctat ggcaattcca 5220
ctgccgttgc tgcaggtgat cctggatttg agccgtagcg cggacttcat gtacggtaat 5280
gcgcaggacc gtttcacgca ctccaccatg atgaaagatc aagttgacct ggttctgaaa 5340
gatccggtga aactggacga ttaagaattc 5370
<210> 92
<211> 5423
<212> DNA
<213> 人工序列
<220>
<223> SaCP10374-CPRm-SaTps8201, 用作编码SaCP10374, CPRm和α-/β-檀香萜合酶
的合成操纵子
<400> 92
catatggcac tgctgctggc tgtcttttgg agcgcactga ttattctgac ccgcaaacgc 60
cgcaaaggtc cgggtctgcc accgggtccg cgtgcgtacc cgattattgg caatctgcac 120
atgatgggcc agctgccaca ccacaatttg cgtgagctgg cacgtgagta tggtccgatt 180
atgagcatgc gcctgggtct ggtgccggca atcgtggtta gctctcctga ggctgcgcag 240
ctgttcctca agacgcatga taccgttttc gcgagccgtc caaagaccga gactgccaaa 300
tacttccatt acggtatcaa aggtctgatc ctgaccgagt atggcccgta ctggcgcaat 360
attcgtcgtt tgagcaccgt taagctgttg aatgccgcga aaatcgatag cttcgcggct 420
atgcgtagaa gcgaagttga acgcctggtc gcgtccgttc gtggttcggc ggttcgtcgt 480
gaggttgtgg acgtcagcag caaagtggcg gaagctatgg agaatatggt ctgccagatg 540
gttatcggcc gttcaggtga cgatcgtttt aagctgaaag aaacctttca agagggcacc 600
caactggcag gcgcgttcaa ttttggtgag tttgtgccgt ttctgctgcc gctggacttg 660
caaggtatta cccgtcgcat caaagaagtc agcactcgtt tcaataagat tttggacctg 720
atcgttgacg agcacattcg cgatgccgct ggtaccaaaa acagcggcgg tcgtgatagc 780
gacaattttc tggatgttct gctgtccttg atgaacacct ctattagcga tagcaatgac 840
acgggtgaca acaaccgtaa caacgtgatc gagcgtgata acattaaagc gatcctgacg 900
gacatgctgg gtgcagcgat ggacacgagc gcgagcacgg tcgagtggac gatctccgaa 960
ctgtttcgcc acccgaaaac catgcagaag ctgcaagcag aaatccgtgg tgtcgtgggc 1020
ccgacccgca atgtgagcga agatgacttg ccgaagctga cctatctgga catggtcgtt 1080
aaggaaggca tgcgtttgca tccggccgtg ccgctgcttc tgccgcatga gtctctggaa 1140
gaagccacga tcgatggcta ctacattccg aagggttccc gcattctgat caacgtctgg 1200
gcgattggtc gcgacccgaa ggcctggccg gatcgtcctg aagagttcat cccggagcgt 1260
ttcgagaaaa gcaacgtgga tgtgctgggc cgtgacttcc agctgctgcc gtttggttcg 1320
ggtcgtcgcg gttgtgcagg cattcgcctg ggcctgatct tcgtacgtct ggttctggca 1380
cagttagttc actgtttcga ctgggaactg gcgcgcaaca tggcgagcag cccggagaag 1440
ttggatatgg aagagaagtt cggcctggcg gtgcatcgtg tcaaccacct gaaagccctg 1500
ccgacgtatc gtctggagtg ctaagtcgac taactttaag aaggagatat atccatggaa 1560
cctagctctc agaaactgtc tccgttggaa tttgttgctg ctatcctgaa gggcgactac 1620
agcagcggtc aggttgaagg tggtccaccg ccaggtctgg cagctatgtt gatggaaaat 1680
aaggatttgg tgatggttct gacgacgtcc gtggcagtcc tgatcggctg tgtcgtggtc 1740
ctggcatggc gtcgtgcggc aggtagcggt aagtacaagc aacctgaact gcctaaactg 1800
gtggtcccga aagcagccga accggaggag gcagaggatg ataaaaccaa gatcagcgtg 1860
tttttcggca cccaaaccgg tacggcagaa ggtttcgcga aggcttttgt tgaagaggcc 1920
aaggcgcgtt atcagcaggc ccgtttcaaa gttatcgacc tggacgacta tgcggcagac 1980
gatgacgagt acgaagagaa actgaagaag gaaaacttgg cattcttctt cttggcgtcc 2040
tacggtgacg gcgagccgac ggacaacgcg gcacgctttt acaaatggtt tacggagggt 2100
aaggaccgtg gtgaatggct gaacaatctg cagtacggcg tttttggtct gggtaaccgt 2160
caatatgagc atttcaataa gatcgccatt gtcgtcgatg atctgatctt cgagcaaggt 2220
ggcaagaagc tggttccggt gggtctgggt gacgatgacc agtgcattga ggatgatttt 2280
gcggcgtggc gtgaactggt ctggccggaa ctggataaac tgctgcgtaa cgaagacgac 2340
gctaccgtgg caaccccgta cagcgccgct gtgctgcaat accgcgtggt tttccacgat 2400
cacattgacg gcctgattag cgaaaacggt agcccgaacg gtcatgctaa tggcaatacc 2460
gtgtacgatg cgcaacaccc gtgccgtagc aacgtcgcgg tcaagaagga attgcatact 2520
ccggcgagcg atcgcagctg cacccacctg gaatttaaca ttagcggtac cggcctgatg 2580
tacgagacgg gtgaccacgt cggtgtgtat tgcgagaacc tgttggaaac cgtggaggag 2640
gccgagaagt tgttgaacct gagcccgcag acgtacttct ccgttcacac cgacaacgag 2700
gacggtacgc cgttgagcgg cagcagcctg ccgccaccgt ttccgccgtg caccttgcgc 2760
acggcattga ccaaatacgc agacttgact tctgcaccga aaaagtcggt gctggtggcg 2820
ctggccgagt acgcatctga ccagggtgaa gcggatcgtt tgcgtttctt ggcgagcccg 2880
agcggcaaag aggaatatgc acagtacatc ttggcaagcc agcgcacgct gctggaggtc 2940
atggcggagt tcccgtcggc gaaaccgccg ctgggtgtct ttttcgcggg tgtcgctccg 3000
cgcctgcagc cgcgtttcta ttccattagc tctagcccga agatcgcacc gttccgtatt 3060
cacgtgacct gcgccctggt ttatgacaaa tcccctaccg gtcgcgttca taagggcatc 3120
tgtagcacgt ggatgaaaaa tgcggtcccg ctggaagaaa gcaacgattg ttcctgggct 3180
ccgatcttcg tccgcaacag caacttcaag ctgccgaccg acccgaaggt tccgattatc 3240
atgattggtc cgggtaccgg tctggcccct tttcgtggct ttttgcaaga gcgcttggcg 3300
ttgaaagaga gcggtgctga attgggtccg gcgatcttgt tctttggttg ccgtaaccgt 3360
aaaatggact ttatttacga ggatgaactg aatgatttcg tcaaagcggg cgttgtcagc 3420
gagctgatcg tcgcttttag ccgcgaaggc ccgatgaaag aatacgtgca acacaaaatg 3480
agccaacgtg cctccgatgt gtggaacatc attagcgacg gtggttatgt ttatgtttgc 3540
ggtgacgcga agggtatggc tcgtgatgtt caccgtaccc tgcataccat cgcacaggag 3600
caaggtagca tgtccagctc ggaggccgaa ggtatggtca aaaacctgca aaccaccggt 3660
cgttacctgc gtgatgtgtg gtaataaaag cttaggaggt aaaacatatg gacagcagca 3720
ccgccaccgc aatgaccgca ccattcatcg acccgacgga tcatgtgaat ctgaaaaccg 3780
acacggatgc gagcgaaaat cgtcgtatgg gtaactacaa gccgagcatt tggaactacg 3840
attttctgca gtccctggcg acgcaccaca acattgttga agagcgtcac ctgaagctgg 3900
cagagaaact gaaaggtcaa gtgaaattca tgttcggtgc gccgatggag ccattggcta 3960
agttggagct ggttgatgtg gtgcaacgct tgggtctgaa ccacctgttc gagactgaaa 4020
tcaaagaagc tctgttcagc atctacaaag atggcagcaa tggctggtgg tttggccatc 4080
tgcatgctac ctctttgcgc ttccgtctgt tgcgccaatg tggcctgttt atcccgcagg 4140
acgttttcaa aacctttcaa aacaagaccg gtgagtttga catgaagctg tgcgacaacg 4200
ttaagggcct gctgagcctg tacgaggcga gctacctggg ctggaagggc gagaacatct 4260
tggatgaagc aaaggcgttc acgaccaagt gcctgaagag cgcatgggag aacattagcg 4320
agaagtggct ggcgaagcgt gttaaacatg cgttggcgct gccgctgcac tggcgtgttc 4380
cgcgtattga agcacgctgg tttatcgagg cctacgaaca agaggccaat atgaatccga 4440
cgctgctgaa actggcgaaa ctggacttca acatggtcca aagcattcac cagaaagaaa 4500
tcggtgaact ggcccgctgg tgggttacta ccggcctgga caagctggcg ttcgcacgca 4560
acaatctgtt gcagtcttat atgtggagct gcgccatcgc gtccgacccg aaattcaaac 4620
tggcgcgtga aaccattgtc gagatcggtt ccgtgttgac ggttgtcgac gacggctatg 4680
atgtgtacgg ttctatcgat gagctggacc tgtacaccag ctcggtggag cgttggtcct 4740
gtgtcgagat tgacaagctg cctaatacgc tgaagctgat ctttatgtct atgttcaaca 4800
aaaccaacga ggtgggtctg cgtgttcaac acgagcgtgg ttacaatagc atcccgacct 4860
tcattaaggc gtgggtggaa cagtgtaaga gctatcaaaa agaggcgcgt tggtttcatg 4920
gtggtcacac gcctccgctg gaagaataca gcctgaacgg tctggtcagc attggttttc 4980
cgctgttgct gatcaccggc tatgttgcga ttgctgagaa tgaagcagcc ctggataaag 5040
tccacccgct gccggacctg ctgcattatt ccagcttgct gagccgtctg attaatgata 5100
tcggcactag cccggatgaa atggcgcgtg gtgacaatct gaagagcatt cactgctata 5160
tgaatgaaac cggtgccagc gaagaggtcg cacgcgagca catcaaaggc gtcatcgaag 5220
agaattggaa aattctgaac cagtgttgct ttgaccagtc ccagttccag gagccgttca 5280
tcacgtttaa cctgaacagc gtgcgcggct cgcatttctt ctatgaattt ggtgatggtt 5340
ttggtgttac cgacagctgg accaaggtgg atatgaaaag cgtcctgatt gatccgattc 5400
cgctgggtga agagtaagaa ttc 5423
<210> 93
<211> 53
<212> DNA
<213> 人工序列
<220>
<223> 诱变反向引物AV8-L358-rev
<220>
<221> misc_feature
<222> (22)..(22)
<223> n为a, c, g或t
<400> 93
cacgcggcat caccagcgga vncggcggat gcaggcgcag ggtttcttta atc 53
<210> 94
<211> 40
<212> DNA
<213> 人工序列
<220>
<223> 引物AV8-pcw-fw
<400> 94
catcgatgct taggaggtca tatggctctg ttattagcag 40
<210> 95
<211> 25
<212> DNA
<213> 人工序列
<220>
<223> 引物AV8-L358-fw
<400> 95
tccgctggtg atgccgcgtg agtgc 25
<210> 96
<211> 38
<212> DNA
<213> 人工序列
<220>
<223> 引物AV8-CPR-rev
<400> 96
atatatctcc ttcttaaagt tagtcgactc attaggtg 38
<210> 97
<211> 65
<212> DNA
<213> 人工序列
<220>
<223> 正向引物CPRm_aaBFS_Inf1
<400> 97
ttacctgcgt gatgtgtggt aataaaagct taggaggtaa aaatgtctac cctgccaatt 60
tcttc 65
<210> 98
<211> 63
<212> DNA
<213> 人工序列
<220>
<223> 反向引物AaBFS_Inf2
<400> 98
atgtttgaca gcttatcatc gataagctga attcttacac aaccatcggg tgcacaaaga 60
atg 63
<210> 99
<211> 65
<212> DNA
<213> 人工序列
<220>
<223> 正向引物CPRm_PaAFS_Inf1
<400> 99
ttacctgcgt gatgtgtggt aataaaagct taggaggtaa aaatggatct ggcagtggaa 60
atcgc 65
<210> 100
<211> 66
<212> DNA
<213> 人工序列
<220>
<223> 反向引物PaAFS_Inf2
<400> 100
ctcatgtttg acagcttatc atcgataagc tgaattctta catcgggacc ggctccagga 60
cggtgc 66
<210> 101
<211> 60
<212> DNA
<213> 人工序列
<220>
<223> 正向引物CPRm_Tps647_inf1
<400> 101
gcgtgatgtg tggtaataaa agcttaggag gtaaaaatgg cgaccgttgt ggatgattct 60
<210> 102
<211> 53
<212> DNA
<213> 人工序列
<220>
<223> 反向引物Tps647_Inf2
<400> 102
gcttatcatc gataagctga attcttactc ttcatccagg gtaatcgggt gga 53
<210> 103
<211> 58
<212> DNA
<213> 人工序列
<220>
<223> 正向引物CPRm_Tps30_Inf1
<400> 103
gcgtgatgtg tggtaataaa agcttaggag gtaaaaatgg acgcattcgc aacgagcc 58
<210> 104
<211> 59
<212> DNA
<213> 人工序列
<220>
<223> 反向引物Tps30_Inf2
<400> 104
gtgatgtgtg gtaataaaaa gctgaattct tagtcctctt cattcagcgg gatcgggtg 59