掌桥专利:专业的专利平台
掌桥专利
首页

本发明是2014年9月19日申请的发明名称为“用于生产芳香醇的方法”的第201480051538.3号发明专利申请的分案申请。

技术领域

本领域涉及细胞色素P450s,及其生产倍半萜醇的用途。

背景技术

萜烃(比如α-檀香萜和β-檀香萜)通过生化途径(例如通过遗传改造的细胞)来生产。这些萜和源于该萜的醇是檀香油的主要组成成分,并且该醇是通常通过蒸馏檀香属物种(比如檀香木)的心材可商业获得的重要的香料成分。作为该醇的具体例,包括α-甜橙醇(α-sinensol)、β-甜橙醇(β-sinensol)、α-檀香醇、β-檀香醇、α-反- 香柠檬醇和表-β-檀香醇。虽然发展出了新型生化途径(包括基因工程细胞)来生成萜烃,但仍期望发现一种生化途径来生成并生产源于檀香萜的醇。更进一步期望使用一种生化途径,其不仅能生成该醇,并且更期望能够通过该生化途径来选择性地生产该醇的顺式异构体,比如α-异甜橙醇(iso-α-sinensol)、β-异甜橙醇、(Z)-α- 檀香醇、(Z)-β-檀香醇、(Z)-α-反-香柠檬醇和(Z)-表-β-檀香醇。

细胞色素P450s代表的是氧化酶类的酶族。P450s常用来催化单加氧酶反应。基于氨基酸序列的同源性,将细胞色素P450s酶分类为族和子族。相同子族的成员共享超过55%的氨基酸序列同一性并且具有通常相似的酶活性(底物和/或产物的选择性)。 CYP71AV1(NCBI登录号ABB82944.1,SEQ ID No.51和52)和 CYP71AV8(NCBI登录号ADM86719.1,SEQID No.1和2)是CYP71AV子族的两个成员,并且共享78%的氨基酸序列同一性。 CYP71AV1已被证明可氧化紫穗槐二烯(Teoh et al,FEBS letters 580 (2006)1411-1416)。CYP71AV8已被证明可氧化(+)-朱栾倍半萜、大根香叶烯A和紫穗槐二烯(Cankar et al,FEBSLett.585(1),178-182 (2011))。

已经报道有:在使用工程细胞的工序中,使用萜合酶来催化生产二萜或倍半萜。使用细胞色素P450多肽对该二萜或倍半萜进行进一步处理,从而催化由细胞产生的二萜或倍半萜的羟基化、氧化、脱甲基化或甲基化。

发明内容

本发明提供一种用于生产倍半萜醇的方法,包括:

i)使式(I)的萜烯与多肽接触,

该多肽具有与从由SEQ ID NO:2,SEQ ID NO:4,SEQ ID NO:6, SEQ ID NO:8,SEQID NO:28,SEQ ID NO:30,SEQ ID NO:32,SEQ ID NO: 34,SEQ ID NO:36,SEQ ID NO:38,SEQ ID NO:40,SEQ ID NO:42,SEQ ID NO:44,SEQ ID NO:50,SEQ ID NO:52,SEQ ID NO:54,SEQ ID NO:58, SEQ ID NO:60,SEQ ID NO:62,SEQ ID NO:64,SEQ ID NO:66,SEQ IDNO: 68,SEQ ID NO:71,SEQ ID NO:73,SEQ ID NO:79和SEQ ID NO:81构成的群组中选出的多肽至少有约45%序列同一性的氨基酸序列;以及

ii)任选地分离上述醇,该醇中,R是由9个碳构成的饱和的、单不饱和或多不饱和脂肪族基团,并且R可以是支链或由一个或多个非芳族环组成。

本发明更进一步提供一种用于生产倍半萜醇的方法,该倍半萜醇包括α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反-香柠檬醇和表-β-檀香醇、澳白檀醇(lancelol)和/或它们的混合物,该方法包括:

i)使α-法呢烯、β-法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯、表-β-檀香萜和/或β-甜没药烯与多肽接触,从而生产醇,该多肽具有与从由SEQ ID NO:2,SEQ ID NO:4,SEQID NO:6,SEQ ID NO:8, SEQ ID NO:28,SEQ ID NO:30,SEQ ID NO:32,SEQ ID NO:34,SEQID NO: 36,SEQ ID NO:38,SEQ ID NO:40,SEQ ID NO:42,SEQ ID NO:44,SEQ ID NO:50,SEQ ID NO:52,SEQ ID NO:54,SEQ ID NO:58,SEQ ID NO:60, SEQ ID NO:62,SEQ ID NO:64,SEQ ID NO:66,SEQ ID NO:68,SEQ ID NO: 71,SEQ ID NO:73,SEQ ID NO:79和SEQ IDNO:81构成的群组中选出的多肽至少有约45%序列同一性的氨基酸序列;

ii)任选地分离上述醇。

本发明还提供一种生产α-甜橙醇、β-甜橙醇、α-檀香醇、β- 檀香醇,α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α-檀香萜、β- 檀香萜、α-反-香柠檬烯、表-β-檀香萜和/或β-甜没药烯与具有P450s 单加氧酶活性的多肽接触,所生产出的倍半萜醇包含至少约36%的顺式异构体。

本发明更进一步提供一种分离的多肽,其具有单加氧酶活性并且包含氨基酸序列,该氨基酸序列与从由SEQ ID NO:71和SEQ ID NO:73构成的群组中选出的氨基酸序列有至少约45%、50%、 55%、50%、65%、70%、80%、90%、95%、98%或更多的同一性。

本发明更进一步提供一种分离的多肽,其具有单加氧酶活性并且包含氨基酸序列,该氨基酸序列与从由SEQ ID NO:79和SEQ ID NO:81构成的群组中选出的氨基酸序列有至少约45%、50%、 55%、50%、65%、70%、80%、90%、95%、98%或更多的同一性。

本发明还提供一种分离的多肽,其具有单加氧酶活性并且包含从由SEQ ID NO:28,SEQ ID NO:30,SEQ ID NO:32,SEQ ID NO:34, SEQ ID NO:36,SEQ ID NO:71,SEQ IDNO:73,SEQ ID NO:79和SEQ ID NO:81构成的群组中选出的氨基酸序列。

本发明更进一步提供一种用于生产倍半萜醇的方法,该倍半萜醇从由α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇和澳白檀醇或它们的混合物构成的群组中选出,该方法包括:

i)在适合于生产具有单加氧酶活性的p450多肽的条件下培养细胞,其中,该细胞:

a)生产丙烯酸焦磷酸酯萜前体;

b)表达P450还原酶;

c)表达具有α-法呢烯、β-法呢烯、α-檀香萜、β-檀香萜、α-反 -香柠檬烯和/或表-β-檀香萜的合酶活性,并且生产α-法呢烯、β- 法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜的多肽;并且

d)表达具有氨基酸序列的多肽,该氨基酸序列与从由SEQ ID NO:2,SEQ ID NO:4,SEQ ID NO:6,SEQ ID NO:8,SEQ ID NO:28,SEQ ID NO:30,SEQ ID NO:32,SEQ ID NO:34,SEQ ID NO:36,SEQ ID NO:38, SEQ ID NO:40,SEQ ID NO:42,SEQ ID NO:44,SEQ ID NO:50,SEQ ID NO: 52,SEQ ID NO:54,SEQ ID NO:58,SEQ ID NO:60,SEQ ID NO:62,SEQ IDNO:64,SEQ ID NO:66,SEQ ID NO:68,SEQ ID NO:71,SEQ ID NO:73, SEQ ID NO:79和SEQID NO:81构成的群组中选出的多肽至少有约 45%的序列同一性;并且

ii)任选地从该细胞中分离上述醇。

附图说明

具体实施方式

在一些实施方案中,提供一种用于生产倍半萜醇的方法,该倍半萜醇包括α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反- 香柠檬醇、表-β-檀香醇、澳白檀醇和/或它们的混合物,该方法包括:使α-法呢烯、β-法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:2有至少约45%、50%、55%、60%、65%、70%、80%、90%、95%或 98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:4有至少约45%、50%、55%、60%、65%、 70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:6有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:8有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:28有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:30有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:32有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:34有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:36有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:38有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:40有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:42有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:44有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:50有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:52有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:54有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:58有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:60有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:62有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:64有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:66有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:68有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:71有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:73有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:79有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

在一些实施方案中,提供一种生产α-甜橙醇、β-甜橙醇、α- 檀香醇、β-檀香醇、α-反-香柠檬醇、表-β-檀香醇、澳白檀醇和/ 或它们的混合物的方法,该方法包括:使α-法呢烯、β-法呢烯、α- 檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜与多肽接触,该多肽包含与SEQ ID NO:81有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的同一性的氨基酸序列。在特定的实施方案中,该方法包含一种表达上述多肽的细胞。

本发明提供的用于生产多肽(其用于生产醇)的核苷酸序列具有与从由SEQ IDNO:1,SEQ ID NO:3,SEQ ID NO:5,SEQ ID NO:7,SEQ ID NO:27,SEQ ID NO:29,SEQ ID NO:31,SEQ ID NO:33,SEQ ID NO:35, SEQ ID NO:37,SEQ ID NO:39,SEQ ID NO:41,SEQ IDNO:43,SEQ ID NO: 49,SEQ ID NO:51,SEQ ID NO:53,SEQ ID NO:57,SEQ ID NO:59,SEQID NO:61,SEQ ID NO:63,SEQ ID NO:65,SEQ ID NO:67,SEQ ID NO:70, SEQ ID NO:72,SEQ ID NO:78和SEQ ID NO:80构成的群组中选出的序列至少有约45%、50%、55%、60%、65%、70%、80%、90%、95%或98%的同一性的核酸序列。本发明所提供的核苷酸序列是异源的,因为它们不是典型地也不是通常地由表达于其中的细胞来制造,并且其相对于其所引入的细胞通常不是内源的–其典型地从另一个细胞获得或可以合成出。

在另一实施方案中,提供一种用于生产倍半萜醇的方法,该倍半萜醇包括α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反- 香柠檬醇、表-β-檀香醇、澳白檀醇和/或它们的混合物,该方法包括:使反-α-法呢烯、反-β-法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯、表-β-檀香萜和/或β-甜没药烯与具有P450单加氧酶活性的多肽接触,所生产出的倍半萜醇包含至少约36%的顺式异构体,上述多肽包含氨基酸序列,该氨基酸序列与具有从由SEQID NO:28, SEQ ID NO:30,SEQ ID NO:58,SEQ ID NO:60,SEQ ID NO:62,SEQ ID NO: 64,SEQ ID NO:66,SEQ ID NO:68,SEQ ID NO:71和SEQ ID NO:73构成的群组中选出的氨基酸序列的多肽有至少约45%、50%、55%、60%、 65%、70%、80%、90%、95%或98%的序列同一性。

在另一实施方案中,提供一种用于生产倍半萜醇的方法,该倍半萜醇包括α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反- 香柠檬醇、表-β-檀香醇、澳白檀醇和/或它们的混合物,该方法包括:使反-α-法呢烯、反-β-法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯、表-β-檀香萜和/或β-甜没药烯与具有P450单加氧酶活性的多肽接触,所生产出的倍半萜醇包含至少约46%的顺式异构体,上述多肽包含氨基酸序列,该氨基酸序列与具有从由SEQID NO:30, SEQ ID NO:58,SEQ ID NO:60,SEQ ID NO:62,SEQ ID NO:66,SEQ ID NO: 68,SEQ ID NO:71和SEQ ID NO:73构成的群组中选出的氨基酸序列的多肽有至少约45%、50%、55%、60%、65%、70%、80%、90%、 95%或98%的序列同一性。

在另一实施方案中,提供一种用于生产倍半萜醇的方法,该倍半萜醇包括α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反- 香柠檬醇、表-β-檀香醇、澳白檀醇和/或它们的混合物,该方法包括:使反-α-法呢烯、反-β-法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯、表-β-檀香萜和/或β-甜没药烯与具有P450单加氧酶活性的多肽接触,所生产出的倍半萜醇包含至少约50%的顺式异构体,上述多肽包含氨基酸序列,该氨基酸序列与具有从由SEQID NO:58, SEQ ID NO:60,SEQ ID NO:62,SEQ ID NO:68,SEQ ID NO:71和SEQ ID NO:73构成的群组中选出的氨基酸序列的多肽有至少约45%、 50%、55%、60%、65%、70%、80%、90%、95%或98%的序列同一性。

在另一实施方案中,提供一种用于生产倍半萜醇的方法,该倍半萜醇包括α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反- 香柠檬醇、表-β-檀香醇、澳白檀醇和/或它们的混合物,该方法包括:使反-α-法呢烯、反-β-法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯、表-β-檀香萜和/或β-甜没药烯与具有P450单加氧酶活性的多肽接触,所生产出的倍半萜醇包含至少约72%的顺式异构体,上述多肽包含氨基酸序列,该氨基酸序列与含有从由SEQID NO:58, SEQ ID NO:60,SEQ ID NO:62,SEQ ID NO:68,SEQ ID NO:71和SEQ ID NO:73构成的群组中选出的氨基酸序列的多肽有至少约45%、 50%、55%、60%、65%、70%、80%、90%、95%或98%的序列同一性。

在另一实施方案中,提供一种用于生产倍半萜醇的方法,该倍半萜醇包括α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反- 香柠檬醇、表-β-檀香醇、澳白檀醇和/或它们的混合物,该方法包括:使反-α-法呢烯、反-β-法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯、表-β-檀香萜和/或β-甜没药烯与具有P450单加氧酶活性的多肽接触,所生产出的倍半萜醇包含至少约96%的顺式异构体,上述多肽包含氨基酸序列,该氨基酸序列与具有从由SEQID NO: 68,SEQ ID NO:71和SEQ ID NO:73构成的群组中选出的氨基酸序列的多肽有至少约45%、50%、55%、60%、65%、70%、80%、 90%、95%或98%的序列同一性。

在另一实施方案中,提供一种用于生产倍半萜醇的方法,该倍半萜醇包括α-甜橙醇、β-甜橙醇、α-檀香醇、β-檀香醇、α-反- 香柠檬醇、表-β-檀香醇、澳白檀醇和/或它们的混合物,该方法包括:使反-α-法呢烯、反-β-法呢烯、α-檀香萜、β-檀香萜、α-反-香柠檬烯、表-β-檀香萜和/或β-甜没药烯与具有P450单加氧酶活性的多肽接触,所生产出的倍半萜醇包含至少约100%的顺式异构体,上述多肽包含氨基酸序列,该氨基酸序列与具有从由SEQID NO:71和SEQ ID NO:73构成的群组中选出的氨基酸序列的多肽有至少约45%、50%、55%、60%、65%、70%、80%、90%、95%或98%的序列同一性。

还提供一种分离的核酸分子,其从如下构成的群组中选出:i) 具有从由SEQ IDNO:70和SEQ ID NO:72构成的群组中选出的核酸序列的核酸;以及ii)核酸分子,其编码具有P450单加氧酶活性的多肽,该多肽包含与从由SEQ ID NOs:71和SEQ ID NO:73 构成的群组中选出的氨基酸序列有至少约45%、50%、55%、50%、 65%、70%、80%、90%、95%或98%或更多的同一性的氨基酸序列。更特别的是,所编码的多肽具有从由SEQ ID NOs:71和SEQ ID NO:73构成的群组中选出的序列。

还提供一种分离的核酸分子,其从如下构成的群组中选出:i) 具有从由SEQ IDNO:78和SEQ ID NO:80构成的群组中选出的核酸序列的核酸;以及ii)核酸分子,其编码具有P450单加氧酶活性的多肽,该多肽包含与从由SEQ ID NOs:79和SEQ ID NO:82 构成的群组中选出的氨基酸序列有至少约45%、50%、55%、50%、 65%、70%、80%、90%、95%或98%或更多的同一性的氨基酸序列。更特别的是,所编码的多肽具有从由SEQ ID NOs:79和SEQ ID NO:82构成的群组中选出的序列。

还提供一种分离的核酸分子,其从如下构成的群组中选出:i) 具有从由SEQID.NO:27,29,31,33和35构成的群组中选出的核酸序列的核酸;以及ii)核酸分子,其编码具有P450单加氧酶活性的多肽,该多肽具有从由SEQ ID NO:28,SEQ ID NO:30,SEQ ID NO:32,SEQ ID NO:34和SEQ ID NO:36构成的群组中选出的序列。

在另一实施方案中提供一种用于生产具有P450单加氧酶活性的多肽的方法,该方法包括:把核酸转化到宿主细胞或非人类生物体的步骤,该核酸编码与从由SEQ ID NO:71和SEQ ID NO:73 构成的群组中选出的多肽有至少约45%、50%、55%、50%、65%、 70%、80%、90%、95%或98%的序列同一性的多肽;以及在允许生产该多肽的条件下培养该宿主细胞或生物体的步骤。

在更进一步的实施方案中提供一种用于生产具有P450单加氧酶活性的多肽的方法,该方法包括:把核酸转化到宿主细胞或非人类生物体的步骤,该核酸编码具有从由SEQID NO:71和SEQ ID NO:73构成的群组中选出的序列的多肽;以及在允许生产该多肽的条件下培养该宿主细胞或生物体的步骤。

在另一实施方案中提供一种用于生产具有P450单加氧酶活性的多肽的方法,该方法包括:把核酸转化到宿主细胞或非人类生物体的步骤,该核酸编码与从由SEQ ID NO:79和SEQ ID NO:81 构成的群组中选出的多肽有至少约45%、50%、55%、50%、65%、 70%、80%、90%、95%或98%的序列同一性的多肽;以及在允许生产该多肽的条件下培养该宿主细胞或生物体的步骤。

在更进一步的实施方案中提供一种用于生产具有P450单加氧酶活性的多肽的方法,该方法包括:把核酸转化到宿主细胞或非人类生物体的步骤,该核酸编码具有从由SEQID NO:79和SEQ ID NO:81构成的群组中选出的序列的多肽;以及在允许生产该多肽的条件下培养该宿主细胞或生物体的步骤。

上述醇可以转化成醛或羧酸,例如但不限于甜橙醛、檀香醛,香柠檬醛和澳白檀醛(lanceals)。所述醇、醛或酸可以进一步转化为衍生物,例如但不限于酯、酰胺、糖苷、醚或缩醛。

本发明所述的核酸和多肽可以分离自比如菊苣(Cichorium intybus L.)、巨大芽孢杆菌(Bacillus megaterium)、檀香树(Santalum Album) 和黄花蒿(Artemisia annua)。CYP71AV8,P450-BM3(CYP102A1)和 CYP71AV1包括变体均描述于此。

来自于植物菊苣(Cichorium intybus L.)的CYP71AV8已被定性为可区域选择性地氧化(+)-朱栾倍半萜生产反-诺卡醇、顺-诺卡醇和(+)-诺卡酮的P450单加氧酶。CYP71AV8也被发现催化大根香叶烯A和紫穗槐-4,11-二烯在C-12位置处的氧化(Cankar etal, FEBS Lett.585(1),178-182(2011))。野生型酶的氨基酸序列(NCBI登录号 NoADM86719.1,SEQ ID No 1和2)被用作设计在大肠杆菌(E.coli)中最优化表达的cDNA序列。

在真核生物中,P450单加氧酶是膜结合蛋白,并且这些蛋白质的N末端序列构成的膜锚定对这些酶的膜定位是至关重要的。蛋白质的这部分通常被富含脯氨酸结构域所划定,其对于酶活性的特异性的控制并不十分重要。因此,这个区域可以通过缺失、插入或突变进行修饰,而对催化活性不产生影响。然而,包含植物P450s的真核细胞色素P450s的N末端区域的特定修饰已经显示出在微生物中表达时具有对功能性重组蛋白的层级具有积极影响(Halkier et al(1995)Arch.Biochem.Biophys.322,369-377;Haudenschield et al(2000)Arch.Biochem.Biophys.379,127-136)。

在P450单加氧酶中,底物的识别和结合由被分布在蛋白质氨基酸序列的不同区域的几个氨基酸残基来控制。被定义为底物识别位点(SRS)的这些区域可以通过基于Gotoh所做的具体工作的简单的序列比对而被定位于任何P450的氨基酸序列中(Gotoh O(1992)J.Biol.Chem.267(1),83-90)。因此,在与底物相互作用并可以影响羟基化反应的区域选择性的CYP71AV8蛋白质残基为氨基酸Asn98 到Gly121、Thr198到Leu205、Lys232到Ile240、Asn282到Ala300、His355 到Arg367以及Thr469到Val 476。在这些区域内的一个或多个残基的修饰可以潜在地改变底物的特异性、其反应的立体化学性或者其区域选择性。作为通过P450来催化的反应的立体化学性的改变可见于Schalk et al(2002)Proc.Natl.Acad.Sci.USA 97(22),11948-11953。在该出版物中,植物P450酶的单个残基的变化会导致酶反应的区域专一性完全转变。

在这里“倍半萜合酶”或“具有倍半萜合酶活性的多肽”是指作为本申请的多肽,该多肽能够催化由从香叶基焦磷酸(GPP)、法呢基二磷酸(FPP)和香叶基香叶基焦磷酸(GGPP)构成的组中选出的无环焦磷酸萜前体合成为倍半萜分子或倍半萜分子的混合物。

α-檀香萜、β-檀香萜、α-反-香柠檬烯和/或表-β-檀香萜可通过例如美国专利公开号2011-0008836(公开日:2011年1月13日) 以及美国专利公开号2011-0281257(公开日:2011年11月27日) 中所述的合酶来制备,这两篇文献的整体并入于此。

根据本发明,多肽也意味着包括截短的多肽,条件是它们能够保持如任何上述实施方案所定义的P450单加氧酶活性。

在两个肽或核苷酸序列之间的同一性百分比是在已经生成这两个序列的比对时,在这两个序列中相同的氨基酸或核苷酸残基数目的函数。相同的残基定义为在这两个序列中在给定的比对位置上相同的残基。在此处使用的序列同一性的百分比是由最优比对,通过将在两个序列之间的相同残基的数目除以在最短的序列中的残基的总数然后乘以100而计算得到的。该最佳比对是同一性百分比可能是最高的比对。在该比对的一个或多个位置中的一或两个序列中引入间隙以便获得最佳比对。然后在序列同一性的百分比计算中将这些间隙作为不相同的残基来考虑。

以测定氨基酸或核酸序列同一性的百分比为目的的比对可以使用计算机程序以多种方式来实现,例如在万维网上可获得的公开可用的计算机程序。优选,可使用来自National Center for Biotechnology Information(NCBI)的地址为http://www.ncbi.nlm.nih.gov/BLAST/bl2seq/wblast2.cgi 的BLAST程序(Tatiana等,FEMSMicrobiol Lett.,1999,174:247-250, 1999),其参数设为默认,来获得肽或核苷酸序列的最优比对,以及来计算序列同一性的百分比。

特定的生物体或细胞,当其天然产生FPP时,或当其并不天然产生FPP但可经转化以产生FPP时,不管是用描述于此的核酸转化之前还是与所述核酸一起都意味着“能够产生FPP”。经转化的与天然存在的生物体或细胞相比产生更高量FPP的生物体或细胞也包括在“能够产生FPP的生物体或细胞”内。转化生物体(例如微生物)以便它们产生FPP的方法已经是本领域公知的。这些方法可以例如在文献中找到,例如在下列出版物中:Martin,V.J.,Pitera, D.J.,Withers,S.T.,Newman,J.D.和Keasling,J.D.Nat Biotechnol.,2003,21(7), 796-802(大肠杆菌(E.coli)的转化);Wu,S.,Schalk,M.,Clark,A.,Miles,R.B.,Coates,R.和Chappell,J.,Nat Biotechnol.,2006,24(11),1441-1447(植物的转化);Takahashi,S.,Yeo,Y.,Greenhagen,B.T.,McMullin,T.,Song,L., Maurina-Brunker,J.,Rosson,R.,Noel,J.,Chappell,J,Biotechnology andBioengineering,2007,97(1),170-181(酵母的转化)。

适合于在体内进行本发明方法的非人宿主生物体可以是任何非人的多细胞或单细胞生物体。在优选的实施方案中,用于在体内进行本发明的非人宿主生物体是植物、原核生物或真菌。可以使用任何植物、原核生物或真菌。特别地,可使用的植物是天然产生高数量萜的植物。在更优选的方案中,该植物选自茄科 (Solanaceae)、禾本科(Poaceae)、十字花科(Brassicaceae)、碟形花科 (Fabaceae)、锦葵科(Malvaceae)、菊科(Asteraceae)或唇形科(Lamiaceae)。例如,该植物选自烟草属(Nicotiana)、茄属(Solanum)、高粱属(Sorghum)、拟南芥属(Arabidopsis)、芸苔属(Brassica(油菜))、苜蓿属 (Medicago(紫花苜蓿))、棉属(Gossypium(棉花))、蒿属(Artemisia)、鼠尾草属(Salvia)和薄荷属(Mentha)。优选,该植物属于烟草(Nicotiana tabacum)种。

在更优选的实施方案中,用于在体内进行本发明方法的非人宿主生物体是微生物。可以使用任何微生物,但是根据更加优选的实施方案,所述微生物是细菌或酵母。最优选,所述细菌是大肠杆菌(E.coli),所述酵母是酿酒酵母(Saccharomyces cerevisiae)。

这些生物体中的一些天然不会产生FPP。为了适于进行本发明的方法,必须将这些生物体转化以产生所述前体。如以上所述,用上述任何实施方案描述的用核酸修饰之前或者同时,可以将它们原样转化。

还可以使用分离的高等真核细胞代替完整生物体作为宿主以便在体内实施本发明的方法。适合的真核细胞可以是任何非人的细胞,但是优选是植物细胞或真菌细胞。

在此处所使用的多肽指的是包含于此处标明的氨基酸序列的多肽或肽片段、以及截短的或变体多肽,条件是它们保持如以上定义的P450单加氧酶活性,而且它们与相应的多肽共有至少所定义的百分比的同一性。

变体多肽的实例是由选择性mRNA剪接或如此处所述形成多肽蛋白酶剪切而得到的天然存在的蛋白质。可归因于蛋白水解的变体包括,例如,当在不同类型的宿主细胞中表达时,由于从本发明多肽上蛋白水解移除一个或多个末端氨基酸而导致在N-或C- 末端的差异。本发明也包含如后所描述的用由本发明核酸的天然或人工突变而获得的核酸编码的多肽。

由在氨基和羧基末端融合另外的肽序列而产生的多肽变体也可用于本发明的方法。尤其是这样的融合可以提高多肽的表达,在期望的环境或表达系统中利于蛋白质的提纯或多肽酶活性的改善。这种另外的肽序列可以例如是信号肽。因此,本发明包含使用变体多肽的方法,例如通过与其它寡肽或多肽融合而获得的多肽和/或与信号肽连接的多肽。在本发明的方法中有利地还可以使用由与另外的功能蛋白(例如来源于萜生物合成路径的另外的蛋白质)融合而产生的多肽。

此处所使用的多肽指的是含有此处确定的氨基酸序列的多肽或肽片段、以及截短的或变体多肽,条件是它们保持如上定义的活性。

变体多肽的实例是由选择性mRNA剪接或此处所述形成多肽蛋白酶剪切而得到的天然存在的蛋白质。可归因于蛋白水解的变体包括,例如,当在不同类型的宿主细胞中表达时,由于从本发明多肽上蛋白水解去移除一个或多个末端氨基酸而导致在N-或C- 末端的差异。本发明还包含用如后所描述的由本发明核酸的天然或人工突变而获得的核酸编码的多肽。

由另外的肽序列在氨基和羧基末端融合而产生的多肽变体也包含于本发明的多肽之中。尤其是这样的融合可以在期望的环境或表达系统中提高多肽的表达,利于蛋白质的提纯或多肽酶活性的改善。这种另外的肽序列可以例如为信号肽。因此,本发明包括本发明的多肽的变体,例如通过与其它寡肽或多肽融合而获得的多肽变体和/或与信号肽连接的那些多肽变体。本发明的多肽还可以包括由与另外的功能蛋白(例如来源于萜生物合成路径的另外的蛋白质)融合而产生的多肽。

本发明的核酸可以定义为包括单链或双链形式的脱氧核糖核苷酸或核糖核苷酸聚合物(DNA和/或RNA)。术语“核苷酸序列”也应该被理解为包括分离片段形式或作为较大核酸组分的聚核苷酸分子或寡核苷酸分子。本发明的核酸还包含某些分离的核苷酸序列,其包括基本上不污染内源材料的那些核苷酸序列。本发明的核酸可以是截短的,条件是其编码本发明包含的如上所述的多肽。

另一个用于转化适于在体内实施本发明方法的宿主生物体或细胞的重要工具是包括本发明任何实施方案的核酸的表达载体。因此这种载体也是本发明的目的之一。

如此处所使用的“表达载体”包括任何直线的或环状的重组载体,其包括但不限于病毒载体、噬菌体和质粒。技术人员能够根据表达系统选择适合的载体。在一个实施方案中,该表达载体包括本发明的核酸,其可操作地连接到至少一种控制转录、翻译、开始和终止的调节序列,如转录的启动子、操纵子或增强子,或mRNA核糖体的结合部位,并且非强制选择地包括至少一种选择标记。当该调节序列功能上与本发明的核酸相关时,该核苷酸序列是“可操作地连接的”。

本发明的表达载体可在如下进一步公开的用于在包藏本发明核酸的宿主生物体和/或细胞中制备遗传转化的宿主生物体和/或细胞的方法和生产或制造具有P450单加氧酶活性的多肽的方法中使用。

经转化以包藏本发明至少一种核酸以便使其异源表达或过表达本发明至少一种多肽的重组非人宿主生物体和细胞也是实施本发明方法的十分有用的手段。因此,这种非人宿主生物体和细胞是本发明的另一个目的。

根据任何上述实施方案的核酸可用于转化该非人宿主生物体和细胞并且所表达的多肽可以是任何上述的多肽。

适合于在体内进行本发明方法的非人宿主生物体可以是任何非人的多细胞或单细胞生物体。在优选的实施方案中,非人宿主生物体是植物、原核生物或真菌。可以使用任何植物、原核生物或真菌。特别地,可使用的植物是天然产生高数量萜的植物。在更优选的方案中,该植物选自茄科(Solanaceae)、禾本科(Poaceae)、十字花科(Brassicaceae)、碟形花科(Fabaceae)、锦葵科(Malvaceae)、菊科(Asteraceae)或唇形科(Lamiaceae)。例如,该植物选自烟草属 (Nicotiana)、茄属(Solanum)、高粱属(Sorghum)、拟南芥属(Arabidopsis)、芸苔属(Brassica(油菜))、苜蓿属(Medicago(紫花苜蓿))、棉属(Gossypium(棉花))、蒿属(Artemisia)、鼠尾草属(Salvia)和薄荷属 (Mentha)。优选,该植物属于烟草(Nicotiana tabacum)种。

在更优选的实施方案中,用于在体内进行本发明方法的非人宿主生物体是微生物。可以使用任何微生物,但是根据更加优选的实施方案,所述微生物是细菌或酵母。最优选,所述细菌是大肠杆菌(E.coli),所述酵母是酿酒酵母(Saccharomyces cerevisiae)。

分离的高等真核细胞还可以经转化以代替完整生物体。作为高等真核细胞,可以是除酵母细胞外的任何非人的真核细胞。特别优选的高等真核细胞是植物细胞或真菌细胞。

术语“经转化”是指宿主经过遗传工程以便使其含有上述任何实施方案中所需要的每个核酸的一个、两个或更多个拷贝。优选地,术语“经转化”涉及异源表达由该核酸编码的多肽(该多肽使用所述核酸转化)以及过表达所述多肽的宿主。因此,在实施方案中,本发明提供了经转化的生物体,其中该多肽的表达量高于未经如此转化的相同生物体的表达量。

现有技术中已知有多种方法用于生成转基因宿主生物体或细胞,如植物、真菌、原核生物,或高等真核生物体的细胞培养物。适用于细菌、真菌、酵母、植物和哺乳动物细胞宿主的合适的克隆和表达载体的描述参见例如Pouwels等的Cloning Vectors:A LaboratoryManual,1985,Elsevier,New York和Sambrook等的Molecular Cloning:A LaboratoryManual,第二版,1989,Cold Spring Harbor Laboratory Press。尤其是用于高等植物和/或植物细胞的克隆和表达载体是技术人员可以得到的。参见例如Schardl等的Gene 61:1-11,1987。

转化宿主生物体或细胞以使其包藏转基因核酸的方法为技术人员所熟知。对于生产转基因植物,例如通用的方法包括:植物原生质体的电穿孔法、脂质体介导的转化法、土壤杆菌介导的转化法、聚乙二醇介导的转化法、粒子轰击法、植物细胞的显微注射法和应用病毒的转化法。

在一个实施方案中,转化的DNA被整合到非人宿主生物体和 /或细胞的染色体中,从而得到稳定的重组体系统。现有技术中任何公知的染色体整合方法可以应用于本发明的实践中,包括但不限定为重组酶介导的盒式交换(RMCE)、病毒位点特异性染色体插入法、腺病毒法和核内注射法。

“多肽变体”这里指的是这样一种多肽,其具有如上所述的活性且基本上与根据任何上述实施方案的多肽同源,但是其具有的氨基酸序列因为一处或多处缺失、插入或取代而不同于由本发明任一核酸所编码的氨基酸序列。

变体可以包括保守取代序列,意味着某一特定氨基酸残基被一具有类似理化特性的残基所取代。保守取代的实例包括:用一个脂肪族残基取代另一个脂肪族残基,例如,Ile、Val、Leu或Ala 之间相互取代;或者用一个极性残基取代另一个极性残基,例如在Lys和Arg之间、Glu和Asp之间,或Gln和Asn之间相互取代。参见Zubay,Biochemistry,1983,Addison-Wesley Pub.Co.。这类取代的效果可以用取代打分矩阵,如Altschul,J.Mol.Biol.,1991,219, 555-565中所述的PAM-120、PAM-200和PAM-250来计算。其他这类保守取代,例如具有近似疏水特性的完整区域的取代,已为本领域熟知。

天然生成的肽变体也包含在本发明范围之内。这种变体的实例是由于选择性mRNA剪接或者本文所述多肽的蛋白酶剪切而得到的蛋白质。蛋白水解引起的变体包括,例如,在不同类型宿主细胞中表达时,由本发明序列编码的多肽由于蛋白水解移除了一个或多个末端氨基酸而导致N-或C-末端的差异。

本发明多肽的变体可以用于获得例如酶活性期望的增强或减弱、区域化学(regiochemistry)或立体化学的修饰、或者底物利用或者产物分配的改变、对于底物的亲和力的增加、一种或多种想得到化合物的产量的改进的特异性、酶反应速率的增加、在特定环境(pH、温度、溶剂等等)中的更高活性或稳定性、或者在想要的表达系统中的改进的表达水平。变体或定位突变体可以通过现有技术已知的任何方法来生成。天然多肽的变体和衍生物能够通过分离其它或相同植物品系或物种(例如来自檀香属物种的植物)的天然产生的变体或者变体的核苷酸序列来获得,或者通过对编码本发明多肽的核苷酸序列进行人工规划突变来获得。天然氨基酸序列的改变可通过多种传统方法中的任一种完成。

由附属肽序列在本发明多肽的氨基和羧基末端融合产生的多肽变体可被用于增强多肽的表达,在期望的环境或表达系统中利于蛋白的纯化或提高多肽的酶活性。这种附属肽序列例如可为信号肽。因此,本发明包含本发明多肽的变体,例如通过与其它寡肽或多肽和/或连接到信号肽上的多肽融合获得的那些变体。包含在本发明范围内的融合多肽还包括由融合其它功能蛋白质如来自萜生物合成路径的其它蛋白质而产生的融合多肽。

本发明生产出的上述醇可以通过提取而从在自然界中产生的醇中分离,例如使用公知的方法来提取(例如,从檀香木中提取)。本发明产生出的醇用作可用于香料的芳香化合物。

aaCPR 黄花蒿(Armisia annua)细胞色素P450还原酶

Bp 碱基对

Kb 千碱基

DNA 脱氧核糖核酸

cDNA 互补DNA

ClASS 黄皮(Clausena lansium)(+)-α-檀香萜合酶

CPRm 椒样薄荷(Mentha piperita)细胞色素P450还原酶

DTT 二硫苏糖醇

EDTA 乙二胺四乙酸

FPP 法呢基焦磷酸

GC 气相色谱

IPTG 异丙基-D-硫代半乳糖苷

LB 溶菌肉汤

MS 质谱

MTBE 甲基叔丁基醚

PCR 聚合酶链式反应

RMCE 重组酶介导的盒式交换

RNA 核糖核酸

mRNA 信使核糖核酸

SaSAS 檀香树(Santalum album)(+)-α-檀香烯/(-)-β-檀香烯合酶

下述实施例仅为示例,并不意味着限制本发明的发明内容、说明书或权利要求书中所规定的范围。

实施例

优化的CYP71AV8 cDNA序列在细菌中的表达

CYP71AV8的膜锚定区域被重新设计以引入如下所示的修饰。

在优化的CYP71AV8序列中,修饰5’-末端以将膜锚定区域的首个氨基酸替代为显示出提升在细菌细胞中膜结合P450s的异源表达性的多肽序列(Alkier,B.A.etal.Arch.Biochem.Biophys.322,369-377 (1995),Haudenschield,et alArch.Biochem.Biophys.379,127-136(2000))。此外,作为整个cDNA,密码子的使用适于匹配大肠杆菌(E.coli)密码子的使用。因此,多个cDNA’被设计为与CYP71AV8不同的3’末端修饰和优化:

-CYP71AV8-65188:在此结构中,前22个密码子被用于编码MALLLAVFWSALIILV肽的序列代替(SEQ ID NO 3和4)。

-CYP71AV8-P2:整个锚定编码序列被源于薄荷 (Haudenschield,et alArch.Biochem.Biophys.379,127-136(2000)中的PM2) 的优化的柠檬烯羟化酶的锚定序列代替(SEQ ID NO 5和6)。

CYP71AV8-P2O:该结构编码与上述结构相同的蛋白质,但膜锚定区域更进一步进行了密码子优化(SEQ ID NO 7和8)。

在图1中,比较不同的CYP71AV8变体的N-末端区域的氨基酸序列,在图2中,比较三种结构的DNA序列。该三种优化的 CYP71AV8 cDNA在体内合成(DNA2.0,Menlo Park,CA,USA),并且克隆为NdeI-HindIII片段插入于pCWori表达质粒(Barnes,H.J. MethodEnzymol.272,3-14;(1996))。

在细菌细胞中CYP71AV8的功能性表达

对于异源表达,将CYP71AV8表达质粒转化到JM109大肠杆菌细胞(实施例1)。将转化体的单菌落用在含50μg/mL氨苄青霉素的5毫升LB培养基的接种培养物上。在37℃使细胞生长10~12 小时。然后将培养物接种到添加了50μg/m氨苄青霉素和1mM硫胺盐酸盐的250毫升TB培养基(极品肉汤)中。该培养物在28℃以适当摇动(200rpm)下温育3~4小时,之后添加75毫克/升δ-氨基乙酰丙酸(σ)和1mM IPTG(异丙基-β-D-1-硫代半乳糖苷),然后在 28℃以200rpm摇动下将培养物保持24~48小时。

P450酶的表达可以定性评估,并通过大肠杆菌的蛋白组成的 CO结合光谱(Omura,T.&Sato,R.(1964)J.Biol.Chem.239,2379-2387)定量测量。对于蛋白提取,将细胞离心(10分钟,5000g,4℃)并在 35毫升冰的缓冲液1(100mM的Tris-HCl,pH 7.5,20%甘油,0.5mMEDTA)中重悬。添加1体积的于水中的0.3mg/ml溶菌酶 (Sigma-Aldrich),并在4℃搅拌该悬浮液10~15分钟。在4℃将悬浮液于7000g离心10分钟,将粒状物在20ml缓冲液2(25mMKPO

按照此步骤,测定了重组CYP71AV8在450nm处具有最大吸光度的典型的CO光谱,验证了正确折叠形成有功能P450酶。

在细菌中CYP71AV8与植物P450还原酶的共表达

为了重组植物p450的活性,第二膜蛋白的存在是必要的。该蛋白,即P450还原酶(CPR),参与到将电子从辅助因子NAPDH(还原形式的烟酰胺腺嘌呤二核苷酸磷酸)转移到P450活性位点。已经表明来自一种植物的CPR可以完善来自另一植物的P450酶的活性(Jensen和Moller(2010)Phytochemsitry 71,132-141)。多个编码 CPRDNA序列已被报道来自不同的植物。我们首先选择一个分离自椒样薄荷(Mentha piperita)(CPRm,未公开数据,SEQ ID NO 10) 的CPR,优化全长cDNA的密码子使用(SEQ ID No 9),并且将其克隆至pACYCDuet-1表达质粒(Novagen)的NcoI和HindIII限制性位点以提供质粒pACYC-CPRm。

使用pCWori-CYP71AV8-65188和pACYCDuet-CPRm这两种质粒在大肠杆菌细胞中共表达CYP71AV8和CPRm。将这两种质粒共转化到BL21 Star

采用表达CYP71AV8的大肠杆菌细胞进行的(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反-香柠檬烯和(+)-表-β-檀香萜的生物转化

如上所述采用被改造成根据异源甲羟戊酸途径生产法呢基二磷酸(FPP)并表达植物的倍半萜合酶的大肠杆菌细胞来制备在生物转化测定中用作底物的不同的倍半萜烃类。该大肠杆菌宿主细胞的改造以及应用如专利WO2013064411或Schalk et al(2013)J.Am.Chem.Soc.134,18900-18903所述。简而言之,制备表达质粒使其含两个操纵子,该操纵子由编码完整的甲羟戊酸途径的基因的酶组成。体外合成第一合成的操纵子(DNA2.0,MenloPark,CA,USA),该操纵子由大肠杆菌乙酰乙酰辅酶A硫解酶(atoB)、金黄色葡萄球菌的HMG-CoA合酶(mvaS)、金黄色葡萄球菌HMG-CoA还原酶 (mvaA)和酿酒酵母FPP合酶(ERG20)基因组成,并将其连接到用 NcoI-BamHI消化的pACYCDuet-1载体(Invitrogen)上,得到 pACYC-29258。包含甲羟戊酸激酶(MvaK1)、磷酸甲激酶 (MvaK2)、甲羟戊酸磷酸脱羧酶(MvaD)和异戊烯基二磷酸异构酶(idi)的第二操纵子扩增自肺炎链球菌(ATCCBAA-334)的基因组 DNA,并且将该第二操纵子连接到pACYC-29258的第二多克隆位点以提供质粒pACYC-29258-4506。因此,这种质粒包含编码引起乙酰辅酶A变为FPP的生物合成途径的任何酶的基因。质粒 pACYC-29258-4506与质粒pET101-Cont2_1(包含编码黄皮 (Clausena lansium)(+)-α-檀香萜合酶(ClASS)的cDNA,WO2009109597) 或者质粒pETDuet-SCH10-Tps8201-opt(包含编码檀香树(Santalum album)(+)-α-檀香萜/(-)-β-檀香萜合酶(SaSAS)的cDNA,WO2010067309)共转化到大肠杆菌细胞(BL21 Star

通过在使用上述作为底物列出的倍半萜分子的大肠杆菌中的生物转化来评价CYP71AV8的酶活性。如实施例3所述那样培养并获取用pACYCDuet-CPRm和pCWori-CYP71AV8-65188转化的 BL21 Star

在这些条件下,观察到(+)-α-檀香萜的氧化。转化的主要产物是(E)-α-檀香醇。检测到通过大肠杆菌内源性酶从(E)-α-檀香醇的转化而成的其他产物:(E)-α-檀香醛(通过醇脱氢酶而产生)和 (E)-α-二氢檀香醇(通过烯酸还原酶而产生)(图3A)。相似的,使用(+)-α-檀香萜或(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反-香柠檬烯和(+)-表-β-檀香萜的混合物作为底物,观察到(E)-α-檀香醇、 (E)-β-檀香醇,(E)-α-反-香柠檬醇和(E)-表-β-檀香醇的形成,也获得了更进一步的代谢产物(图3B)。本实施例表明CYP71AV8可用于(+)-α-檀香萜、(-)-β-檀香萜以及相似结构的分子的末端氧化。

从单种质粒中构建合成操纵子以共表达CYP71AV8和CPR

数个双顺反子操纵子被设计为在特殊的启动子的控制下从单种质粒中表达P450酶和CPR。优化的CYP71AV8 cDNAs的三种变体(实施例1)与两种CPR cDNAs结合:密码子优化的CPRm cDNA(实施例2)以及用于编码黄花蒿(Artemisia annua)CPR(NCBI 登录号ABM88789.1,SEQ ID No 12)的密码子优化的cDNA(Seq ID No 11)。因此,设计出六种结构(Seq ID No 13-18),每种结构均包含P450 cDNA,接下来是包括核糖体结合位点的接头序列(RBS) 和CPR cDNA(图4)。该结构通过PCR来制备:P450和CPR cDNAs 是分别扩增的,并具有5’和3’突出端(overhang),其适于在pCWori+ 质粒的NdeI-HindIII位点中采用In-

为了评价不同N末端修饰对P450s和与CPRs耦合的影响,将六种质粒转入到大肠杆菌BL21 Star

在工程细胞中体内生产含氧倍半萜

(+)-α-檀香萜和(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反-香柠檬烯、(+)-表-β-檀香萜或其它相似结构的分子的氧化产物也可直接产自被改造成从比如葡萄糖或甘油的碳源来生产倍半萜的大肠杆菌细胞。制备出包含由P450、CPR和萜类合成酶构成的合成操纵子的pCWori+质粒构成的质粒(Barnes H.J(1996)Method Enzymol.272, 3-14)。作为P450,采用CYP71AV8-P2或CYP71AV8-P2O cDNA,作为萜类合成酶,采用黄皮(Clausena lansium)(+)-α-檀香萜合酶 cDNA(ClASS)(WO2009109597)或编码檀香树(Santalum album)(+)-α-檀香萜/(-)-β-檀香萜合酶(SaSAS)的cDNA (WO2010067309)。采用下述工序来构造出四种质粒。设计并合成出ClASS cDNA的密码子优化版本(SEQ ID NO 19-20)(DNA 2.0),并且克隆到pETDUET-1质粒(Novagen)的NdeI-KpnI位点以提供质粒pETDuet-Tps2opt。作为SaSAS,设计出优化的全长cDNA (SEQ ID NO 21-22),合成并克隆到pJexpress414质粒(DNA2.0)以提供质粒pJ414-SaTps8201-1-FLopt。设计出各种结构引物,用于采用In-

将四种质粒中的任一种与带来完整的甲羟戊酸途径的质粒 pACYC-29258-4506共转化到大肠杆菌BL21 Star

在生物转化实验中观察到所有所得的菌株产生出倍半萜烃类以及相应的含氧产物(图6)。该实验显示出使用表达CYP71AV8 的工程细胞,可生产出倍半萜(E)-α-檀香醇、(E)-β-檀香醇和其它相似结构的分子。

使用CYP71AV8变体以生产(E)-α-檀香醇和(E)-β-檀香醇

根据上述实施例,我们示出了CYP71AV8对(+)-α-檀香萜和 (-)-β-檀香萜的“末端反式碳”具有高度选择性,并且专门生产(E)-α- 檀香醇、(E)-β-檀香醇。在本实施例中,我们描述了一种定点诱变的方法来修饰CYP71AV8酶活性,以便产生(Z)-α-檀香醇和(Z)-β-檀香醇。首先选出L358作为控制酶活性的活性位点残基。 CYP71AV8的一系列变体通过将用作编码L358的密码子替代为用于编码其它氨基酸的密码子从而生产出。通过两步PCR工序来导入突变,该两步PCR工序使用简并寡核苷酸(包含NBT(N=A,C,G,T; B=C,G,T)密码子以替代L358编码密码子)和特异性寡核苷酸的组合。此种寡核苷酸的组合允许将L358编码密码子变为编码包括所有具有疏水性侧链的氨基酸的其它12种残基的密码子。采用诱变反向引物AV8-L358-rev (5'-CACGCGGCATCACCAGCGGAVNCGGCGGATGCAGGCGCAGGGTTTCTTTAATC-3')和引物AV8-pcw-fw(5’- CATCGATGCTTAGGAGGTCATATGGCTCTGTTATTAGCAG-3’) 实施第一步PCR以扩增cDNA的5’部分。采用引物AV8-L358-fw (5'-TCCGCTGGTGATGCCGCGTGAGTGC-3')和AV8-CPR-rev(5'- ATATATCTCCTTCTTAAAGTTAGTCGACTCATTAGGTG-3')来扩增第二PCR产物。对于这两步扩增,均采用 pCWori-CYP71AV8-P2-CPRm-ClASS作为模板。第二轮扩增采用上述两种PCR产物作为模板和引物AV8-L358-fw+ AV8-CPR-rev,并且允许扩增全长CYP71AV8变体cDNAs。所有 PCR反应均可按照生产商的指导而采用PfuUltra II融合HS DNA 聚合酶(Stratagene)。可采用Gibson装配预混液(NewEngland Biolabs),将经修饰的cDNA连接到通过NdeI-SalI消解的 pCWori-CYP71AV8-P2-CPRm-ClASS。最终结构通过测序而被控制,并且可为各种期望的CYP71AV8变体选择一种质粒克隆体。也可通过将Leu358替代为Ala、Phe、Thr、Ser、Val、Gly、Ile、Met、 Pro、Tyr、Trp和Arg来生产出其它12种变体(SEQ ID NO 27~50)。

采用如实施例6所述的体内倍半萜生产方法来进行各种 CYP71AV8变体的评价。简单来说,含有任一种CYP71AV8变体 cDNA、CPRm cDNA和ClASS cDNA的pCWori+质粒与pACYC-29258质粒一起共转化到KRX大肠杆菌细胞(Promega)中。如实施例6所述的那样选择并培养经转化的细胞,并且评价倍半萜的生产。如图7所示,与野生型P450酶相比,生产出除了反式氧化产物的某些变体(Z)-α-檀香醇。对于各变体,通过将所生产的 (Z)-α-檀香醇的总量除以含氧α-檀香萜衍生物的总量来计算顺反式氧化率。各种变体的此种计算的结果示于如下表1:

表1.CYP71AV8野生型酶的区域选择性以及α-檀香萜的氧化的活性位点变体

表1中所示的上述数据表现出,CYP71AV8可被改造并用于生产(Z)-α-檀香醇。特别是L358T、L358S、L358A和L358F变体可被用作以高达46%的顺式末端碳的选择性进行的(+)-α-檀香萜的末端氧化.

在类似方法中,评价了CYP71AV8变体的(Z)-β-檀香醇的生产。通过将上述质粒中的ClASS cDNA替代为SaSAS cDNA来制备新的质粒。因此,可通过限制酶HindIII和EcoRI来消解质粒 pCW-CYP71AV8-L358F-CPRm-ClASS,从而去除ClASS cDNA。同时,可通过相同的酶来消解pCWori-CYP71AV8-P2-CPRm-SaSAS,从而恢复SaSAS的cDNA兼容粘性末端。采用T4DNA连接酶 (NEW England Biolabs)将线性化载体与消解的插入段连接。如上所述,使用如此获得的质粒用于在相同条件下在大肠杆菌细胞中进行的含氧倍半萜的体内生产。图8示出通过CYP71AV8-L358F 形成的产物的分析的GCMS概况,显示出经修饰的CYP71AV8酶也可用于生产(Z)-β-檀香醇。

CYP71AV族的其它成员的评价

通过具有檀香萜骨架的倍半萜的氧化来评价CYP71AV1 (NCBI登录号ABB82944.1)。制备出结构类似于实施例5的质粒的质粒:设计出包含用于编码N-末端经修饰的CYP71AV1蛋白质 (SEQ ID NO 53和54)的优化的cDNA和aaCPR cDNA(实施例5) 的双顺反子操纵子,并且进行体内合成(DNA2.0),并作为双顺反子操纵子而克隆至pCWori+质粒。上述质粒用于转化KRX大肠杆菌细胞(Promega)。如实施例3那样培养经转化的细胞并且引起蛋白质表达。如实施例4那样进行使用(+)-α-檀香萜作为底物的生物转化实验。如图9所示,可获得与CYP71AV8相同的产物(即(E)-α- 檀香醇和(E)-α-檀香醛),这显示出CYP71AVP450族的其它成员也可被用于檀香萜的末端氧化。

使用CYP71AV1,制备出包含CYP71AV1 cDNA、aaCPR和 (+)-α-檀香萜合酶cDNA(ClASS)的合成操纵子。通过NdeI和 HindIII来消解包含CYP71AV8-P2-CPRm-ClASS操纵子的 pCWori+质粒(实施例6),从而切除P450编码cDNA。同时,如上一段中所述的那样,以相同的酶来消解从而从双顺反子操纵子恢复出CYP71AV1 cDNA,采用T4 DNA连接酶(NEWEngland Biolabs)而连接至经消解的上述pCWori质粒,产生质粒 pCWori-CYP71AV1-CPRm-ClASS。该质粒与质粒 pACYC-29258-4506被用于共转化大肠杆菌BL21 Star

P450-BM3(CYP102A1)突变体库的构建

通过将五种疏水性氨基酸(丙氨酸,缬氨酸,苯丙氨酸,亮氨酸和异亮氨酸)系统地合并到靠近P450-BM3的血红素的中心的两个位置,从而构建出24种变种的P450-BM3突变体库。改变这两个氨基酸的侧链大小已经显示出可显著改变紧靠在血红素基团的底物结合腔的形状(Appl Microbiol Biotechnol 2006,70:53;Adv Synth Catal 2006,348:763)。已知晓第一热点(Phe 87)改变底物的特异性和区域选择性,同时,也已预测到第二位置(Ala328)在氧化时与所有底物相互作用(ChemBiochem 2009,10:853)。可采用 QuickChange

α-檀香萜:P450-BM3库的体外筛查

如先前报道的那样,将24种P450-BM3突变体和酶的野生型版本异源表达到大肠杆菌BL21(DE3)细胞中(Adv.Synth.Catal.2003, 345:802)。简单来说,将经转化细胞的单菌落用于接种到2毫升的添加了30μg/ml卡那霉素的LB培养基中,并且伴随着轨道振荡(150rpm)在37℃下生长,直至OD

如实施例4所述,制备在生物转化实验中用作底物的α-檀香萜。转化在包含~0.5μM CYP酶、2%(v/v)DMSO和0.2mMμ-檀香萜底物的1ml的50mM磷酸钾缓冲液中进行。通过加入0.1mM NADPH开始反应,并且伴随着温和摇动在室温下进行22小时.

然后,在装有FS-Supreme-5色柱(30m×0.25mm×0.25μm)的 GC/MS QP-2010仪器(Shimadzu,Japan)上分析样品,氦作为载气 (流速:0.68毫升/分钟;线速度:30厘米/秒)。使用电喷雾离子化来收集质谱。注射器温度设定为250℃。色柱烘箱设为50℃ 1min, 然后以30℃/min的速度升温到170℃,随后以5℃/min的速度升温到185℃,维持等温3min,然后以5℃/min的速度升温到200℃,然后以30℃/min的速度升温到300℃,最后维持等温1min。

P450-BM3库的α-檀香萜体内筛查

同样使用被改造成从简单碳源生产(+)-α-檀香萜的细菌菌株以在体内筛查P450-BM3突变体库。为此,将实施例4的FPP-高产菌株用含有源于黄皮(Clausena lansium)(ClASS)(WO2009109597) (SEQ ID No 19和20)的密码子优化版本(+)-α-檀香萜合酶的pETDuet-1质粒进行转化,并且分别将各P450-BM3变体克隆到载体的第一和第二多克隆位点(MCS)。或者,将(+)-α-檀香萜合酶 cDNA克隆至pET101表达质粒(Novagen)中,并且将来自库的各 P450-BM3突变体克隆到pCDFDuet-1载体(Novagen)中。所得到的重组载体被共转化到FPP-高产菌株中。

将经转化细胞的单菌落用于接种到5毫升的添加了合适抗生素的LB培养基中。以250rpm在37℃下过夜温育培养物。第二天,将200μl的过夜培养物接种到添加了3%甘油、1mM盐酸硫胺素(Sigma-Aldrich,St Louis,MI)和75μg/Lδ-氨基乙酰丙酸 (Sigma-Aldrich)的2mL的极品肉汤(TB)培养基中,并且以250rpm 在37℃下温育。在4~6小时的培养后(或当培养物在600nm的光密度达到2~3的值时),将培养物冷却到28℃,并且通过0.1mM IPTG来引起蛋白质表达。在此时,将10%(v/v)的十二烷加入到生长培养基。在伴随轨道振荡(250rpm)温育48h后,通过1体积的甲基叔丁基醚(MTBE)来对细胞培养基进行两次萃取,并且通过 GC/MS分析溶剂萃取物。在装有DB1色柱(30m x 0.25mm x 0.25mm膜厚;Agilent)以及5975系列质谱仪的Agilent 6890系列 GC系统中进行GC/MS。载气为恒定流速1毫升/分钟的氦气。注射处于无分流模式,并且将进样口温度设定在250℃,烘箱温度设定为以10℃/min的速度从50℃到225℃,然后以20℃/min的速度升到320℃。基于保留指数的一致性和可信标准的质谱来确认产物的身份。

P450-BM3突变体库的体外(实施例10)与体内筛查提供了可比较的结果,将该结果归纳于表2。P450-BM3野生型(SEQ ID No 55和56)没有显示出(+)-α-檀香萜有任何可检测到的活性,而6种P450-BM3变体能将α-檀香萜转换为所期望的α-檀香醇。这些变体揭示出对于(+)-α-檀香萜的顺式末端碳的氧化的45%~96%的优选性。单一突变体#23(A328V)(SEQID No 67和68)和双突变体#7 (F87I/A328I)(SEQ ID No 57和58)、#17(F87V/A328I)(SEQID No 59和60)和#18(F87V/A328L)(SEQ ID No 61和62)显示出在 72%~96%范围内的最高的区域选择性(表2和图10)。两种附属变体#19(F87V/A328V)(SEQ ID No 63和64)和#20(F87V/A328F)(SEQ ID No 65和66)对于顺式羟基化的选择性较差 (45%~50%的范围),并且会生成附属氧化产物。

表2.通过P450-BM3变体的α-檀香萜向α-檀香醇的转化

上述结果表明:P450-BM3活性位点突变能够使非天然底物 (+)-α-檀香萜结合。选定的P450-BM3变体引入了这些突变体表明:有选择地使(+)-α-檀香萜的顺式末端碳羟基化,从而生产嗅觉明显的化合物(Z)-α-檀香醇(图10)。

使用P450-BM3双突变体的(Z)-α-檀香醇、(Z)-β-檀香醇、 (Z)-α-反-香柠檬醇和(Z)-表-β-檀香醇的体内生产

测试在α-檀香萜筛查中确定的一种P450-BM3变体(变体#17;表2)的氧化由(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反-香柠檬烯和(+)- 表-β-檀香萜构成的倍半萜烃的檀香油状混合物的能力。为此,将实施例4所述的FPP-高产细菌菌株用包含编码檀香树(Santalumalbum)(+)-α-檀香萜/(-)-β-檀香萜合酶(WO2010067309)(SEQ ID No 21和22)的密码子优化cDNA的重组pETDuet-1表达载体转化到第一MCS中,并且将P450-BM3变体#17转化到第二MCS中。细胞生长、引发条件、培养物提取和产物分析基本上如实施例11 中记载的那样进行。

如图11所示,可通过P450-BM3双突变体将(+)-α-檀香萜、 (-)-β-檀香萜、(-)-α-反-香柠檬烯和(+)-表-β-檀香萜高效地氧化,从而生产(Z)-α-檀香醇、(Z)-β-檀香醇、(Z)-α-反-香柠檬醇和(Z)-表-β- 檀香醇。值得一提的是,在该实验条件下,只能检测到倍半萜醇的所期望的顺式异构体。这些数据显示出巨大芽孢杆菌(Bacillus megaterium)CYP102A1(P450-BM3)可高效地改造以选择性地使 (+)-α-檀香萜、(-)-β-檀香萜以及其它结构相关的萜烯(比如香柠檬烯倍半萜)的顺式末端碳羟基化,从而生产出发现于檀香油中的关键的倍半萜醇。

从B&T World Seeds(Aigues-Vives,France)和Sandeman Seeds (Lalongue,France)获得檀香树的种子。通过2.5%的次氯酸(HCIO)对种子进行120分钟的第一次表面灭菌,并且在灭菌超纯水中洗涤三次。然后将种子去壳并置于添加了15克/升的蔗糖和7.8g/L 的琼脂、pH值5.7的MS基础培养基上(Murashige&Skoog,1962, PhysiologiaPlantarum 15,473-497)。在9~18天后,通常观察到大约40%的发芽率。在发芽后5~10周,将从无菌发芽的种子中获得的檀香幼苗转移到土壤中。由于檀香品种是根半寄生物,因此土壤适于近距离接触6个月~1岁龄的柑桔(甜橙Citrus sinensis)植物。收获檀香植物的根,并且将其转移至土壤中2-3年后,从宿主植物的根处分离。这些根的提取物的GC-MS分析显示出檀香油所特有的倍半萜的存在。采用Concert

采用Illumina总RNA测序技术和Illumina HiSeq 2000测序仪来对整体转录物进行测序。产生出108.7百万个配对read 2×100 bp。采用CLC-生物基因组工作平台的DeNovo整合应用程序 (CLCBo,Denmark)来整合测定序列。总共将平均长度683bp的 82’479个contig整合。采用tBlastn algorithm来检索这些contig (Altschul et al,J.Mol.Biol.215,403-410,1990),并且用例如CYP71AV1 序列(NCBI登录号ABB82944.1)的已知P450氨基酸序列作为查询序列。本方式允许识别编码有特征性细胞色素P450基序的蛋白质的数个contig。某个选定的contig SCH37-Ct816(SED ID NO 69) 包含编码500个氨基酸的蛋白SaCP816(SEQ ID NO 71)的1503bp 长度的开放阅读框架(ORF)(SEQ ID NO 70)。该氨基酸显示出与已知的细胞色素P450序列(最为接近的源自欧亚种葡萄(Vitisvinifera) 的P450,CYP71D 10(NCBI登录号AAB94588.1))具有同源性,具有62%的氨基酸序列同一性。

作为通过SCH37-Ct816编码的蛋白质的功能特性,该蛋白质在大肠杆菌细胞中异源表达。对ORF序列进行修饰来提高在大肠杆菌内的表达:用编码MALLLAVFWSALIILV肽的密码子来代替前17个密码子,并且对整个ORF序列的密码子使用进行修饰以便匹配大肠杆菌密码子使用。用于编码经修饰的SaCP816(SEQ ID NO 73)的本cDNA SaCP120293(SEQ ID NO72)在体外合成 (DNA2.0)并克隆到pJExpress404质粒(DNA2.0)中。如实施例2所述那样进行异源表达。

双顺反子操纵子被设计为在独特的启动子的控制下从单一的质粒中表达P450酶和CPR。优化的SaCP120293cDNA与CPRm cDNA(SEQ ID No 9,实施例3)结合以便制备顺序包含P450 cDNA、包括核糖体结合位点(RBS)的接头序列和CPRm cDNA的双顺反子结构(SEQ IDNO 74)。该结构是通过用PCR分别扩增 P450和CPR cDNAs来制备的,并具有5’和3’突出端,其适于在 pCWori+质粒(Barnes H.J(1996)Method Enzymol.272,3-14)的 NdeI-HindIII位点中采用In-

JM109大肠杆菌细胞用SaCP816-CPRm-pCWori表达质粒进行转化。经转化的细胞进行生长,并且如实施例2所述那样制备包含重组蛋白的无细胞提取物。该蛋白组分用于倍半萜分子的酶促转化的评价(实施例16)。

如实施例4所述那样制备在生物转化实验中用作底物的不同倍半萜烃。

将提取自表达重组SaCP816和CPRm蛋白(实施例15)的大肠杆菌细胞的粗蛋白用于这些倍半萜分子的体外氧化。在包含20~50 微升蛋白质提取物、500微M NADPH(还原了的烟酰胺腺嘌呤二核苷酸磷酸)、5微M FAD(黄素腺嘌呤二核苷酸)、5微M FMN (黄素单核苷酸)和300微M倍半萜(即(α)-檀香萜或(+)-α-檀香萜、 (-)-β-檀香萜、(-)-α-反-香柠檬烯和(+)-表-β-檀香萜的混合物)的1 mL的100mM Tris-HCL pH 7.4缓冲液中进行该实验。在聚四氟乙烯密封玻璃管中伴随温和搅拌温育2个小时后,在冰上停止反应,并通过1体积MTBE(甲基叔丁基醚,Sigma)来萃取。如实施例4所述那样通过GCMS来分析提取物。

在这些条件下,观察到了(+)-α-檀香萜、(-)-β-檀香萜、(-)-α- 反-香柠檬烯和(+)-表-β-檀香萜的氧化。图12表示通过SaCP816 对(+)-α-檀香萜进行氧化来提供(Z)-α-檀香醇的情况。图13表示通过SaCP816来氧化(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反-香柠檬烯和(+)-表-β-檀香萜生成(Z)-α-檀香醇、(Z)-β-檀香醇、(Z)-α-反-香柠檬醇和(Z)-表-β-檀香醇的情况。在所有实验中,均没有观测到可检测量的倍半萜醇的相应反式异构体(每种倍半萜醇的反式和顺式异构体在用于这些实验的色谱条件下均很容易被分离)。

本实验显示出,从檀香树(Santalum album)分离的细胞色素 P450酶SaCP816可用于选择性羟基化(+)-α-檀香萜、(-)-β-檀香萜和相似倍半萜结构的顺式末端碳。

(+)-α-檀香萜以及(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反-香柠檬烯、(+)-表-β-檀香萜或其它相似结构的分子的氧化产物均可直接产自被改造成从比如葡萄糖或甘油的碳源来生产倍半萜的大肠杆菌细胞。制备由包含合成操纵子的pCWori+质粒构成的质粒,该合成操纵子由SaCP120293cDNA(SEQ ID No 72)、CPRm cDNA (SEQ ID No 9)、萜类合成酶编码cDNA构成。作为萜类合成酶,使用黄皮(Clausena lansium)(+)-α-檀香萜合酶cDNA(ClASS) (WO2009109597)或用于编码檀香树(Santalum album)(+)-α-檀香萜 /(-)-β-檀香萜合酶(SaSAS)(WO2010067309)的cDNA。

通过与如实施例6所述的工序相似的工序来构建出两种质粒。如实施例6那样扩增密码子优化的(+)-α-檀香萜合酶cDNA(SEQ ID NO 19)和(+)-α-檀香萜/(-)-β-檀香萜合酶cDNA(SEQ ID NO 21),并采用In-

将这两种质粒中任一种与带来完整的甲羟戊酸途径的质粒 pACYC-29258-4506共转化到大肠杆菌XRX细胞(Promega)中进行这些操纵子的性能评价(实施例4)。从羧苄青霉素(50μg/ml)和氯霉素(34μg/ml)的LB-琼脂糖平板上选出经转化的细胞。使用单个菌落接种到补充有合适的抗生素的5毫升LB培养基中。将培养物以250rpm在37℃下温育过夜。第二天,将在玻璃培养管中包含 100μg/L羧苄青霉素和17μg/l氯霉素的2mL TB培养基接种到200μl LB预培养物,并且以250rpm并在37℃下温育。在6个小时的培养后(或当培养物在600nm的光密度达到值3时),将培养物冷却至20℃,并且通过添加0.1mM IPTG(异丙基β-D-1-硫代半乳糖苷)、δ-氨基乙酰丙酸(Sigma)和2%(v/v)的癸烷来引起蛋白质的表达。在伴随250rpm的摇动的温育48小时后,如实施例4所述,通过1体积的MTBE来提取整个培养液并通过GCMS 进行分析。

在体外实验中,也发现了所有所得的菌株生产出倍半萜烃和相应的含氧产物(图14)。该实验显示出使用表达SaCP816的工程细胞,可生产出倍半萜(Z)-α-檀香醇、(Z)-β-檀香醇和其它相似结构的分子。

如实施例13所述,在转录自檀香树(Santalum album)根中确认多种P450-编码contig序列。除SCH37-Ct816以外,选出另一 contig序列:SCH37-

作为通过SCH37-Ct10374编码的蛋白质的功能特性,该蛋白质在大肠杆菌细胞中异源表达。对ORF序列进行修饰来提高在大肠杆菌内的表达:用编码MALLLAVFWSALII肽的密码子来代替前18个密码子,并且对整个ORF序列的密码子使用进行修饰以便匹配大肠杆菌密码子使用。用于编码经修饰的SaCP10374(SEQ ID NO 81)的新型cDNA SaCP120292(SEQID NO 80)在体外合成 (DNA2.0)并克隆到pJExpress404质粒(DNA2.0)中。

如实施例2所述那样进行异源表达。按照此步骤,测定新型重组檀香树(S.abum)P450在450nm处具有最大吸光度的典型的CO 光谱,验证了正确折叠形成有功能P450酶。

为了重新构建该P450酶的活性,共表达P450还原酶。为此目的,以如实施例15所述的类似方式设计双顺反子操纵子,以便在独特的启动子的控制下从单个质粒中表达SaCP10374和CPRm (薄荷P450还原酶)。优化的SaCP12092 cDNA与CPRm cDNA 结合以便制备顺序包含P450 cDNA、包括核糖体结合位点(RBS) 的接头序列和CPRm cDNA的双顺反子结构(SEQ ID NO 82)。该结构是通过如实施例15所述的PCR制备的。并且将其克隆到 pCWori+质粒(Barnes H.J(1996)Method Enzymol.272,3-14)中以提供质粒SaCP10374-CPRm-pCWori。

JM109大肠杆菌细胞用这些双顺反子表达质粒进行转化。经转化的细胞进行生长,并且如实施例2所述那样制备包含重组蛋白的无细胞提取物。这些膜蛋白组分用于倍半萜分子的酶促转化的评价(实施例21)。

如实施例4所述那样制备在生物转化实验中用作底物的不同的倍半萜烃(即(α)-檀香萜或(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反- 香柠檬烯和(+)-表-β-檀香萜的混合物)。

将提取自表达重组SaCP10374和CPRm蛋白(实施例20)的大肠杆菌细胞的粗蛋白用于倍半萜分子的体外氧化,如实施例16所述那样进行实验。在聚四氟乙烯密封玻璃管中伴随温和搅拌而温育2个小时后,在冰上停止反应并通过1体积MTBE(甲基叔丁基醚,Sigma)来萃取。如实施例4所述那样通过GCMS来分析提取物。

在这些条件下,观察到了通过SaCP10374进行的(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反-香柠檬烯和(+)-表-β-檀香萜的氧化。图 15、16表示通过SaCP10374对(+)-α-檀香萜、(-)-β-檀香萜、(-)-α- 反-香柠檬烯和(+)-表-β-檀香萜进行氧化来提供(E)-α-檀香醇、 (E)-β-檀香醇、(E)-α-反-香柠檬醇和(E)-表-β-檀香醇的情况。在所有实验中,均没有观察到可检测量的倍半萜醇的相应的反式异构体(每种倍半萜醇的反式和顺式异构体在用于这些实验的色谱条件下均很容易被分离)。

本实验显示出,从檀香树(Santalum album)分离的细胞色素 P450酶SaCP10374可用于选择性羟基化(+)-α-檀香萜、(-)-β-檀香萜和相似倍半萜结构的顺式末端碳。

采用如实施例4的方法,制备出类似于檀香萜的多种倍半萜烃。采用包含编码檀香树(Santalum album)(-)-倍半香桧烯B合酶 (NCBI登录号ADP37190.1)SaTps647的cDNA或编码檀香树 (Santalum album)(-)-β-甜没药烯合酶(NCBI登录号ADP37189.1) SaTps30的cDNA中任一种的pETDuet表达质粒与如实施例4所述的pACYC-29258-4506质粒的结合,来生产(-)-倍半香桧烯B和 (-)-β-甜没药烯。从Bedoukian(Dambury,Ct,USA)获得β-法呢烯,从Treatt(Suffolk,UK)获得α-法呢烯,并且从柑橘油中提纯出(-)-α- 反-香柠檬烯。

将提取自表达重组SaCP816或SaCP10374的大肠杆菌细胞的粗蛋白连同CPRm蛋白(实施例15和20)用于倍半萜分子的体外氧化。如实施例16所述那样进行通过GCMS分析的实验和产物确定。

在这些条件下,观察到了(E)-β-法呢烯、(E)-α-法呢烯、(-)-倍半香桧烯B、(-)-β-甜没药烯和(-)-α-反-香柠檬烯的氧化(图17~21)。对于所有这些化合物,檀香树(S.album)P450s对于终端偕二甲基基团(图27中的R1或R2)的两个碳原子中的其中一个具有区域选择性。SaCP816催化在相对于末端双键的顺式位置(图27的R1)的甲基碳原子的选择性氧化,然而SaCP10374仅在相对于末端双键的反式位置(图27中的R2)的甲基基团的碳原子处催化相同底物的氧化。各倍半萜醇的反式和顺式异构体在用于这些实验的色谱条件下均很容易被分离。因大肠杆菌内源乙醇脱氢酶活性,当反式甲基氧化时,那么会形成相应的醛。

这些实验显示出,分离自檀香树(Santalum album)的细胞色素 P450酶、SaCP816和SaCP10374可分别用于选择性羟基化具有类似于β-法呢烯、α-法呢烯、(+)-α-檀香萜、(-)-β-檀香萜、(-)-α-反- 香柠檬烯、(-)-倍半香桧烯B或(-)-β-甜没药烯的结构的各种倍半萜分子的顺式末端和反式末端的碳。

如实施例21和22所述的氧化的倍半萜分子也可使用被改造成从比如葡萄糖或甘油的碳源生产倍半萜的大肠杆菌细胞中直接生产出。制备出由包含合成操纵子的pCWori+质粒构成的质粒,该合成操纵子由SaCP120293cDNA(SEQ ID No 72)或SaCP120292 (SEQ IDNo 80)、CPRm cDNA(SEQ ID No 9)、萜类合成酶编码 cDNA(编码黄花蒿(Artemisia annua)β-法呢烯合酶cDNA(NCBI登录号AAX39387.1.1)、云杉(Picea abies)α-法呢烯合酶(NCBI登录号AAS47697.1)、檀香树(S.album)(-)-倍半香桧烯B(NCBI登录号ADP37190.1)、檀香树(S.album)(-)-β-甜没药烯合酶(NCBI登录号ADP37189.1)、黄皮(Clausena lansium)α-檀香萜合酶(NCBI登录号ADR71055.1)或檀香树(S.album)α-/β-檀香萜合酶(NCBI登录号ADP30867.1))构成。

带有合成操纵子的不同组合的质粒按照如下工序来制备。分别通过(E)-β-法呢烯合酶和(E)-α-法呢烯合酶cDNAs以PCR方式扩增质粒pD444-SR-AaBFS(包含编码黄花蒿(Artemisia annua) (E)-β-法呢烯合酶(NCBI登录号AAX39387.1)AaBFS的优化的 cDNA)、质粒pD444-SR-PaAFS(包含编码云杉(Picea abies)(E)-α- 法呢烯合酶(NCBI登录号AAS47697.1)PaAFS的优化的cDNA)。将质粒pETDuet-SaTps647和pETDuet-SaTps30(实施例22)作为模板,并且分别通过倍半香桧烯B合酶和甜没药烯合酶cDNAs以 PCR的方式扩增。对于每个结构,设计引物用于采用In-

将PCR产物连接到采用HindIII restriction酶消解的质粒 SaCP816-CPRm-pCWori(SEQ ID No 74)或 SaCP10374-CPRm-pCWOri(SEQ ID NO 82)中,并且采用 In-

如实施例17所述那样采用上述质粒在大肠杆菌细胞中进行含氧倍半萜的体内生产。在体外实验中也可观察到,所有用这些质粒转化的重组细菌细胞生产出了所期望的倍半萜烃以及相应的含氧产物(图22~26)。

序列表

<110> 弗门尼舍有限公司

<120> 用于生产芳香醇的方法

<130> P219547WO

<150> US 61880149

<151> 2013-09-19

<160> 104

<170> PatentIn version 3.5

<210> 1

<211> 1500

<212> DNA

<213> 菊苣(Cichorium intybus)

<400> 1

atggagattt ctatccccac tacccttggc cttgccgtca tcatcttcat cattttcaag 60

ttgctaacgc gtaccacatc aaagaaaaac ctactcccag agccatggag actaccaata 120

atcggacaca tgcatcatct gataggtacg atgccacatc gtggtgtcat ggaactagcc 180

aggaagcatg gatctctcat gcatctacaa cttggagaag tgtccactat tgtggtctca 240

tccccacgtt gggcaaaaga ggttctgaca acgtacgata ttacgtttgc aaacagaccg 300

gagactttaa ccggtgagat tgttgcatat cacaataccg atattgtcct tgctccgtat 360

ggtgaatact ggaggcagtt gcgaaagctt tgcaccttgg agcttttaag caacaagaaa 420

gtgaagtcgt ttcagtccct tcgtgaggag gaatgttgga atctggttaa agacattcga 480

tcaactgggc agggatcccc aatcaatctt tcagaaaaca ttttcaagat gattgccacc 540

atacttagta gggcagcatt cggaaaggga atcaaagacc aaatgaaatt tacagaatta 600

gtaaaagaaa tactaaggct tacgggaggt tttgatgtgg cggacatctt tccttctaaa 660

aagttacttc accatctttc aggcaagaga gctaagttaa ccaacataca caataagctt 720

gacaatttga tcaacaatat catcgctgag caccctggaa accgtacaag ctcatcacag 780

gagactctac ttgatgttct gttaagactg aaagaaagcg cagagtttcc attgacagca 840

gacaatgtca aagcagtcat tttggatatg tttggagctg gcacggatac ttcgtcagcc 900

acaattgaat gggcaatctc agaattgata aggtgtccga gagccatgga gaaagttcaa 960

acagaattaa ggcaagcact aaatggaaag gaaaggatcc aagaagaaga tctacaggaa 1020

ctaaattacc taaagctagt gatcaaagaa acattgaggt tgcatccacc actaccgttg 1080

gttatgccta gagagtgtag ggagccatgt gtgttggggg gatacgatat acccagcaag 1140

acgaaactta ttgtcaacgt gtttgccata aacagggatc ctgaatactg gaaagatgct 1200

gaaactttca tgccagagag atttgaaaac agccccatca ctgtaatggg ttcagagtat 1260

gagtatctcc cgtttggtgc aggaagaaga atgtgtccag gcgctgccct tggtttagcc 1320

aacgtggaac ttcctcttgc tcatatactt tactacttca attggaagct cccaaatgga 1380

aaaacatttg aagacttgga catgactgag agctttggag ccactgtcca aagaaagacg 1440

gagttgttac tagttccaac ggatttccaa acacttacgg catctactta atgactcgag 1500

<210> 2

<211> 496

<212> PRT

<213> 菊苣(Cichorium intybus)

<400> 2

Met Glu Ile Ser Ile Pro Thr Thr Leu Gly Leu Ala Val Ile Ile Phe

1 5 10 15

Ile Ile Phe Lys Leu Leu Thr Arg Thr Thr Ser Lys Lys Asn Leu Leu

20 25 30

Pro Glu Pro Trp Arg Leu Pro Ile Ile Gly His Met His His Leu Ile

35 40 45

Gly Thr Met Pro His Arg Gly Val Met Glu Leu Ala Arg Lys His Gly

50 55 60

Ser Leu Met His Leu Gln Leu Gly Glu Val Ser Thr Ile Val Val Ser

65 70 75 80

Ser Pro Arg Trp Ala Lys Glu Val Leu Thr Thr Tyr Asp Ile Thr Phe

85 90 95

Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu Ile Val Ala Tyr His Asn

100 105 110

Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu Tyr Trp Arg Gln Leu Arg

115 120 125

Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn Lys Lys Val Lys Ser Phe

130 135 140

Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn Leu Val Lys Asp Ile Arg

145 150 155 160

Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu Ser Glu Asn Ile Phe Lys

165 170 175

Met Ile Ala Thr Ile Leu Ser Arg Ala Ala Phe Gly Lys Gly Ile Lys

180 185 190

Asp Gln Met Lys Phe Thr Glu Leu Val Lys Glu Ile Leu Arg Leu Thr

195 200 205

Gly Gly Phe Asp Val Ala Asp Ile Phe Pro Ser Lys Lys Leu Leu His

210 215 220

His Leu Ser Gly Lys Arg Ala Lys Leu Thr Asn Ile His Asn Lys Leu

225 230 235 240

Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu His Pro Gly Asn Arg Thr

245 250 255

Ser Ser Ser Gln Glu Thr Leu Leu Asp Val Leu Leu Arg Leu Lys Glu

260 265 270

Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn Val Lys Ala Val Ile Leu

275 280 285

Asp Met Phe Gly Ala Gly Thr Asp Thr Ser Ser Ala Thr Ile Glu Trp

290 295 300

Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg Ala Met Glu Lys Val Gln

305 310 315 320

Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys Glu Arg Ile Gln Glu Glu

325 330 335

Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu Val Ile Lys Glu Thr Leu

340 345 350

Arg Leu His Pro Pro Leu Pro Leu Val Met Pro Arg Glu Cys Arg Glu

355 360 365

Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro Ser Lys Thr Lys Leu Ile

370 375 380

Val Asn Val Phe Ala Ile Asn Arg Asp Pro Glu Tyr Trp Lys Asp Ala

385 390 395 400

Glu Thr Phe Met Pro Glu Arg Phe Glu Asn Ser Pro Ile Thr Val Met

405 410 415

Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly Ala Gly Arg Arg Met Cys

420 425 430

Pro Gly Ala Ala Leu Gly Leu Ala Asn Val Glu Leu Pro Leu Ala His

435 440 445

Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro Asn Gly Lys Thr Phe Glu

450 455 460

Asp Leu Asp Met Thr Glu Ser Phe Gly Ala Thr Val Gln Arg Lys Thr

465 470 475 480

Glu Leu Leu Leu Val Pro Thr Asp Phe Gln Thr Leu Thr Ala Ser Thr

485 490 495

<210> 3

<211> 1473

<212> DNA

<213> 人工序列

<220>

<223> CYP71AV8-65188 DNA 序列

<400> 3

atggcactct tactggcagt attctggtcc gccctgatca ttcttgtaac ccgcacgact 60

agcaaaaaga atctgttgcc ggagccatgg cgtctgccga ttatcggtca catgcaccat 120

ttgatcggca ccatgccgca tcgtggtgtt atggaactgg cccgtaagca tggcagcctg 180

atgcacctgc aactgggtga agtctctacg attgttgtca gcagcccgcg ttgggcgaaa 240

gaggtcttga ccacctatga tatcaccttc gccaatcgcc cggaaaccct gactggcgag 300

atcgtcgcat accacaacac ggatatcgtc ctggcgccgt atggtgagta ttggcgtcaa 360

ctgcgtaaac tgtgcacgct ggagctgctg agcaacaaga aagtgaagag cttccagagc 420

ctgcgcgaag aagagtgttg gaacctggtc aaggacatcc gcagcaccgg ccaaggtagc 480

ccaatcaatc tgtcggagaa cattttcaag atgattgcga cgattctgag ccgtgctgcg 540

ttcggtaagg gtattaagga tcaaatgaag tttaccgaac tggtgaaaga aatcctgcgt 600

ctgaccggcg gttttgatgt cgctgacatc ttccctagca agaagttgct gcaccacctg 660

agcggcaagc gtgcaaaact gaccaatatc cataacaagc tggataatct gatcaataac 720

atcatcgcag agcacccggg caaccgtacc tcgtcctccc aggaaacgct gctggacgtt 780

ctgctgcgcc tgaaagagtc tgcggagttt ccgctgaccg ccgacaacgt taaagcagtg 840

atcctggata tgttcggcgc tggtacggat accagcagcg cgacgatcga gtgggcgatt 900

agcgagctga ttcgctgccc tcgcgcgatg gagaaagtgc agacggaatt gcgtcaggca 960

ctgaatggca aagagcgtat tcaggaagag gatttgcagg agctgaatta tctgaagctg 1020

gtgattaaag aaaccctgcg cctgcatccg ccgttgccgc tggtgatgcc gcgtgagtgc 1080

cgtgaaccgt gtgttttggg cggttacgac attccgagca aaacgaagct gatcgttaat 1140

gttttcgcga ttaaccgtga cccggaatac tggaaagacg cggaaacgtt tatgccggag 1200

cgttttgaga atagcccgat taccgttatg ggttccgagt acgaatacct gccatttggt 1260

gctggtcgtc gtatgtgtcc tggtgcagcg ctgggtctgg ccaacgtgga actgccgctg 1320

gcgcacattc tgtactattt caactggaaa ctgccgaacg gcaagacctt cgaagatttg 1380

gacatgaccg agagctttgg tgccactgtg cagcgcaaaa ccgagctgct gctggttccg 1440

accgactttc aaacgctgac tgcgagcacc taa 1473

<210> 4

<211> 490

<212> PRT

<213> 人工序列

<220>

<223> CYP71AV8-65188 氨基酸序列

<400> 4

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val

1 5 10 15

Thr Arg Thr Thr Ser Lys Lys Asn Leu Leu Pro Glu Pro Trp Arg Leu

20 25 30

Pro Ile Ile Gly His Met His His Leu Ile Gly Thr Met Pro His Arg

35 40 45

Gly Val Met Glu Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln

50 55 60

Leu Gly Glu Val Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys

65 70 75 80

Glu Val Leu Thr Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr

85 90 95

Leu Thr Gly Glu Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala

100 105 110

Pro Tyr Gly Glu Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu

115 120 125

Leu Leu Ser Asn Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu

130 135 140

Glu Cys Trp Asn Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser

145 150 155 160

Pro Ile Asn Leu Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu

165 170 175

Ser Arg Ala Ala Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr

180 185 190

Glu Leu Val Lys Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala

195 200 205

Asp Ile Phe Pro Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg

210 215 220

Ala Lys Leu Thr Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn

225 230 235 240

Ile Ile Ala Glu His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr

245 250 255

Leu Leu Asp Val Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu

260 265 270

Thr Ala Asp Asn Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly

275 280 285

Thr Asp Thr Ser Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile

290 295 300

Arg Cys Pro Arg Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala

305 310 315 320

Leu Asn Gly Lys Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn

325 330 335

Tyr Leu Lys Leu Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Leu

340 345 350

Pro Leu Val Met Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly

355 360 365

Tyr Asp Ile Pro Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile

370 375 380

Asn Arg Asp Pro Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu

385 390 395 400

Arg Phe Glu Asn Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr

405 410 415

Leu Pro Phe Gly Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly

420 425 430

Leu Ala Asn Val Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn

435 440 445

Trp Lys Leu Pro Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu

450 455 460

Ser Phe Gly Ala Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro

465 470 475 480

Thr Asp Phe Gln Thr Leu Thr Ala Ser Thr

485 490

<210> 5

<211> 1509

<212> DNA

<213> 人工序列

<220>

<223> CYP71AV8-P2 DNA 序列

<400> 5

atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60

atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120

ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180

ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240

tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300

accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360

atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420

ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480

ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540

ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600

atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660

gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720

aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780

cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840

gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900

acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960

gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020

gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080

catccgccgt tgccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140

tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200

gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260

gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320

gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380

tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440

actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac gctgactgcg 1500

agcacctaa 1509

<210> 6

<211> 502

<212> PRT

<213> 人工序列

<220>

<223> CYP71AV8-P2 氨基酸序列

<400> 6

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val

1 5 10 15

Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys

20 25 30

Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly

35 40 45

His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu

50 55 60

Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val

65 70 75 80

Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr

85 90 95

Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu

100 105 110

Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu

115 120 125

Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn

130 135 140

Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn

145 150 155 160

Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu

165 170 175

Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala

180 185 190

Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys

195 200 205

Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro

210 215 220

Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr

225 230 235 240

Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu

245 250 255

His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val

260 265 270

Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn

275 280 285

Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser

290 295 300

Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg

305 310 315 320

Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys

325 330 335

Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu

340 345 350

Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Leu Pro Leu Val Met

355 360 365

Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro

370 375 380

Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro

385 390 395 400

Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn

405 410 415

Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly

420 425 430

Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val

435 440 445

Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro

450 455 460

Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala

465 470 475 480

Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln

485 490 495

Thr Leu Thr Ala Ser Thr

500

<210> 7

<211> 1509

<212> DNA

<213> 人工序列

<220>

<223> CYP71AV8-P2O DNA 序列

<400> 7

atggcactgt tgctggctgt cttttggtct gctctgatta ttttggtggt tacctacacc 60

atctccctgc tgattaacca gtggcgtaaa ccgaaaccac agggtaaatt cccgccgggt 120

ccgtggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180

ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240

tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300

accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360

atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420

ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480

ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540

ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600

atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660

gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720

aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780

cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840

gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900

acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960

gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020

gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080

catccgccgt tgccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140

tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200

gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260

gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320

gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380

tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440

actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac gctgactgcg 1500

agcacctaa 1509

<210> 8

<211> 502

<212> PRT

<213> 人工序列

<220>

<223> CYP71AV8-P2O 氨基酸序列

<400> 8

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val

1 5 10 15

Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys

20 25 30

Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly

35 40 45

His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu

50 55 60

Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val

65 70 75 80

Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr

85 90 95

Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu

100 105 110

Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu

115 120 125

Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn

130 135 140

Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn

145 150 155 160

Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu

165 170 175

Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala

180 185 190

Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys

195 200 205

Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro

210 215 220

Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr

225 230 235 240

Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu

245 250 255

His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val

260 265 270

Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn

275 280 285

Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser

290 295 300

Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg

305 310 315 320

Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys

325 330 335

Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu

340 345 350

Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Leu Pro Leu Val Met

355 360 365

Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro

370 375 380

Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro

385 390 395 400

Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn

405 410 415

Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly

420 425 430

Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val

435 440 445

Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro

450 455 460

Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala

465 470 475 480

Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln

485 490 495

Thr Leu Thr Ala Ser Thr

500

<210> 9

<211> 2133

<212> DNA

<213> 椒样薄荷(Mentha piperita)

<400> 9

atggaaccta gctctcagaa actgtctccg ttggaatttg ttgctgctat cctgaagggc 60

gactacagca gcggtcaggt tgaaggtggt ccaccgccag gtctggcagc tatgttgatg 120

gaaaataagg atttggtgat ggttctgacg acgtccgtgg cagtcctgat cggctgtgtc 180

gtggtcctgg catggcgtcg tgcggcaggt agcggtaagt acaagcaacc tgaactgcct 240

aaactggtgg tcccgaaagc agccgaaccg gaggaggcag aggatgataa aaccaagatc 300

agcgtgtttt tcggcaccca aaccggtacg gcagaaggtt tcgcgaaggc ttttgttgaa 360

gaggccaagg cgcgttatca gcaggcccgt ttcaaagtta tcgacctgga cgactatgcg 420

gcagacgatg acgagtacga agagaaactg aagaaggaaa acttggcatt cttcttcttg 480

gcgtcctacg gtgacggcga gccgacggac aacgcggcac gcttttacaa atggtttacg 540

gagggtaagg accgtggtga atggctgaac aatctgcagt acggcgtttt tggtctgggt 600

aaccgtcaat atgagcattt caataagatc gccattgtcg tcgatgatct gatcttcgag 660

caaggtggca agaagctggt tccggtgggt ctgggtgacg atgaccagtg cattgaggat 720

gattttgcgg cgtggcgtga actggtctgg ccggaactgg ataaactgct gcgtaacgaa 780

gacgacgcta ccgtggcaac cccgtacagc gccgctgtgc tgcaataccg cgtggttttc 840

cacgatcaca ttgacggcct gattagcgaa aacggtagcc cgaacggtca tgctaatggc 900

aataccgtgt acgatgcgca acacccgtgc cgtagcaacg tcgcggtcaa gaaggaattg 960

catactccgg cgagcgatcg cagctgcacc cacctggaat ttaacattag cggtaccggc 1020

ctgatgtacg agacgggtga ccacgtcggt gtgtattgcg agaacctgtt ggaaaccgtg 1080

gaggaggccg agaagttgtt gaacctgagc ccgcagacgt acttctccgt tcacaccgac 1140

aacgaggacg gtacgccgtt gagcggcagc agcctgccgc caccgtttcc gccgtgcacc 1200

ttgcgcacgg cattgaccaa atacgcagac ttgacttctg caccgaaaaa gtcggtgctg 1260

gtggcgctgg ccgagtacgc atctgaccag ggtgaagcgg atcgtttgcg tttcttggcg 1320

agcccgagcg gcaaagagga atatgcacag tacatcttgg caagccagcg cacgctgctg 1380

gaggtcatgg cggagttccc gtcggcgaaa ccgccgctgg gtgtcttttt cgcgggtgtc 1440

gctccgcgcc tgcagccgcg tttctattcc attagctcta gcccgaagat cgcaccgttc 1500

cgtattcacg tgacctgcgc cctggtttat gacaaatccc ctaccggtcg cgttcataag 1560

ggcatctgta gcacgtggat gaaaaatgcg gtcccgctgg aagaaagcaa cgattgttcc 1620

tgggctccga tcttcgtccg caacagcaac ttcaagctgc cgaccgaccc gaaggttccg 1680

attatcatga ttggtccggg taccggtctg gccccttttc gtggcttttt gcaagagcgc 1740

ttggcgttga aagagagcgg tgctgaattg ggtccggcga tcttgttctt tggttgccgt 1800

aaccgtaaaa tggactttat ttacgaggat gaactgaatg atttcgtcaa agcgggcgtt 1860

gtcagcgagc tgatcgtcgc ttttagccgc gaaggcccga tgaaagaata cgtgcaacac 1920

aaaatgagcc aacgtgcctc cgatgtgtgg aacatcatta gcgacggtgg ttatgtttat 1980

gtttgcggtg acgcgaaggg tatggctcgt gatgttcacc gtaccctgca taccatcgca 2040

caggagcaag gtagcatgtc cagctcggag gccgaaggta tggtcaaaaa cctgcaaacc 2100

accggtcgtt acctgcgtga tgtgtggtaa taa 2133

<210> 10

<211> 709

<212> PRT

<213> 椒样薄荷(Mentha piperita)

<400> 10

Met Glu Pro Ser Ser Gln Lys Leu Ser Pro Leu Glu Phe Val Ala Ala

1 5 10 15

Ile Leu Lys Gly Asp Tyr Ser Ser Gly Gln Val Glu Gly Gly Pro Pro

20 25 30

Pro Gly Leu Ala Ala Met Leu Met Glu Asn Lys Asp Leu Val Met Val

35 40 45

Leu Thr Thr Ser Val Ala Val Leu Ile Gly Cys Val Val Val Leu Ala

50 55 60

Trp Arg Arg Ala Ala Gly Ser Gly Lys Tyr Lys Gln Pro Glu Leu Pro

65 70 75 80

Lys Leu Val Val Pro Lys Ala Ala Glu Pro Glu Glu Ala Glu Asp Asp

85 90 95

Lys Thr Lys Ile Ser Val Phe Phe Gly Thr Gln Thr Gly Thr Ala Glu

100 105 110

Gly Phe Ala Lys Ala Phe Val Glu Glu Ala Lys Ala Arg Tyr Gln Gln

115 120 125

Ala Arg Phe Lys Val Ile Asp Leu Asp Asp Tyr Ala Ala Asp Asp Asp

130 135 140

Glu Tyr Glu Glu Lys Leu Lys Lys Glu Asn Leu Ala Phe Phe Phe Leu

145 150 155 160

Ala Ser Tyr Gly Asp Gly Glu Pro Thr Asp Asn Ala Ala Arg Phe Tyr

165 170 175

Lys Trp Phe Thr Glu Gly Lys Asp Arg Gly Glu Trp Leu Asn Asn Leu

180 185 190

Gln Tyr Gly Val Phe Gly Leu Gly Asn Arg Gln Tyr Glu His Phe Asn

195 200 205

Lys Ile Ala Ile Val Val Asp Asp Leu Ile Phe Glu Gln Gly Gly Lys

210 215 220

Lys Leu Val Pro Val Gly Leu Gly Asp Asp Asp Gln Cys Ile Glu Asp

225 230 235 240

Asp Phe Ala Ala Trp Arg Glu Leu Val Trp Pro Glu Leu Asp Lys Leu

245 250 255

Leu Arg Asn Glu Asp Asp Ala Thr Val Ala Thr Pro Tyr Ser Ala Ala

260 265 270

Val Leu Gln Tyr Arg Val Val Phe His Asp His Ile Asp Gly Leu Ile

275 280 285

Ser Glu Asn Gly Ser Pro Asn Gly His Ala Asn Gly Asn Thr Val Tyr

290 295 300

Asp Ala Gln His Pro Cys Arg Ser Asn Val Ala Val Lys Lys Glu Leu

305 310 315 320

His Thr Pro Ala Ser Asp Arg Ser Cys Thr His Leu Glu Phe Asn Ile

325 330 335

Ser Gly Thr Gly Leu Met Tyr Glu Thr Gly Asp His Val Gly Val Tyr

340 345 350

Cys Glu Asn Leu Leu Glu Thr Val Glu Glu Ala Glu Lys Leu Leu Asn

355 360 365

Leu Ser Pro Gln Thr Tyr Phe Ser Val His Thr Asp Asn Glu Asp Gly

370 375 380

Thr Pro Leu Ser Gly Ser Ser Leu Pro Pro Pro Phe Pro Pro Cys Thr

385 390 395 400

Leu Arg Thr Ala Leu Thr Lys Tyr Ala Asp Leu Thr Ser Ala Pro Lys

405 410 415

Lys Ser Val Leu Val Ala Leu Ala Glu Tyr Ala Ser Asp Gln Gly Glu

420 425 430

Ala Asp Arg Leu Arg Phe Leu Ala Ser Pro Ser Gly Lys Glu Glu Tyr

435 440 445

Ala Gln Tyr Ile Leu Ala Ser Gln Arg Thr Leu Leu Glu Val Met Ala

450 455 460

Glu Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala Gly Val

465 470 475 480

Ala Pro Arg Leu Gln Pro Arg Phe Tyr Ser Ile Ser Ser Ser Pro Lys

485 490 495

Ile Ala Pro Phe Arg Ile His Val Thr Cys Ala Leu Val Tyr Asp Lys

500 505 510

Ser Pro Thr Gly Arg Val His Lys Gly Ile Cys Ser Thr Trp Met Lys

515 520 525

Asn Ala Val Pro Leu Glu Glu Ser Asn Asp Cys Ser Trp Ala Pro Ile

530 535 540

Phe Val Arg Asn Ser Asn Phe Lys Leu Pro Thr Asp Pro Lys Val Pro

545 550 555 560

Ile Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg Gly Phe

565 570 575

Leu Gln Glu Arg Leu Ala Leu Lys Glu Ser Gly Ala Glu Leu Gly Pro

580 585 590

Ala Ile Leu Phe Phe Gly Cys Arg Asn Arg Lys Met Asp Phe Ile Tyr

595 600 605

Glu Asp Glu Leu Asn Asp Phe Val Lys Ala Gly Val Val Ser Glu Leu

610 615 620

Ile Val Ala Phe Ser Arg Glu Gly Pro Met Lys Glu Tyr Val Gln His

625 630 635 640

Lys Met Ser Gln Arg Ala Ser Asp Val Trp Asn Ile Ile Ser Asp Gly

645 650 655

Gly Tyr Val Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Arg Asp Val

660 665 670

His Arg Thr Leu His Thr Ile Ala Gln Glu Gln Gly Ser Met Ser Ser

675 680 685

Ser Glu Ala Glu Gly Met Val Lys Asn Leu Gln Thr Thr Gly Arg Tyr

690 695 700

Leu Arg Asp Val Trp

705

<210> 11

<211> 1992

<212> DNA

<213> 黄花蒿(Artemisia annua)

<400> 11

atggcactgg acaaactgga cctgtacgta atcatcacct tagtcgtcgc cgtggccgcg 60

tattttgcga aaaatcgccg ctcgtctagc gcagccaaga aagccgcgga gagcccggtt 120

attgtcgtcc cgaagaaggt tacggaggac gaagtggacg acggtcgtaa aaaggtcacg 180

gtgttcttcg gcacgcagac tggtaccgct gaaggtttcg cgaaggcgct ggttgaagaa 240

gcaaaggcgc gctatgaaaa ggcagtgttc aaggttatcg atctggacga ttacgccgca 300

gaggacgacg aatacgagga gaagttgaaa aaggagtccc tcgccttctt cttcctggcg 360

acgtacggcg atggtgagcc gaccgataac gcagctcgtt tctacaagtg gttcaccgag 420

ggtgaggaga agggtgagtg gctggataaa ctgcaatatg cggtctttgg tctgggcaac 480

cgccaatatg agcacttcaa taagatcgca aaggttgtgg atgagaaact ggtcgagcag 540

ggtgccaagc gcctggtgcc ggttggcatg ggtgatgacg atcagtgcat cgaggatgac 600

ttcaccgcct ggaaggagct ggtgtggccg gagctggacc aactgttgcg cgacgaagat 660

gacaccagcg ttgcgacgcc gtataccgcg gcagttggcg aatatcgtgt tgtttttcat 720

gataagccgg aaacctacga tcaggatcaa ctgaccaatg gtcatgctgt gcatgacgcg 780

cagcacccgt gcagaagcaa tgttgctgtt aagaaagaat tgcactctcc gctgtccgat 840

cgcagctgca cccacctgga atttgacatc agcaataccg gtttgagcta cgaaacgggc 900

gatcacgtcg gtgtgtatgt ggaaaatctg agcgaagttg tcgatgaggc tgagaagctg 960

atcggtttac caccgcacac ctacttcagc gtgcatactg acaatgagga tggcacccca 1020

ctgggcggtg ctagcctgcc accgcctttc ccgccttgca ccctgcgcaa agccctcgct 1080

agctacgctg atgtgctgag cagcccgaag aagagcgcac tgctggcact ggcagcacac 1140

gctaccgatt ccaccgaagc cgatcgcctg aagtttttcg ctagcccggc aggcaaggac 1200

gagtatgcgc agtggattgt cgcgagccac cgtagcctgc tggaagtgat ggaggcgttc 1260

ccgagcgcga agcctccgct cggcgtcttt ttcgcatcgg ttgcgcctcg cctgcaaccg 1320

cgttattact caatcagcag ctctccgaaa ttcgcgccga atcgtattca cgttacttgc 1380

gcgctggttt atgagcaaac tccgagcggt cgtgttcaca agggcgtttg ctctacctgg 1440

atgaaaaacg cggttcctat gacggagagc caagactgta gctgggctcc gatttatgtt 1500

cgcacgtcta actttcgcct gcctagcgac ccgaaggtgc cagtgattat gattggtccg 1560

ggtaccggtc tggcaccgtt ccgcggtttc ctgcaagaac gtctggcaca gaaagaagct 1620

ggtacggaat tgggcaccgc aattctgttc tttggttgtc gtaatcgtaa agtggacttt 1680

atctatgagg atgaactgaa caacttcgtg gaaaccggtg ccctgagcga attggtgacg 1740

gctttttctc gtgagggtgc gaccaaagaa tacgtgcagc acaagatgac gcagaaagca 1800

agcgacattt ggaatctgct gtccgaaggt gcgtacctgt atgtctgtgg cgacgcgaag 1860

ggcatggcaa aagacgttca ccgtaccctg cacaccattg tgcaggagca aggtagcctg 1920

gactcttcga aggcggaatt gtacgtcaaa aacctgcaaa tggccggtcg ttatctgcgt 1980

gacgtttggt aa 1992

<210> 12

<211> 663

<212> PRT

<213> 黄花蒿(Artemisia annua)

<400> 12

Met Ala Leu Asp Lys Leu Asp Leu Tyr Val Ile Ile Thr Leu Val Val

1 5 10 15

Ala Val Ala Ala Tyr Phe Ala Lys Asn Arg Arg Ser Ser Ser Ala Ala

20 25 30

Lys Lys Ala Ala Glu Ser Pro Val Ile Val Val Pro Lys Lys Val Thr

35 40 45

Glu Asp Glu Val Asp Asp Gly Arg Lys Lys Val Thr Val Phe Phe Gly

50 55 60

Thr Gln Thr Gly Thr Ala Glu Gly Phe Ala Lys Ala Leu Val Glu Glu

65 70 75 80

Ala Lys Ala Arg Tyr Glu Lys Ala Val Phe Lys Val Ile Asp Leu Asp

85 90 95

Asp Tyr Ala Ala Glu Asp Asp Glu Tyr Glu Glu Lys Leu Lys Lys Glu

100 105 110

Ser Leu Ala Phe Phe Phe Leu Ala Thr Tyr Gly Asp Gly Glu Pro Thr

115 120 125

Asp Asn Ala Ala Arg Phe Tyr Lys Trp Phe Thr Glu Gly Glu Glu Lys

130 135 140

Gly Glu Trp Leu Asp Lys Leu Gln Tyr Ala Val Phe Gly Leu Gly Asn

145 150 155 160

Arg Gln Tyr Glu His Phe Asn Lys Ile Ala Lys Val Val Asp Glu Lys

165 170 175

Leu Val Glu Gln Gly Ala Lys Arg Leu Val Pro Val Gly Met Gly Asp

180 185 190

Asp Asp Gln Cys Ile Glu Asp Asp Phe Thr Ala Trp Lys Glu Leu Val

195 200 205

Trp Pro Glu Leu Asp Gln Leu Leu Arg Asp Glu Asp Asp Thr Ser Val

210 215 220

Ala Thr Pro Tyr Thr Ala Ala Val Gly Glu Tyr Arg Val Val Phe His

225 230 235 240

Asp Lys Pro Glu Thr Tyr Asp Gln Asp Gln Leu Thr Asn Gly His Ala

245 250 255

Val His Asp Ala Gln His Pro Cys Arg Ser Asn Val Ala Val Lys Lys

260 265 270

Glu Leu His Ser Pro Leu Ser Asp Arg Ser Cys Thr His Leu Glu Phe

275 280 285

Asp Ile Ser Asn Thr Gly Leu Ser Tyr Glu Thr Gly Asp His Val Gly

290 295 300

Val Tyr Val Glu Asn Leu Ser Glu Val Val Asp Glu Ala Glu Lys Leu

305 310 315 320

Ile Gly Leu Pro Pro His Thr Tyr Phe Ser Val His Thr Asp Asn Glu

325 330 335

Asp Gly Thr Pro Leu Gly Gly Ala Ser Leu Pro Pro Pro Phe Pro Pro

340 345 350

Cys Thr Leu Arg Lys Ala Leu Ala Ser Tyr Ala Asp Val Leu Ser Ser

355 360 365

Pro Lys Lys Ser Ala Leu Leu Ala Leu Ala Ala His Ala Thr Asp Ser

370 375 380

Thr Glu Ala Asp Arg Leu Lys Phe Phe Ala Ser Pro Ala Gly Lys Asp

385 390 395 400

Glu Tyr Ala Gln Trp Ile Val Ala Ser His Arg Ser Leu Leu Glu Val

405 410 415

Met Glu Ala Phe Pro Ser Ala Lys Pro Pro Leu Gly Val Phe Phe Ala

420 425 430

Ser Val Ala Pro Arg Leu Gln Pro Arg Tyr Tyr Ser Ile Ser Ser Ser

435 440 445

Pro Lys Phe Ala Pro Asn Arg Ile His Val Thr Cys Ala Leu Val Tyr

450 455 460

Glu Gln Thr Pro Ser Gly Arg Val His Lys Gly Val Cys Ser Thr Trp

465 470 475 480

Met Lys Asn Ala Val Pro Met Thr Glu Ser Gln Asp Cys Ser Trp Ala

485 490 495

Pro Ile Tyr Val Arg Thr Ser Asn Phe Arg Leu Pro Ser Asp Pro Lys

500 505 510

Val Pro Val Ile Met Ile Gly Pro Gly Thr Gly Leu Ala Pro Phe Arg

515 520 525

Gly Phe Leu Gln Glu Arg Leu Ala Gln Lys Glu Ala Gly Thr Glu Leu

530 535 540

Gly Thr Ala Ile Leu Phe Phe Gly Cys Arg Asn Arg Lys Val Asp Phe

545 550 555 560

Ile Tyr Glu Asp Glu Leu Asn Asn Phe Val Glu Thr Gly Ala Leu Ser

565 570 575

Glu Leu Val Thr Ala Phe Ser Arg Glu Gly Ala Thr Lys Glu Tyr Val

580 585 590

Gln His Lys Met Thr Gln Lys Ala Ser Asp Ile Trp Asn Leu Leu Ser

595 600 605

Glu Gly Ala Tyr Leu Tyr Val Cys Gly Asp Ala Lys Gly Met Ala Lys

610 615 620

Asp Val His Arg Thr Leu His Thr Ile Val Gln Glu Gln Gly Ser Leu

625 630 635 640

Asp Ser Ser Lys Ala Glu Leu Tyr Val Lys Asn Leu Gln Met Ala Gly

645 650 655

Arg Tyr Leu Arg Asp Val Trp

660

<210> 13

<211> 3534

<212> DNA

<213> 人工序列

<220>

<223> pCWori-CYP71AV8-P2-aaCPR 插入DNA 序列

<400> 13

catatggctc tgttattagc agttttttgg tcggcgctta taatcctcgt agtaacctac 60

accatatccc tcctaatcaa ccaatggcga aaaccgaaac cccaagggaa gttccccccg 120

ggcccatggc gtctgccgat tatcggtcac atgcaccatt tgatcggcac catgccgcat 180

cgtggtgtta tggaactggc ccgtaagcat ggcagcctga tgcacctgca actgggtgaa 240

gtctctacga ttgttgtcag cagcccgcgt tgggcgaaag aggtcttgac cacctatgat 300

atcaccttcg ccaatcgccc ggaaaccctg actggcgaga tcgtcgcata ccacaacacg 360

gatatcgtcc tggcgccgta tggtgagtat tggcgtcaac tgcgtaaact gtgcacgctg 420

gagctgctga gcaacaagaa agtgaagagc ttccagagcc tgcgcgaaga agagtgttgg 480

aacctggtca aggacatccg cagcaccggc caaggtagcc caatcaatct gtcggagaac 540

attttcaaga tgattgcgac gattctgagc cgtgctgcgt tcggtaaggg tattaaggat 600

caaatgaagt ttaccgaact ggtgaaagaa atcctgcgtc tgaccggcgg ttttgatgtc 660

gctgacatct tccctagcaa gaagttgctg caccacctga gcggcaagcg tgcaaaactg 720

accaatatcc ataacaagct ggataatctg atcaataaca tcatcgcaga gcacccgggc 780

aaccgtacct cgtcctccca ggaaacgctg ctggacgttc tgctgcgcct gaaagagtct 840

gcggagtttc cgctgaccgc cgacaacgtt aaagcagtga tcctggatat gttcggcgct 900

ggtacggata ccagcagcgc gacgatcgag tgggcgatta gcgagctgat tcgctgccct 960

cgcgcgatgg agaaagtgca gacggaattg cgtcaggcac tgaatggcaa agagcgtatt 1020

caggaagagg atttgcagga gctgaattat ctgaagctgg tgattaaaga aaccctgcgc 1080

ctgcatccgc cgttgccgct ggtgatgccg cgtgagtgcc gtgaaccgtg tgttttgggc 1140

ggttacgaca ttccgagcaa aacgaagctg atcgttaatg ttttcgcgat taaccgtgac 1200

ccggaatact ggaaagacgc ggaaacgttt atgccggagc gttttgagaa tagcccgatt 1260

accgttatgg gttccgagta cgaatacctg ccatttggtg ctggtcgtcg tatgtgtcct 1320

ggtgcagcgc tgggtctggc caacgtggaa ctgccgctgg cgcacattct gtactatttc 1380

aactggaaac tgccgaacgg caagaccttc gaagatttgg acatgaccga gagctttggt 1440

gccactgtgc agcgcaaaac cgagctgctg ctggttccga ccgactttca aacgctgact 1500

gcgagcacct aatgagtcga cagaggaaga tataccatgg cactggacaa actggacctg 1560

tacgtaatca tcaccttagt cgtcgccgtg gccgcgtatt ttgcgaaaaa tcgccgctcg 1620

tctagcgcag ccaagaaagc cgcggagagc ccggttattg tcgtcccgaa gaaggttacg 1680

gaggacgaag tggacgacgg tcgtaaaaag gtcacggtgt tcttcggcac gcagactggt 1740

accgctgaag gtttcgcgaa ggcgctggtt gaagaagcaa aggcgcgcta tgaaaaggca 1800

gtgttcaagg ttatcgatct ggacgattac gccgcagagg acgacgaata cgaggagaag 1860

ttgaaaaagg agtccctcgc cttcttcttc ctggcgacgt acggcgatgg tgagccgacc 1920

gataacgcag ctcgtttcta caagtggttc accgagggtg aggagaaggg tgagtggctg 1980

gataaactgc aatatgcggt ctttggtctg ggcaaccgcc aatatgagca cttcaataag 2040

atcgcaaagg ttgtggatga gaaactggtc gagcagggtg ccaagcgcct ggtgccggtt 2100

ggcatgggtg atgacgatca gtgcatcgag gatgacttca ccgcctggaa ggagctggtg 2160

tggccggagc tggaccaact gttgcgcgac gaagatgaca ccagcgttgc gacgccgtat 2220

accgcggcag ttggcgaata tcgtgttgtt tttcatgata agccggaaac ctacgatcag 2280

gatcaactga ccaatggtca tgctgtgcat gacgcgcagc acccgtgcag aagcaatgtt 2340

gctgttaaga aagaattgca ctctccgctg tccgatcgca gctgcaccca cctggaattt 2400

gacatcagca ataccggttt gagctacgaa acgggcgatc acgtcggtgt gtatgtggaa 2460

aatctgagcg aagttgtcga tgaggctgag aagctgatcg gtttaccacc gcacacctac 2520

ttcagcgtgc atactgacaa tgaggatggc accccactgg gcggtgctag cctgccaccg 2580

cctttcccgc cttgcaccct gcgcaaagcc ctcgctagct acgctgatgt gctgagcagc 2640

ccgaagaaga gcgcactgct ggcactggca gcacacgcta ccgattccac cgaagccgat 2700

cgcctgaagt ttttcgctag cccggcaggc aaggacgagt atgcgcagtg gattgtcgcg 2760

agccaccgta gcctgctgga agtgatggag gcgttcccga gcgcgaagcc tccgctcggc 2820

gtctttttcg catcggttgc gcctcgcctg caaccgcgtt attactcaat cagcagctct 2880

ccgaaattcg cgccgaatcg tattcacgtt acttgcgcgc tggtttatga gcaaactccg 2940

agcggtcgtg ttcacaaggg cgtttgctct acctggatga aaaacgcggt tcctatgacg 3000

gagagccaag actgtagctg ggctccgatt tatgttcgca cgtctaactt tcgcctgcct 3060

agcgacccga aggtgccagt gattatgatt ggtccgggta ccggtctggc accgttccgc 3120

ggtttcctgc aagaacgtct ggcacagaaa gaagctggta cggaattggg caccgcaatt 3180

ctgttctttg gttgtcgtaa tcgtaaagtg gactttatct atgaggatga actgaacaac 3240

ttcgtggaaa ccggtgccct gagcgaattg gtgacggctt tttctcgtga gggtgcgacc 3300

aaagaatacg tgcagcacaa gatgacgcag aaagcaagcg acatttggaa tctgctgtcc 3360

gaaggtgcgt acctgtatgt ctgtggcgac gcgaagggca tggcaaaaga cgttcaccgt 3420

accctgcaca ccattgtgca ggagcaaggt agcctggact cttcgaaggc ggaattgtac 3480

gtcaaaaacc tgcaaatggc cggtcgttat ctgcgtgacg tttggtaaaa gctt 3534

<210> 14

<211> 3534

<212> DNA

<213> 人工序列

<220>

<223> pCWori-CYP71AV8-P2O-aaCPR 插入DNA 序列

<400> 14

catatggcac tgttgctggc tgtcttttgg tctgctctga ttattttggt ggttacctac 60

accatctccc tgctgattaa ccagtggcgt aaaccgaaac cacagggtaa attcccgccg 120

ggtccgtggc gtctgccgat tatcggtcac atgcaccatt tgatcggcac catgccgcat 180

cgtggtgtta tggaactggc ccgtaagcat ggcagcctga tgcacctgca actgggtgaa 240

gtctctacga ttgttgtcag cagcccgcgt tgggcgaaag aggtcttgac cacctatgat 300

atcaccttcg ccaatcgccc ggaaaccctg actggcgaga tcgtcgcata ccacaacacg 360

gatatcgtcc tggcgccgta tggtgagtat tggcgtcaac tgcgtaaact gtgcacgctg 420

gagctgctga gcaacaagaa agtgaagagc ttccagagcc tgcgcgaaga agagtgttgg 480

aacctggtca aggacatccg cagcaccggc caaggtagcc caatcaatct gtcggagaac 540

attttcaaga tgattgcgac gattctgagc cgtgctgcgt tcggtaaggg tattaaggat 600

caaatgaagt ttaccgaact ggtgaaagaa atcctgcgtc tgaccggcgg ttttgatgtc 660

gctgacatct tccctagcaa gaagttgctg caccacctga gcggcaagcg tgcaaaactg 720

accaatatcc ataacaagct ggataatctg atcaataaca tcatcgcaga gcacccgggc 780

aaccgtacct cgtcctccca ggaaacgctg ctggacgttc tgctgcgcct gaaagagtct 840

gcggagtttc cgctgaccgc cgacaacgtt aaagcagtga tcctggatat gttcggcgct 900

ggtacggata ccagcagcgc gacgatcgag tgggcgatta gcgagctgat tcgctgccct 960

cgcgcgatgg agaaagtgca gacggaattg cgtcaggcac tgaatggcaa agagcgtatt 1020

caggaagagg atttgcagga gctgaattat ctgaagctgg tgattaaaga aaccctgcgc 1080

ctgcatccgc cgttgccgct ggtgatgccg cgtgagtgcc gtgaaccgtg tgttttgggc 1140

ggttacgaca ttccgagcaa aacgaagctg atcgttaatg ttttcgcgat taaccgtgac 1200

ccggaatact ggaaagacgc ggaaacgttt atgccggagc gttttgagaa tagcccgatt 1260

accgttatgg gttccgagta cgaatacctg ccatttggtg ctggtcgtcg tatgtgtcct 1320

ggtgcagcgc tgggtctggc caacgtggaa ctgccgctgg cgcacattct gtactatttc 1380

aactggaaac tgccgaacgg caagaccttc gaagatttgg acatgaccga gagctttggt 1440

gccactgtgc agcgcaaaac cgagctgctg ctggttccga ccgactttca aacgctgact 1500

gcgagcacct aatgagtcga cagaggaaga tataccatgg cactggacaa actggacctg 1560

tacgtaatca tcaccttagt cgtcgccgtg gccgcgtatt ttgcgaaaaa tcgccgctcg 1620

tctagcgcag ccaagaaagc cgcggagagc ccggttattg tcgtcccgaa gaaggttacg 1680

gaggacgaag tggacgacgg tcgtaaaaag gtcacggtgt tcttcggcac gcagactggt 1740

accgctgaag gtttcgcgaa ggcgctggtt gaagaagcaa aggcgcgcta tgaaaaggca 1800

gtgttcaagg ttatcgatct ggacgattac gccgcagagg acgacgaata cgaggagaag 1860

ttgaaaaagg agtccctcgc cttcttcttc ctggcgacgt acggcgatgg tgagccgacc 1920

gataacgcag ctcgtttcta caagtggttc accgagggtg aggagaaggg tgagtggctg 1980

gataaactgc aatatgcggt ctttggtctg ggcaaccgcc aatatgagca cttcaataag 2040

atcgcaaagg ttgtggatga gaaactggtc gagcagggtg ccaagcgcct ggtgccggtt 2100

ggcatgggtg atgacgatca gtgcatcgag gatgacttca ccgcctggaa ggagctggtg 2160

tggccggagc tggaccaact gttgcgcgac gaagatgaca ccagcgttgc gacgccgtat 2220

accgcggcag ttggcgaata tcgtgttgtt tttcatgata agccggaaac ctacgatcag 2280

gatcaactga ccaatggtca tgctgtgcat gacgcgcagc acccgtgcag aagcaatgtt 2340

gctgttaaga aagaattgca ctctccgctg tccgatcgca gctgcaccca cctggaattt 2400

gacatcagca ataccggttt gagctacgaa acgggcgatc acgtcggtgt gtatgtggaa 2460

aatctgagcg aagttgtcga tgaggctgag aagctgatcg gtttaccacc gcacacctac 2520

ttcagcgtgc atactgacaa tgaggatggc accccactgg gcggtgctag cctgccaccg 2580

cctttcccgc cttgcaccct gcgcaaagcc ctcgctagct acgctgatgt gctgagcagc 2640

ccgaagaaga gcgcactgct ggcactggca gcacacgcta ccgattccac cgaagccgat 2700

cgcctgaagt ttttcgctag cccggcaggc aaggacgagt atgcgcagtg gattgtcgcg 2760

agccaccgta gcctgctgga agtgatggag gcgttcccga gcgcgaagcc tccgctcggc 2820

gtctttttcg catcggttgc gcctcgcctg caaccgcgtt attactcaat cagcagctct 2880

ccgaaattcg cgccgaatcg tattcacgtt acttgcgcgc tggtttatga gcaaactccg 2940

agcggtcgtg ttcacaaggg cgtttgctct acctggatga aaaacgcggt tcctatgacg 3000

gagagccaag actgtagctg ggctccgatt tatgttcgca cgtctaactt tcgcctgcct 3060

agcgacccga aggtgccagt gattatgatt ggtccgggta ccggtctggc accgttccgc 3120

ggtttcctgc aagaacgtct ggcacagaaa gaagctggta cggaattggg caccgcaatt 3180

ctgttctttg gttgtcgtaa tcgtaaagtg gactttatct atgaggatga actgaacaac 3240

ttcgtggaaa ccggtgccct gagcgaattg gtgacggctt tttctcgtga gggtgcgacc 3300

aaagaatacg tgcagcacaa gatgacgcag aaagcaagcg acatttggaa tctgctgtcc 3360

gaaggtgcgt acctgtatgt ctgtggcgac gcgaagggca tggcaaaaga cgttcaccgt 3420

accctgcaca ccattgtgca ggagcaaggt agcctggact cttcgaaggc ggaattgtac 3480

gtcaaaaacc tgcaaatggc cggtcgttat ctgcgtgacg tttggtaaaa gctt 3534

<210> 15

<211> 3684

<212> DNA

<213> 人工序列

<220>

<223> pCWori-CYP71AV8-P2-CPRm 插入DNA 序列

<400> 15

catatggctc tgttattagc agttttttgg tcggcgctta taatcctcgt agtaacctac 60

accatatccc tcctaatcaa ccaatggcga aaaccgaaac cccaagggaa gttccccccg 120

ggcccatggc gtctgccgat tatcggtcac atgcaccatt tgatcggcac catgccgcat 180

cgtggtgtta tggaactggc ccgtaagcat ggcagcctga tgcacctgca actgggtgaa 240

gtctctacga ttgttgtcag cagcccgcgt tgggcgaaag aggtcttgac cacctatgat 300

atcaccttcg ccaatcgccc ggaaaccctg actggcgaga tcgtcgcata ccacaacacg 360

gatatcgtcc tggcgccgta tggtgagtat tggcgtcaac tgcgtaaact gtgcacgctg 420

gagctgctga gcaacaagaa agtgaagagc ttccagagcc tgcgcgaaga agagtgttgg 480

aacctggtca aggacatccg cagcaccggc caaggtagcc caatcaatct gtcggagaac 540

attttcaaga tgattgcgac gattctgagc cgtgctgcgt tcggtaaggg tattaaggat 600

caaatgaagt ttaccgaact ggtgaaagaa atcctgcgtc tgaccggcgg ttttgatgtc 660

gctgacatct tccctagcaa gaagttgctg caccacctga gcggcaagcg tgcaaaactg 720

accaatatcc ataacaagct ggataatctg atcaataaca tcatcgcaga gcacccgggc 780

aaccgtacct cgtcctccca ggaaacgctg ctggacgttc tgctgcgcct gaaagagtct 840

gcggagtttc cgctgaccgc cgacaacgtt aaagcagtga tcctggatat gttcggcgct 900

ggtacggata ccagcagcgc gacgatcgag tgggcgatta gcgagctgat tcgctgccct 960

cgcgcgatgg agaaagtgca gacggaattg cgtcaggcac tgaatggcaa agagcgtatt 1020

caggaagagg atttgcagga gctgaattat ctgaagctgg tgattaaaga aaccctgcgc 1080

ctgcatccgc cgttgccgct ggtgatgccg cgtgagtgcc gtgaaccgtg tgttttgggc 1140

ggttacgaca ttccgagcaa aacgaagctg atcgttaatg ttttcgcgat taaccgtgac 1200

ccggaatact ggaaagacgc ggaaacgttt atgccggagc gttttgagaa tagcccgatt 1260

accgttatgg gttccgagta cgaatacctg ccatttggtg ctggtcgtcg tatgtgtcct 1320

ggtgcagcgc tgggtctggc caacgtggaa ctgccgctgg cgcacattct gtactatttc 1380

aactggaaac tgccgaacgg caagaccttc gaagatttgg acatgaccga gagctttggt 1440

gccactgtgc agcgcaaaac cgagctgctg ctggttccga ccgactttca aaccctgact 1500

gcgagcacct aatgagtcga ctaactttaa gaaggagata tatccatgga acctagctct 1560

cagaaactgt ctccgttgga atttgttgct gctatcctga agggcgacta cagcagcggt 1620

caggttgaag gtggtccacc gccaggtctg gcagctatgt tgatggaaaa taaggatttg 1680

gtgatggttc tgacgacgtc cgtggcagtc ctgatcggct gtgtcgtggt cctggcatgg 1740

cgtcgtgcgg caggtagcgg taagtacaag caacctgaac tgcctaaact ggtggtcccg 1800

aaagcagccg aaccggagga ggcagaggat gataaaacca agatcagcgt gtttttcggc 1860

acccaaaccg gtacggcaga aggtttcgcg aaggcttttg ttgaagaggc caaggcgcgt 1920

tatcagcagg cccgtttcaa agttatcgac ctggacgact atgcggcaga cgatgacgag 1980

tacgaagaga aactgaagaa ggaaaacttg gcattcttct tcttggcgtc ctacggtgac 2040

ggcgagccga cggacaacgc ggcacgcttt tacaaatggt ttacggaggg taaggaccgt 2100

ggtgaatggc tgaacaatct gcagtacggc gtttttggtc tgggtaaccg tcaatatgag 2160

catttcaata agatcgccat tgtcgtcgat gatctgatct tcgagcaagg tggcaagaag 2220

ctggttccgg tgggtctggg tgacgatgac cagtgcattg aggatgattt tgcggcgtgg 2280

cgtgaactgg tctggccgga actggataaa ctgctgcgta acgaagacga cgctaccgtg 2340

gcaaccccgt acagcgccgc tgtgctgcaa taccgcgtgg ttttccacga tcacattgac 2400

ggcctgatta gcgaaaacgg tagcccgaac ggtcatgcta atggcaatac cgtgtacgat 2460

gcgcaacacc cgtgccgtag caacgtcgcg gtcaagaagg aattgcatac tccggcgagc 2520

gatcgcagct gcacccacct ggaatttaac attagcggta ccggcctgat gtacgagacg 2580

ggtgaccacg tcggtgtgta ttgcgagaac ctgttggaaa ccgtggagga ggccgagaag 2640

ttgttgaacc tgagcccgca gacgtacttc tccgttcaca ccgacaacga ggacggtacg 2700

ccgttgagcg gcagcagcct gccgccaccg tttccgccgt gcaccttgcg cacggcattg 2760

accaaatacg cagacttgac ttctgcaccg aaaaagtcgg tgctggtggc gctggccgag 2820

tacgcatctg accagggtga agcggatcgt ttgcgtttct tggcgagccc gagcggcaaa 2880

gaggaatatg cacagtacat cttggcaagc cagcgcacgc tgctggaggt catggcggag 2940

ttcccgtcgg cgaaaccgcc gctgggtgtc tttttcgcgg gtgtcgctcc gcgcctgcag 3000

ccgcgtttct attccattag ctctagcccg aagatcgcac cgttccgtat tcacgtgacc 3060

tgcgccctgg tttatgacaa atcccctacc ggtcgcgttc ataagggcat ctgtagcacg 3120

tggatgaaaa atgcggtccc gctggaagaa agcaacgatt gttcctgggc tccgatcttc 3180

gtccgcaaca gcaacttcaa gctgccgacc gacccgaagg ttccgattat catgattggt 3240

ccgggtaccg gtctggcccc ttttcgtggc tttttgcaag agcgcttggc gttgaaagag 3300

agcggtgctg aattgggtcc ggcgatcttg ttctttggtt gccgtaaccg taaaatggac 3360

tttatttacg aggatgaact gaatgatttc gtcaaagcgg gcgttgtcag cgagctgatc 3420

gtcgctttta gccgcgaagg cccgatgaaa gaatacgtgc aacacaaaat gagccaacgt 3480

gcctccgatg tgtggaacat cattagcgac ggtggttatg tttatgtttg cggtgacgcg 3540

aagggtatgg ctcgtgatgt tcaccgtacc ctgcatacca tcgcacagga gcaaggtagc 3600

atgtccagct cggaggccga aggtatggtc aaaaacctgc aaaccaccgg tcgttacctg 3660

cgtgatgtgt ggtaataaaa gctt 3684

<210> 16

<211> 3684

<212> DNA

<213> 人工序列

<220>

<223> pCWori-CYP71AV8-P2O-CPRm 插入DNA 序列

<400> 16

catatggcac tgttgctggc tgtcttttgg tctgctctga ttattttggt ggttacctac 60

accatctccc tgctgattaa ccagtggcgt aaaccgaaac cacagggtaa attcccgccg 120

ggtccgtggc gtctgccgat tatcggtcac atgcaccatt tgatcggcac catgccgcat 180

cgtggtgtta tggaactggc ccgtaagcat ggcagcctga tgcacctgca actgggtgaa 240

gtctctacga ttgttgtcag cagcccgcgt tgggcgaaag aggtcttgac cacctatgat 300

atcaccttcg ccaatcgccc ggaaaccctg actggcgaga tcgtcgcata ccacaacacg 360

gatatcgtcc tggcgccgta tggtgagtat tggcgtcaac tgcgtaaact gtgcacgctg 420

gagctgctga gcaacaagaa agtgaagagc ttccagagcc tgcgcgaaga agagtgttgg 480

aacctggtca aggacatccg cagcaccggc caaggtagcc caatcaatct gtcggagaac 540

attttcaaga tgattgcgac gattctgagc cgtgctgcgt tcggtaaggg tattaaggat 600

caaatgaagt ttaccgaact ggtgaaagaa atcctgcgtc tgaccggcgg ttttgatgtc 660

gctgacatct tccctagcaa gaagttgctg caccacctga gcggcaagcg tgcaaaactg 720

accaatatcc ataacaagct ggataatctg atcaataaca tcatcgcaga gcacccgggc 780

aaccgtacct cgtcctccca ggaaacgctg ctggacgttc tgctgcgcct gaaagagtct 840

gcggagtttc cgctgaccgc cgacaacgtt aaagcagtga tcctggatat gttcggcgct 900

ggtacggata ccagcagcgc gacgatcgag tgggcgatta gcgagctgat tcgctgccct 960

cgcgcgatgg agaaagtgca gacggaattg cgtcaggcac tgaatggcaa agagcgtatt 1020

caggaagagg atttgcagga gctgaattat ctgaagctgg tgattaaaga aaccctgcgc 1080

ctgcatccgc cgttgccgct ggtgatgccg cgtgagtgcc gtgaaccgtg tgttttgggc 1140

ggttacgaca ttccgagcaa aacgaagctg atcgttaatg ttttcgcgat taaccgtgac 1200

ccggaatact ggaaagacgc ggaaacgttt atgccggagc gttttgagaa tagcccgatt 1260

accgttatgg gttccgagta cgaatacctg ccatttggtg ctggtcgtcg tatgtgtcct 1320

ggtgcagcgc tgggtctggc caacgtggaa ctgccgctgg cgcacattct gtactatttc 1380

aactggaaac tgccgaacgg caagaccttc gaagatttgg acatgaccga gagctttggt 1440

gccactgtgc agcgcaaaac cgagctgctg ctggttccga ccgactttca aacgctgact 1500

gcgagcacct aatgagtcga ctaactttaa gaaggagata tatccatgga acctagctct 1560

cagaaactgt ctccgttgga atttgttgct gctatcctga agggcgacta cagcagcggt 1620

caggttgaag gtggtccacc gccaggtctg gcagctatgt tgatggaaaa taaggatttg 1680

gtgatggttc tgacgacgtc cgtggcagtc ctgatcggct gtgtcgtggt cctggcatgg 1740

cgtcgtgcgg caggtagcgg taagtacaag caacctgaac tgcctaaact ggtggtcccg 1800

aaagcagccg aaccggagga ggcagaggat gataaaacca agatcagcgt gtttttcggc 1860

acccaaaccg gtacggcaga aggtttcgcg aaggcttttg ttgaagaggc caaggcgcgt 1920

tatcagcagg cccgtttcaa agttatcgac ctggacgact atgcggcaga cgatgacgag 1980

tacgaagaga aactgaagaa ggaaaacttg gcattcttct tcttggcgtc ctacggtgac 2040

ggcgagccga cggacaacgc ggcacgcttt tacaaatggt ttacggaggg taaggaccgt 2100

ggtgaatggc tgaacaatct gcagtacggc gtttttggtc tgggtaaccg tcaatatgag 2160

catttcaata agatcgccat tgtcgtcgat gatctgatct tcgagcaagg tggcaagaag 2220

ctggttccgg tgggtctggg tgacgatgac cagtgcattg aggatgattt tgcggcgtgg 2280

cgtgaactgg tctggccgga actggataaa ctgctgcgta acgaagacga cgctaccgtg 2340

gcaaccccgt acagcgccgc tgtgctgcaa taccgcgtgg ttttccacga tcacattgac 2400

ggcctgatta gcgaaaacgg tagcccgaac ggtcatgcta atggcaatac cgtgtacgat 2460

gcgcaacacc cgtgccgtag caacgtcgcg gtcaagaagg aattgcatac tccggcgagc 2520

gatcgcagct gcacccacct ggaatttaac attagcggta ccggcctgat gtacgagacg 2580

ggtgaccacg tcggtgtgta ttgcgagaac ctgttggaaa ccgtggagga ggccgagaag 2640

ttgttgaacc tgagcccgca gacgtacttc tccgttcaca ccgacaacga ggacggtacg 2700

ccgttgagcg gcagcagcct gccgccaccg tttccgccgt gcaccttgcg cacggcattg 2760

accaaatacg cagacttgac ttctgcaccg aaaaagtcgg tgctggtggc gctggccgag 2820

tacgcatctg accagggtga agcggatcgt ttgcgtttct tggcgagccc gagcggcaaa 2880

gaggaatatg cacagtacat cttggcaagc cagcgcacgc tgctggaggt catggcggag 2940

ttcccgtcgg cgaaaccgcc gctgggtgtc tttttcgcgg gtgtcgctcc gcgcctgcag 3000

ccgcgtttct attccattag ctctagcccg aagatcgcac cgttccgtat tcacgtgacc 3060

tgcgccctgg tttatgacaa atcccctacc ggtcgcgttc ataagggcat ctgtagcacg 3120

tggatgaaaa atgcggtccc gctggaagaa agcaacgatt gttcctgggc tccgatcttc 3180

gtccgcaaca gcaacttcaa gctgccgacc gacccgaagg ttccgattat catgattggt 3240

ccgggtaccg gtctggcccc ttttcgtggc tttttgcaag agcgcttggc gttgaaagag 3300

agcggtgctg aattgggtcc ggcgatcttg ttctttggtt gccgtaaccg taaaatggac 3360

tttatttacg aggatgaact gaatgatttc gtcaaagcgg gcgttgtcag cgagctgatc 3420

gtcgctttta gccgcgaagg cccgatgaaa gaatacgtgc aacacaaaat gagccaacgt 3480

gcctccgatg tgtggaacat cattagcgac ggtggttatg tttatgtttg cggtgacgcg 3540

aagggtatgg ctcgtgatgt tcaccgtacc ctgcatacca tcgcacagga gcaaggtagc 3600

atgtccagct cggaggccga aggtatggtc aaaaacctgc aaaccaccgg tcgttacctg 3660

cgtgatgtgt ggtaataaaa gctt 3684

<210> 17

<211> 3498

<212> DNA

<213> 人工序列

<220>

<223> pCWori-CYP71AV8-65188-aaCPR 插入DNA 序列

<400> 17

catatggcac tcttactggc agtattctgg tccgccctga tcattcttgt aacccgcacg 60

actagcaaaa agaatctgtt gccggagcca tggcgtctgc cgattatcgg tcacatgcac 120

catttgatcg gcaccatgcc gcatcgtggt gttatggaac tggcccgtaa gcatggcagc 180

ctgatgcacc tgcaactggg tgaagtctct acgattgttg tcagcagccc gcgttgggcg 240

aaagaggtct tgaccaccta tgatatcacc ttcgccaatc gcccggaaac cctgactggc 300

gagatcgtcg cataccacaa cacggatatc gtcctggcgc cgtatggtga gtattggcgt 360

caactgcgta aactgtgcac gctggagctg ctgagcaaca agaaagtgaa gagcttccag 420

agcctgcgcg aagaagagtg ttggaacctg gtcaaggaca tccgcagcac cggccaaggt 480

agcccaatca atctgtcgga gaacattttc aagatgattg cgacgattct gagccgtgct 540

gcgttcggta agggtattaa ggatcaaatg aagtttaccg aactggtgaa agaaatcctg 600

cgtctgaccg gcggttttga tgtcgctgac atcttcccta gcaagaagtt gctgcaccac 660

ctgagcggca agcgtgcaaa actgaccaat atccataaca agctggataa tctgatcaat 720

aacatcatcg cagagcaccc gggcaaccgt acctcgtcct cccaggaaac gctgctggac 780

gttctgctgc gcctgaaaga gtctgcggag tttccgctga ccgccgacaa cgttaaagca 840

gtgatcctgg atatgttcgg cgctggtacg gataccagca gcgcgacgat cgagtgggcg 900

attagcgagc tgattcgctg ccctcgcgcg atggagaaag tgcagacgga attgcgtcag 960

gcactgaatg gcaaagagcg tattcaggaa gaggatttgc aggagctgaa ttatctgaag 1020

ctggtgatta aagaaaccct gcgcctgcat ccgccgttgc cgctggtgat gccgcgtgag 1080

tgccgtgaac cgtgtgtttt gggcggttac gacattccga gcaaaacgaa gctgatcgtt 1140

aatgttttcg cgattaaccg tgacccggaa tactggaaag acgcggaaac gtttatgccg 1200

gagcgttttg agaatagccc gattaccgtt atgggttccg agtacgaata cctgccattt 1260

ggtgctggtc gtcgtatgtg tcctggtgca gcgctgggtc tggccaacgt ggaactgccg 1320

ctggcgcaca ttctgtacta tttcaactgg aaactgccga acggcaagac cttcgaagat 1380

ttggacatga ccgagagctt tggtgccact gtgcagcgca aaaccgagct gctgctggtt 1440

ccgaccgact ttcaaacgct gactgcgagc acctaatgag tcgacagagg aagatatacc 1500

atggcactgg acaaactgga cctgtacgta atcatcacct tagtcgtcgc cgtggccgcg 1560

tattttgcga aaaatcgccg ctcgtctagc gcagccaaga aagccgcgga gagcccggtt 1620

attgtcgtcc cgaagaaggt tacggaggac gaagtggacg acggtcgtaa aaaggtcacg 1680

gtgttcttcg gcacgcagac tggtaccgct gaaggtttcg cgaaggcgct ggttgaagaa 1740

gcaaaggcgc gctatgaaaa ggcagtgttc aaggttatcg atctggacga ttacgccgca 1800

gaggacgacg aatacgagga gaagttgaaa aaggagtccc tcgccttctt cttcctggcg 1860

acgtacggcg atggtgagcc gaccgataac gcagctcgtt tctacaagtg gttcaccgag 1920

ggtgaggaga agggtgagtg gctggataaa ctgcaatatg cggtctttgg tctgggcaac 1980

cgccaatatg agcacttcaa taagatcgca aaggttgtgg atgagaaact ggtcgagcag 2040

ggtgccaagc gcctggtgcc ggttggcatg ggtgatgacg atcagtgcat cgaggatgac 2100

ttcaccgcct ggaaggagct ggtgtggccg gagctggacc aactgttgcg cgacgaagat 2160

gacaccagcg ttgcgacgcc gtataccgcg gcagttggcg aatatcgtgt tgtttttcat 2220

gataagccgg aaacctacga tcaggatcaa ctgaccaatg gtcatgctgt gcatgacgcg 2280

cagcacccgt gcagaagcaa tgttgctgtt aagaaagaat tgcactctcc gctgtccgat 2340

cgcagctgca cccacctgga atttgacatc agcaataccg gtttgagcta cgaaacgggc 2400

gatcacgtcg gtgtgtatgt ggaaaatctg agcgaagttg tcgatgaggc tgagaagctg 2460

atcggtttac caccgcacac ctacttcagc gtgcatactg acaatgagga tggcacccca 2520

ctgggcggtg ctagcctgcc accgcctttc ccgccttgca ccctgcgcaa agccctcgct 2580

agctacgctg atgtgctgag cagcccgaag aagagcgcac tgctggcact ggcagcacac 2640

gctaccgatt ccaccgaagc cgatcgcctg aagtttttcg ctagcccggc aggcaaggac 2700

gagtatgcgc agtggattgt cgcgagccac cgtagcctgc tggaagtgat ggaggcgttc 2760

ccgagcgcga agcctccgct cggcgtcttt ttcgcatcgg ttgcgcctcg cctgcaaccg 2820

cgttattact caatcagcag ctctccgaaa ttcgcgccga atcgtattca cgttacttgc 2880

gcgctggttt atgagcaaac tccgagcggt cgtgttcaca agggcgtttg ctctacctgg 2940

atgaaaaacg cggttcctat gacggagagc caagactgta gctgggctcc gatttatgtt 3000

cgcacgtcta actttcgcct gcctagcgac ccgaaggtgc cagtgattat gattggtccg 3060

ggtaccggtc tggcaccgtt ccgcggtttc ctgcaagaac gtctggcaca gaaagaagct 3120

ggtacggaat tgggcaccgc aattctgttc tttggttgtc gtaatcgtaa agtggacttt 3180

atctatgagg atgaactgaa caacttcgtg gaaaccggtg ccctgagcga attggtgacg 3240

gctttttctc gtgagggtgc gaccaaagaa tacgtgcagc acaagatgac gcagaaagca 3300

agcgacattt ggaatctgct gtccgaaggt gcgtacctgt atgtctgtgg cgacgcgaag 3360

ggcatggcaa aagacgttca ccgtaccctg cacaccattg tgcaggagca aggtagcctg 3420

gactcttcga aggcggaatt gtacgtcaaa aacctgcaaa tggccggtcg ttatctgcgt 3480

gacgtttggt aaaagctt 3498

<210> 18

<211> 3648

<212> DNA

<213> 人工序列

<220>

<223> pCWori-CYP71AV8-65188-CPRm 插入DNA 序列

<400> 18

catatggcac tcttactggc agtattctgg tccgccctga tcattcttgt aacccgcacg 60

actagcaaaa agaatctgtt gccggagcca tggcgtctgc cgattatcgg tcacatgcac 120

catttgatcg gcaccatgcc gcatcgtggt gttatggaac tggcccgtaa gcatggcagc 180

ctgatgcacc tgcaactggg tgaagtctct acgattgttg tcagcagccc gcgttgggcg 240

aaagaggtct tgaccaccta tgatatcacc ttcgccaatc gcccggaaac cctgactggc 300

gagatcgtcg cataccacaa cacggatatc gtcctggcgc cgtatggtga gtattggcgt 360

caactgcgta aactgtgcac gctggagctg ctgagcaaca agaaagtgaa gagcttccag 420

agcctgcgcg aagaagagtg ttggaacctg gtcaaggaca tccgcagcac cggccaaggt 480

agcccaatca atctgtcgga gaacattttc aagatgattg cgacgattct gagccgtgct 540

gcgttcggta agggtattaa ggatcaaatg aagtttaccg aactggtgaa agaaatcctg 600

cgtctgaccg gcggttttga tgtcgctgac atcttcccta gcaagaagtt gctgcaccac 660

ctgagcggca agcgtgcaaa actgaccaat atccataaca agctggataa tctgatcaat 720

aacatcatcg cagagcaccc gggcaaccgt acctcgtcct cccaggaaac gctgctggac 780

gttctgctgc gcctgaaaga gtctgcggag tttccgctga ccgccgacaa cgttaaagca 840

gtgatcctgg atatgttcgg cgctggtacg gataccagca gcgcgacgat cgagtgggcg 900

attagcgagc tgattcgctg ccctcgcgcg atggagaaag tgcagacgga attgcgtcag 960

gcactgaatg gcaaagagcg tattcaggaa gaggatttgc aggagctgaa ttatctgaag 1020

ctggtgatta aagaaaccct gcgcctgcat ccgccgttgc cgctggtgat gccgcgtgag 1080

tgccgtgaac cgtgtgtttt gggcggttac gacattccga gcaaaacgaa gctgatcgtt 1140

aatgttttcg cgattaaccg tgacccggaa tactggaaag acgcggaaac gtttatgccg 1200

gagcgttttg agaatagccc gattaccgtt atgggttccg agtacgaata cctgccattt 1260

ggtgctggtc gtcgtatgtg tcctggtgca gcgctgggtc tggccaacgt ggaactgccg 1320

ctggcgcaca ttctgtacta tttcaactgg aaactgccga acggcaagac cttcgaagat 1380

ttggacatga ccgagagctt tggtgccact gtgcagcgca aaaccgagct gctgctggtt 1440

ccgaccgact ttcaaacgct gactgcgagc acctaatgag tcgactaact ttaagaagga 1500

gatatatcca tggaacctag ctctcagaaa ctgtctccgt tggaatttgt tgctgctatc 1560

ctgaagggcg actacagcag cggtcaggtt gaaggtggtc caccgccagg tctggcagct 1620

atgttgatgg aaaataagga tttggtgatg gttctgacga cgtccgtggc agtcctgatc 1680

ggctgtgtcg tggtcctggc atggcgtcgt gcggcaggta gcggtaagta caagcaacct 1740

gaactgccta aactggtggt cccgaaagca gccgaaccgg aggaggcaga ggatgataaa 1800

accaagatca gcgtgttttt cggcacccaa accggtacgg cagaaggttt cgcgaaggct 1860

tttgttgaag aggccaaggc gcgttatcag caggcccgtt tcaaagttat cgacctggac 1920

gactatgcgg cagacgatga cgagtacgaa gagaaactga agaaggaaaa cttggcattc 1980

ttcttcttgg cgtcctacgg tgacggcgag ccgacggaca acgcggcacg cttttacaaa 2040

tggtttacgg agggtaagga ccgtggtgaa tggctgaaca atctgcagta cggcgttttt 2100

ggtctgggta accgtcaata tgagcatttc aataagatcg ccattgtcgt cgatgatctg 2160

atcttcgagc aaggtggcaa gaagctggtt ccggtgggtc tgggtgacga tgaccagtgc 2220

attgaggatg attttgcggc gtggcgtgaa ctggtctggc cggaactgga taaactgctg 2280

cgtaacgaag acgacgctac cgtggcaacc ccgtacagcg ccgctgtgct gcaataccgc 2340

gtggttttcc acgatcacat tgacggcctg attagcgaaa acggtagccc gaacggtcat 2400

gctaatggca ataccgtgta cgatgcgcaa cacccgtgcc gtagcaacgt cgcggtcaag 2460

aaggaattgc atactccggc gagcgatcgc agctgcaccc acctggaatt taacattagc 2520

ggtaccggcc tgatgtacga gacgggtgac cacgtcggtg tgtattgcga gaacctgttg 2580

gaaaccgtgg aggaggccga gaagttgttg aacctgagcc cgcagacgta cttctccgtt 2640

cacaccgaca acgaggacgg tacgccgttg agcggcagca gcctgccgcc accgtttccg 2700

ccgtgcacct tgcgcacggc attgaccaaa tacgcagact tgacttctgc accgaaaaag 2760

tcggtgctgg tggcgctggc cgagtacgca tctgaccagg gtgaagcgga tcgtttgcgt 2820

ttcttggcga gcccgagcgg caaagaggaa tatgcacagt acatcttggc aagccagcgc 2880

acgctgctgg aggtcatggc ggagttcccg tcggcgaaac cgccgctggg tgtctttttc 2940

gcgggtgtcg ctccgcgcct gcagccgcgt ttctattcca ttagctctag cccgaagatc 3000

gcaccgttcc gtattcacgt gacctgcgcc ctggtttatg acaaatcccc taccggtcgc 3060

gttcataagg gcatctgtag cacgtggatg aaaaatgcgg tcccgctgga agaaagcaac 3120

gattgttcct gggctccgat cttcgtccgc aacagcaact tcaagctgcc gaccgacccg 3180

aaggttccga ttatcatgat tggtccgggt accggtctgg ccccttttcg tggctttttg 3240

caagagcgct tggcgttgaa agagagcggt gctgaattgg gtccggcgat cttgttcttt 3300

ggttgccgta accgtaaaat ggactttatt tacgaggatg aactgaatga tttcgtcaaa 3360

gcgggcgttg tcagcgagct gatcgtcgct tttagccgcg aaggcccgat gaaagaatac 3420

gtgcaacaca aaatgagcca acgtgcctcc gatgtgtgga acatcattag cgacggtggt 3480

tatgtttatg tttgcggtga cgcgaagggt atggctcgtg atgttcaccg taccctgcat 3540

accatcgcac aggagcaagg tagcatgtcc agctcggagg ccgaaggtat ggtcaaaaac 3600

ctgcaaacca ccggtcgtta cctgcgtgat gtgtggtaat aaaagctt 3648

<210> 19

<211> 1665

<212> DNA

<213> 人工序列

<220>

<223> α-檀香萜合酶优化的 cDNA 序列

<400> 19

atggaccaca tgtctaccca gcaggttagc tccgagaata tcgttcgcaa cgcggcgaac 60

ttccacccga atatctgggg taatcatttc ttgacgtgtc caagccagac gatcgattct 120

tggacgcaac aacaccataa agagctgaaa gaagaggtcc gcaagatgat ggtgagcgac 180

gcaaacaaac cggcacaacg tctgcgtctg attgacaccg ttcaacgttt gggcgtggcg 240

tatcatttcg aaaaagaaat cgatgacgct ctggaaaaga tcggtcacga tccgtttgac 300

gataaggatg acctgtatat cgttagcctg tgttttcgcc tgctgcgtca gcatggcatc 360

aagattagct gcgatgtttt tgagaagttc aaagacgacg atggcaagtt taaggcttcc 420

ctgatgaatg atgtccaagg tatgctgtcg ttgtatgaag cggcccacct ggcaattcat 480

ggcgaggaca tcctggatga ggctattgtc tttacgacca cccacctgaa gagcaccgtt 540

tctaactccc cggtcaattc cacctttgcg gaacagattc gccacagcct gcgtgtgccg 600

ctgcgtaagg cagtcccgcg tttggagagc cgctacttcc tggatatcta tagccgtgac 660

gacctgcacg acaagactct gctgaacttt gccaaactgg acttcaacat cctgcaggcg 720

atgcaccaga aagaggcaag cgagatgacc cgttggtggc gtgatttcga tttcctgaag 780

aagctgccgt acattcgtga tcgcgtggtt gaactgtact tttggatttt ggtcggtgtg 840

agctaccaac cgaaattcag cacgggtcgt atctttttga gcaagattat ctgtctggaa 900

accctggtgg acgacacgtt tgatgcgtac ggtactttcg acgaactggc cattttcacc 960

gaggccgtta cgcgttggga cctgggtcat cgcgacgcgc tgcctgagta catgaaattc 1020

attttcaaga ccctgattga tgtgtacagc gaggcggaac aagagctggc aaaagagggc 1080

cgctcctata gcattcacta tgcgatccgt agcttccagg agttggtcat gaagtacttt 1140

tgcgaggcga aatggctgaa taagggttat gttccgagcc tggatgacta caagagcgtc 1200

agcctgcgca gcatcggctt cctgccgatc gccgtggctt cttttgtttt catgggcgac 1260

attgctacga aagaggtttt tgagtgggaa atgaataacc cgaaaatcat catcgcagcc 1320

gaaaccattt tccgctttct ggatgacatt gcaggtcatc gcttcgaaca aaaacgtgag 1380

cacagcccga gcgcaatcga gtgctacaaa aaccaacatg gtgtctcgga agaagaggca 1440

gtgaaagcgc tgagcttgga ggtcgccaat tcgtggaaag acattaacga agagctgctg 1500

ctgaacccta tggcaattcc actgccgttg ctgcaggtga tcctggattt gagccgtagc 1560

gcggacttca tgtacggtaa tgcgcaggac cgtttcacgc actccaccat gatgaaagat 1620

caagttgacc tggttctgaa agatccggtg aaactggacg attaa 1665

<210> 20

<211> 554

<212> PRT

<213> 人工序列

<220>

<223> α-檀香萜合酶氨基酸序列

<400> 20

Met Asp His Met Ser Thr Gln Gln Val Ser Ser Glu Asn Ile Val Arg

1 5 10 15

Asn Ala Ala Asn Phe His Pro Asn Ile Trp Gly Asn His Phe Leu Thr

20 25 30

Cys Pro Ser Gln Thr Ile Asp Ser Trp Thr Gln Gln His His Lys Glu

35 40 45

Leu Lys Glu Glu Val Arg Lys Met Met Val Ser Asp Ala Asn Lys Pro

50 55 60

Ala Gln Arg Leu Arg Leu Ile Asp Thr Val Gln Arg Leu Gly Val Ala

65 70 75 80

Tyr His Phe Glu Lys Glu Ile Asp Asp Ala Leu Glu Lys Ile Gly His

85 90 95

Asp Pro Phe Asp Asp Lys Asp Asp Leu Tyr Ile Val Ser Leu Cys Phe

100 105 110

Arg Leu Leu Arg Gln His Gly Ile Lys Ile Ser Cys Asp Val Phe Glu

115 120 125

Lys Phe Lys Asp Asp Asp Gly Lys Phe Lys Ala Ser Leu Met Asn Asp

130 135 140

Val Gln Gly Met Leu Ser Leu Tyr Glu Ala Ala His Leu Ala Ile His

145 150 155 160

Gly Glu Asp Ile Leu Asp Glu Ala Ile Val Phe Thr Thr Thr His Leu

165 170 175

Lys Ser Thr Val Ser Asn Ser Pro Val Asn Ser Thr Phe Ala Glu Gln

180 185 190

Ile Arg His Ser Leu Arg Val Pro Leu Arg Lys Ala Val Pro Arg Leu

195 200 205

Glu Ser Arg Tyr Phe Leu Asp Ile Tyr Ser Arg Asp Asp Leu His Asp

210 215 220

Lys Thr Leu Leu Asn Phe Ala Lys Leu Asp Phe Asn Ile Leu Gln Ala

225 230 235 240

Met His Gln Lys Glu Ala Ser Glu Met Thr Arg Trp Trp Arg Asp Phe

245 250 255

Asp Phe Leu Lys Lys Leu Pro Tyr Ile Arg Asp Arg Val Val Glu Leu

260 265 270

Tyr Phe Trp Ile Leu Val Gly Val Ser Tyr Gln Pro Lys Phe Ser Thr

275 280 285

Gly Arg Ile Phe Leu Ser Lys Ile Ile Cys Leu Glu Thr Leu Val Asp

290 295 300

Asp Thr Phe Asp Ala Tyr Gly Thr Phe Asp Glu Leu Ala Ile Phe Thr

305 310 315 320

Glu Ala Val Thr Arg Trp Asp Leu Gly His Arg Asp Ala Leu Pro Glu

325 330 335

Tyr Met Lys Phe Ile Phe Lys Thr Leu Ile Asp Val Tyr Ser Glu Ala

340 345 350

Glu Gln Glu Leu Ala Lys Glu Gly Arg Ser Tyr Ser Ile His Tyr Ala

355 360 365

Ile Arg Ser Phe Gln Glu Leu Val Met Lys Tyr Phe Cys Glu Ala Lys

370 375 380

Trp Leu Asn Lys Gly Tyr Val Pro Ser Leu Asp Asp Tyr Lys Ser Val

385 390 395 400

Ser Leu Arg Ser Ile Gly Phe Leu Pro Ile Ala Val Ala Ser Phe Val

405 410 415

Phe Met Gly Asp Ile Ala Thr Lys Glu Val Phe Glu Trp Glu Met Asn

420 425 430

Asn Pro Lys Ile Ile Ile Ala Ala Glu Thr Ile Phe Arg Phe Leu Asp

435 440 445

Asp Ile Ala Gly His Arg Phe Glu Gln Lys Arg Glu His Ser Pro Ser

450 455 460

Ala Ile Glu Cys Tyr Lys Asn Gln His Gly Val Ser Glu Glu Glu Ala

465 470 475 480

Val Lys Ala Leu Ser Leu Glu Val Ala Asn Ser Trp Lys Asp Ile Asn

485 490 495

Glu Glu Leu Leu Leu Asn Pro Met Ala Ile Pro Leu Pro Leu Leu Gln

500 505 510

Val Ile Leu Asp Leu Ser Arg Ser Ala Asp Phe Met Tyr Gly Asn Ala

515 520 525

Gln Asp Arg Phe Thr His Ser Thr Met Met Lys Asp Gln Val Asp Leu

530 535 540

Val Leu Lys Asp Pro Val Lys Leu Asp Asp

545 550

<210> 21

<211> 1728

<212> DNA

<213> 人工序列

<220>

<223> SaTps8201-1-FL_optEc (α-檀香萜合酶优化的全长cDNA)包括RBS区域和限制位点

<400> 21

aggaggtaaa acatatggac agcagcaccg ccaccgcaat gaccgcacca ttcatcgacc 60

cgacggatca tgtgaatctg aaaaccgaca cggatgcgag cgaaaatcgt cgtatgggta 120

actacaagcc gagcatttgg aactacgatt ttctgcagtc cctggcgacg caccacaaca 180

ttgttgaaga gcgtcacctg aagctggcag agaaactgaa aggtcaagtg aaattcatgt 240

tcggtgcgcc gatggagcca ttggctaagt tggagctggt tgatgtggtg caacgcttgg 300

gtctgaacca cctgttcgag actgaaatca aagaagctct gttcagcatc tacaaagatg 360

gcagcaatgg ctggtggttt ggccatctgc atgctacctc tttgcgcttc cgtctgttgc 420

gccaatgtgg cctgtttatc ccgcaggacg ttttcaaaac ctttcaaaac aagaccggtg 480

agtttgacat gaagctgtgg gacaacgtta agggcctgct gagcctgtac gaggcgagct 540

acctgggctg gaagggcgag aacatcttgg atgaagcaaa ggcgttcacg accaagtgcc 600

tgaagagcgc atgggagaac attagcgaga agtggctggc gaagcgtgtt aaacatgcgt 660

tggcgctgcc gctgcactgg cgtgttccgc gtattgaagc acgctggttt atcgaggtgt 720

acgaacaaga ggccaatatg aatccgacgc tgctgaaact ggcgaaactg gacttcaaca 780

tggtccaaag cattcaccag aaagaaatcg gtgaactggc ccgctggtgg gttactaccg 840

gcctggacaa gctggatttc gcacgcaaca atctgttgca gtcttatatg tggagctgcg 900

ccatcgcgtc cgacccgaaa ttcaaactgg cgcgtgaaac cattgtcgag atcggttccg 960

tgttgacggt tgtcgacgac ggctatgatg tgtacggttc tatggatgag ctggacctgt 1020

acaccagctc ggtggagcgt tggtcctgtg tcaaaattga caagctgcct aatacgctga 1080

agctgatctt tatgtctatg ttcaacaaaa ccaacgaggt gggtctgcgt gttcaacacg 1140

agcgtggtta caatagcatc ccgaccttca ttaaggcgtg ggtggaacag tgtaagagct 1200

atcaaaaaga ggcgcgttgg tttcatggtg gtcacacgcc tccgctggaa gaatacagcc 1260

tgaacggtct ggtcagcatt ggttttccgc tgttgctgat caccggctat gttgcgattg 1320

ctgagaatga agcagccctg gataaagtcc acccgctgcc ggacctgctg cattattcca 1380

gcttgctgag ccgtctgatt aatgatatcg gcactagccc ggatgaaatg gcgcgtggtg 1440

acaatctgaa gagcattcac tgctatatga atgaaaccgg tgccagcgaa gaggtcgcac 1500

gcgagcacat caaaggcgtc atcgaagaga attggaaaat tctgaaccag tgttgctttg 1560

accagtccca gttccaggag ccgttcatca cgtttaacct gaacagcgtg cgcggctcgc 1620

atttcttcta tgaatttggt gatggttttg gtgttaccga cagctggacc aaggtggata 1680

tgaaaagcgt cctgattgat ccgattccgc tgggtgaaga gtaagctt 1728

<210> 22

<211> 569

<212> PRT

<213> 人工序列

<220>

<223> SaTps8201-1-FL (α/β-檀香萜合酶全长)氨基酸序列

<400> 22

Met Asp Ser Ser Thr Ala Thr Ala Met Thr Ala Pro Phe Ile Asp Pro

1 5 10 15

Thr Asp His Val Asn Leu Lys Thr Asp Thr Asp Ala Ser Glu Asn Arg

20 25 30

Arg Met Gly Asn Tyr Lys Pro Ser Ile Trp Asn Tyr Asp Phe Leu Gln

35 40 45

Ser Leu Ala Thr His His Asn Ile Val Glu Glu Arg His Leu Lys Leu

50 55 60

Ala Glu Lys Leu Lys Gly Gln Val Lys Phe Met Phe Gly Ala Pro Met

65 70 75 80

Glu Pro Leu Ala Lys Leu Glu Leu Val Asp Val Val Gln Arg Leu Gly

85 90 95

Leu Asn His Leu Phe Glu Thr Glu Ile Lys Glu Ala Leu Phe Ser Ile

100 105 110

Tyr Lys Asp Gly Ser Asn Gly Trp Trp Phe Gly His Leu His Ala Thr

115 120 125

Ser Leu Arg Phe Arg Leu Leu Arg Gln Cys Gly Leu Phe Ile Pro Gln

130 135 140

Asp Val Phe Lys Thr Phe Gln Asn Lys Thr Gly Glu Phe Asp Met Lys

145 150 155 160

Leu Trp Asp Asn Val Lys Gly Leu Leu Ser Leu Tyr Glu Ala Ser Tyr

165 170 175

Leu Gly Trp Lys Gly Glu Asn Ile Leu Asp Glu Ala Lys Ala Phe Thr

180 185 190

Thr Lys Cys Leu Lys Ser Ala Trp Glu Asn Ile Ser Glu Lys Trp Leu

195 200 205

Ala Lys Arg Val Lys His Ala Leu Ala Leu Pro Leu His Trp Arg Val

210 215 220

Pro Arg Ile Glu Ala Arg Trp Phe Ile Glu Val Tyr Glu Gln Glu Ala

225 230 235 240

Asn Met Asn Pro Thr Leu Leu Lys Leu Ala Lys Leu Asp Phe Asn Met

245 250 255

Val Gln Ser Ile His Gln Lys Glu Ile Gly Glu Leu Ala Arg Trp Trp

260 265 270

Val Thr Thr Gly Leu Asp Lys Leu Asp Phe Ala Arg Asn Asn Leu Leu

275 280 285

Gln Ser Tyr Met Trp Ser Cys Ala Ile Ala Ser Asp Pro Lys Phe Lys

290 295 300

Leu Ala Arg Glu Thr Ile Val Glu Ile Gly Ser Val Leu Thr Val Val

305 310 315 320

Asp Asp Gly Tyr Asp Val Tyr Gly Ser Met Asp Glu Leu Asp Leu Tyr

325 330 335

Thr Ser Ser Val Glu Arg Trp Ser Cys Val Lys Ile Asp Lys Leu Pro

340 345 350

Asn Thr Leu Lys Leu Ile Phe Met Ser Met Phe Asn Lys Thr Asn Glu

355 360 365

Val Gly Leu Arg Val Gln His Glu Arg Gly Tyr Asn Ser Ile Pro Thr

370 375 380

Phe Ile Lys Ala Trp Val Glu Gln Cys Lys Ser Tyr Gln Lys Glu Ala

385 390 395 400

Arg Trp Phe His Gly Gly His Thr Pro Pro Leu Glu Glu Tyr Ser Leu

405 410 415

Asn Gly Leu Val Ser Ile Gly Phe Pro Leu Leu Leu Ile Thr Gly Tyr

420 425 430

Val Ala Ile Ala Glu Asn Glu Ala Ala Leu Asp Lys Val His Pro Leu

435 440 445

Pro Asp Leu Leu His Tyr Ser Ser Leu Leu Ser Arg Leu Ile Asn Asp

450 455 460

Ile Gly Thr Ser Pro Asp Glu Met Ala Arg Gly Asp Asn Leu Lys Ser

465 470 475 480

Ile His Cys Tyr Met Asn Glu Thr Gly Ala Ser Glu Glu Val Ala Arg

485 490 495

Glu His Ile Lys Gly Val Ile Glu Glu Asn Trp Lys Ile Leu Asn Gln

500 505 510

Cys Cys Phe Asp Gln Ser Gln Phe Gln Glu Pro Phe Ile Thr Phe Asn

515 520 525

Leu Asn Ser Val Arg Gly Ser His Phe Phe Tyr Glu Phe Gly Asp Gly

530 535 540

Phe Gly Val Thr Asp Ser Trp Thr Lys Val Asp Met Lys Ser Val Leu

545 550 555 560

Ile Asp Pro Ile Pro Leu Gly Glu Glu

565

<210> 23

<211> 5361

<212> DNA

<213> 人工序列

<220>

<223> 包含CYP71AV-P2、CPRm以及在3'和5'末端包括NdeI和HindIII限制性位点的ClASS的合成操纵子的DNA 序列

<400> 23

catatggctc tgttattagc agttttttgg tcggcgctta taatcctcgt agtaacctac 60

accatatccc tcctaatcaa ccaatggcga aaaccgaaac cccaagggaa gttccccccg 120

ggcccatggc gtctgccgat tatcggtcac atgcaccatt tgatcggcac catgccgcat 180

cgtggtgtta tggaactggc ccgtaagcat ggcagcctga tgcacctgca actgggtgaa 240

gtctctacga ttgttgtcag cagcccgcgt tgggcgaaag aggtcttgac cacctatgat 300

atcaccttcg ccaatcgccc ggaaaccctg actggcgaga tcgtcgcata ccacaacacg 360

gatatcgtcc tggcgccgta tggtgagtat tggcgtcaac tgcgtaaact gtgcacgctg 420

gagctgctga gcaacaagaa agtgaagagc ttccagagcc tgcgcgaaga agagtgttgg 480

aacctggtca aggacatccg cagcaccggc caaggtagcc caatcaatct gtcggagaac 540

attttcaaga tgattgcgac gattctgagc cgtgctgcgt tcggtaaggg tattaaggat 600

caaatgaagt ttaccgaact ggtgaaagaa atcctgcgtc tgaccggcgg ttttgatgtc 660

gctgacatct tccctagcaa gaagttgctg caccacctga gcggcaagcg tgcaaaactg 720

accaatatcc ataacaagct ggataatctg atcaataaca tcatcgcaga gcacccgggc 780

aaccgtacct cgtcctccca ggaaacgctg ctggacgttc tgctgcgcct gaaagagtct 840

gcggagtttc cgctgaccgc cgacaacgtt aaagcagtga tcctggatat gttcggcgct 900

ggtacggata ccagcagcgc gacgatcgag tgggcgatta gcgagctgat tcgctgccct 960

cgcgcgatgg agaaagtgca gacggaattg cgtcaggcac tgaatggcaa agagcgtatt 1020

caggaagagg atttgcagga gctgaattat ctgaagctgg tgattaaaga aaccctgcgc 1080

ctgcatccgc cgttgccgct ggtgatgccg cgtgagtgcc gtgaaccgtg tgttttgggc 1140

ggttacgaca ttccgagcaa aacgaagctg atcgttaatg ttttcgcgat taaccgtgac 1200

ccggaatact ggaaagacgc ggaaacgttt atgccggagc gttttgagaa tagcccgatt 1260

accgttatgg gttccgagta cgaatacctg ccatttggtg ctggtcgtcg tatgtgtcct 1320

ggtgcagcgc tgggtctggc caacgtggaa ctgccgctgg cgcacattct gtactatttc 1380

aactggaaac tgccgaacgg caagaccttc gaagatttgg acatgaccga gagctttggt 1440

gccactgtgc agcgcaaaac cgagctgctg ctggttccga ccgactttca aacgctgact 1500

gcgagcacct aatgagtcga ctaactttaa gaaggagata tatccatgga acctagctct 1560

cagaaactgt ctccgttgga atttgttgct gctatcctga agggcgacta cagcagcggt 1620

caggttgaag gtggtccacc gccaggtctg gcagctatgt tgatggaaaa taaggatttg 1680

gtgatggttc tgacgacgtc cgtggcagtc ctgatcggct gtgtcgtggt cctggcatgg 1740

cgtcgtgcgg caggtagcgg taagtacaag caacctgaac tgcctaaact ggtggtcccg 1800

aaagcagccg aaccggagga ggcagaggat gataaaacca agatcagcgt gtttttcggc 1860

acccaaaccg gtacggcaga aggtttcgcg aaggcttttg ttgaagaggc caaggcgcgt 1920

tatcagcagg cccgtttcaa agttatcgac ctggacgact atgcggcaga cgatgacgag 1980

tacgaagaga aactgaagaa ggaaaacttg gcattcttct tcttggcgtc ctacggtgac 2040

ggcgagccga cggacaacgc ggcacgcttt tacaaatggt ttacggaggg taaggaccgt 2100

ggtgaatggc tgaacaatct gcagtacggc gtttttggtc tgggtaaccg tcaatatgag 2160

catttcaata agatcgccat tgtcgtcgat gatctgatct tcgagcaagg tggcaagaag 2220

ctggttccgg tgggtctggg tgacgatgac cagtgcattg aggatgattt tgcggcgtgg 2280

cgtgaactgg tctggccgga actggataaa ctgctgcgta acgaagacga cgctaccgtg 2340

gcaaccccgt acagcgccgc tgtgctgcaa taccgcgtgg ttttccacga tcacattgac 2400

ggcctgatta gcgaaaacgg tagcccgaac ggtcatgcta atggcaatac cgtgtacgat 2460

gcgcaacacc cgtgccgtag caacgtcgcg gtcaagaagg aattgcatac tccggcgagc 2520

gatcgcagct gcacccacct ggaatttaac attagcggta ccggcctgat gtacgagacg 2580

ggtgaccacg tcggtgtgta ttgcgagaac ctgttggaaa ccgtggagga ggccgagaag 2640

ttgttgaacc tgagcccgca gacgtacttc tccgttcaca ccgacaacga ggacggtacg 2700

ccgttgagcg gcagcagcct gccgccaccg tttccgccgt gcaccttgcg cacggcattg 2760

accaaatacg cagacttgac ttctgcaccg aaaaagtcgg tgctggtggc gctggccgag 2820

tacgcatctg accagggtga agcggatcgt ttgcgtttct tggcgagccc gagcggcaaa 2880

gaggaatatg cacagtacat cttggcaagc cagcgcacgc tgctggaggt catggcggag 2940

ttcccgtcgg cgaaaccgcc gctgggtgtc tttttcgcgg gtgtcgctcc gcgcctgcag 3000

ccgcgtttct attccattag ctctagcccg aagatcgcac cgttccgtat tcacgtgacc 3060

tgcgccctgg tttatgacaa atcccctacc ggtcgcgttc ataagggcat ctgtagcacg 3120

tggatgaaaa atgcggtccc gctggaagaa agcaacgatt gttcctgggc tccgatcttc 3180

gtccgcaaca gcaacttcaa gctgccgacc gacccgaagg ttccgattat catgattggt 3240

ccgggtaccg gtctggcccc ttttcgtggc tttttgcaag agcgcttggc gttgaaagag 3300

agcggtgctg aattgggtcc ggcgatcttg ttctttggtt gccgtaaccg taaaatggac 3360

tttatttacg aggatgaact gaatgatttc gtcaaagcgg gcgttgtcag cgagctgatc 3420

gtcgctttta gccgcgaagg cccgatgaaa gaatacgtgc aacacaaaat gagccaacgt 3480

gcctccgatg tgtggaacat cattagcgac ggtggttatg tttatgtttg cggtgacgcg 3540

aagggtatgg ctcgtgatgt tcaccgtacc ctgcatacca tcgcacagga gcaaggtagc 3600

atgtccagct cggaggccga aggtatggtc aaaaacctgc aaaccaccgg tcgttacctg 3660

cgtgatgtgt ggtaataaaa gcttgaagga gatatactaa tgtctaccca gcaggttagc 3720

tccgagaata tcgttcgcaa cgcggcgaac ttccacccga atatctgggg taatcatttc 3780

ttgacgtgtc caagccagac gatcgattct tggacgcaac aacaccataa agagctgaaa 3840

gaagaggtcc gcaagatgat ggtgagcgac gcaaacaaac cggcacaacg tctgcgtctg 3900

attgacaccg ttcaacgttt gggcgtggcg tatcatttcg aaaaagaaat cgatgacgct 3960

ctggaaaaga tcggtcacga tccgtttgac gataaggatg acctgtatat cgttagcctg 4020

tgttttcgcc tgctgcgtca gcatggcatc aagattagct gcgatgtttt tgagaagttc 4080

aaagacgacg atggcaagtt taaggcttcc ctgatgaatg atgtccaagg tatgctgtcg 4140

ttgtatgaag cggcccacct ggcaattcat ggcgaggaca tcctggatga ggctattgtc 4200

tttacgacca cccacctgaa gagcaccgtt tctaactccc cggtcaattc cacctttgcg 4260

gaacagattc gccacagcct gcgtgtgccg ctgcgtaagg cagtcccgcg tttggagagc 4320

cgctacttcc tggatatcta tagccgtgac gacctgcacg acaagactct gctgaacttt 4380

gccaaactgg acttcaacat cctgcaggcg atgcaccaga aagaggcaag cgagatgacc 4440

cgttggtggc gtgatttcga tttcctgaag aagctgccgt acattcgtga tcgcgtggtt 4500

gaactgtact tttggatttt ggtcggtgtg agctaccaac cgaaattcag cacgggtcgt 4560

atctttttga gcaagattat ctgtctggaa accctggtgg acgacacgtt tgatgcgtac 4620

ggtactttcg acgaactggc cattttcacc gaggccgtta cgcgttggga cctgggtcat 4680

cgcgacgcgc tgcctgagta catgaaattc attttcaaga ccctgattga tgtgtacagc 4740

gaggcggaac aagagctggc aaaagagggc cgctcctata gcattcacta tgcgatccgt 4800

agcttccagg agttggtcat gaagtacttt tgcgaggcga aatggctgaa taagggttat 4860

gttccgagcc tggatgacta caagagcgtc agcctgcgca gcatcggctt cctgccgatc 4920

gccgtggctt cttttgtttt catgggcgac attgctacga aagaggtttt tgagtgggaa 4980

atgaataacc cgaaaatcat catcgcagcc gaaaccattt tccgctttct ggatgacatt 5040

gcaggtcatc gcttcgaaca aaaacgtgag cacagcccga gcgcaatcga gtgctacaaa 5100

aaccaacatg gtgtctcgga agaagaggca gtgaaagcgc tgagcttgga ggtcgccaat 5160

tcgtggaaag acattaacga agagctgctg ctgaacccta tggcaattcc actgccgttg 5220

ctgcaggtga tcctggattt gagccgtagc gcggacttca tgtacggtaa tgcgcaggac 5280

cgtttcacgc actccaccat gatgaaagat caagttgacc tggttctgaa agatccggtg 5340

aaactggacg attaagaatt c 5361

<210> 24

<211> 5414

<212> DNA

<213> 人工序列

<220>

<223> 包含CYP71AV-P2、CPRm以及在3'和5'末端包括NdeI和HindIII限制性位点的SaSAS的合成操纵子的DNA 序列

<400> 24

catatggctc tgttattagc agttttttgg tcggcgctta taatcctcgt agtaacctac 60

accatatccc tcctaatcaa ccaatggcga aaaccgaaac cccaagggaa gttccccccg 120

ggcccatggc gtctgccgat tatcggtcac atgcaccatt tgatcggcac catgccgcat 180

cgtggtgtta tggaactggc ccgtaagcat ggcagcctga tgcacctgca actgggtgaa 240

gtctctacga ttgttgtcag cagcccgcgt tgggcgaaag aggtcttgac cacctatgat 300

atcaccttcg ccaatcgccc ggaaaccctg actggcgaga tcgtcgcata ccacaacacg 360

gatatcgtcc tggcgccgta tggtgagtat tggcgtcaac tgcgtaaact gtgcacgctg 420

gagctgctga gcaacaagaa agtgaagagc ttccagagcc tgcgcgaaga agagtgttgg 480

aacctggtca aggacatccg cagcaccggc caaggtagcc caatcaatct gtcggagaac 540

attttcaaga tgattgcgac gattctgagc cgtgctgcgt tcggtaaggg tattaaggat 600

caaatgaagt ttaccgaact ggtgaaagaa atcctgcgtc tgaccggcgg ttttgatgtc 660

gctgacatct tccctagcaa gaagttgctg caccacctga gcggcaagcg tgcaaaactg 720

accaatatcc ataacaagct ggataatctg atcaataaca tcatcgcaga gcacccgggc 780

aaccgtacct cgtcctccca ggaaacgctg ctggacgttc tgctgcgcct gaaagagtct 840

gcggagtttc cgctgaccgc cgacaacgtt aaagcagtga tcctggatat gttcggcgct 900

ggtacggata ccagcagcgc gacgatcgag tgggcgatta gcgagctgat tcgctgccct 960

cgcgcgatgg agaaagtgca gacggaattg cgtcaggcac tgaatggcaa agagcgtatt 1020

caggaagagg atttgcagga gctgaattat ctgaagctgg tgattaaaga aaccctgcgc 1080

ctgcatccgc cgttgccgct ggtgatgccg cgtgagtgcc gtgaaccgtg tgttttgggc 1140

ggttacgaca ttccgagcaa aacgaagctg atcgttaatg ttttcgcgat taaccgtgac 1200

ccggaatact ggaaagacgc ggaaacgttt atgccggagc gttttgagaa tagcccgatt 1260

accgttatgg gttccgagta cgaatacctg ccatttggtg ctggtcgtcg tatgtgtcct 1320

ggtgcagcgc tgggtctggc caacgtggaa ctgccgctgg cgcacattct gtactatttc 1380

aactggaaac tgccgaacgg caagaccttc gaagatttgg acatgaccga gagctttggt 1440

gccactgtgc agcgcaaaac cgagctgctg ctggttccga ccgactttca aaccctgact 1500

gcaagcacct aatgagtcga ctaactttaa gaaggagata tatccatgga acctagctct 1560

cagaaactgt ctccgttgga atttgttgct gctatcctga agggcgacta cagcagcggt 1620

caggttgaag gtggtccacc gccaggtctg gcagctatgt tgatggaaaa taaggatttg 1680

gtgatggttc tgacgacgtc cgtggcagtc ctgatcggct gtgtcgtggt cctggcatgg 1740

cgtcgtgcgg caggtagcgg taagtacaag caacctgaac tgcctaaact ggtggtcccg 1800

aaagcagccg aaccggagga ggcagaggat gataaaacca agatcagcgt gtttttcggc 1860

acccaaaccg gtacggcaga aggtttcgcg aaggcttttg ttgaagaggc caaggcgcgt 1920

tatcagcagg cccgtttcaa agttatcgac ctggacgact atgcggcaga cgatgacgag 1980

tacgaagaga aactgaagaa ggaaaacttg gcattcttct tcttggcgtc ctacggtgac 2040

ggcgagccga cggacaacgc ggcacgcttt tacaaatggt ttacggaggg taaggaccgt 2100

ggtgaatggc tgaacaatct gcagtacggc gtttttggtc tgggtaaccg tcaatatgag 2160

catttcaata agatcgccat tgtcgtcgat gatctgatct tcgagcaagg tggcaagaag 2220

ctggttccgg tgggtctggg tgacgatgac cagtgcattg aggatgattt tgcggcgtgg 2280

cgtgaactgg tctggccgga actggataaa ctgctgcgta acgaagacga cgctaccgtg 2340

gcaaccccgt acagcgccgc tgtgctgcaa taccgcgtgg ttttccacga tcacattgac 2400

ggcctgatta gcgaaaacgg tagcccgaac ggtcatgcta atggcaatac cgtgtacgat 2460

gcgcaacacc cgtgccgtag caacgtcgcg gtcaagaagg aattgcatac tccggcgagc 2520

gatcgcagct gcacccacct ggaatttaac attagcggta ccggcctgat gtacgagacg 2580

ggtgaccacg tcggtgtgta ttgcgagaac ctgttggaaa ccgtggagga ggccgagaag 2640

ttgttgaacc tgagcccgca gacgtacttc tccgttcaca ccgacaacga ggacggtacg 2700

ccgttgagcg gcagcagcct gccgccaccg tttccgccgt gcaccttgcg cacggcattg 2760

accaaatacg cagacttgac ttctgcaccg aaaaagtcgg tgctggtggc gctggccgag 2820

tacgcatctg accagggtga agcggatcgt ttgcgtttct tggcgagccc gagcggcaaa 2880

gaggaatatg cacagtacat cttggcaagc cagcgcacgc tgctggaggt catggcggag 2940

ttcccgtcgg cgaaaccgcc gctgggtgtc tttttcgcgg gtgtcgctcc gcgcctgcag 3000

ccgcgtttct attccattag ctctagcccg aagatcgcac cgttccgtat tcacgtgacc 3060

tgcgccctgg tttatgacaa atcccctacc ggtcgcgttc ataagggcat ctgtagcacg 3120

tggatgaaaa atgcggtccc gctggaagaa agcaacgatt gttcctgggc tccgatcttc 3180

gtccgcaaca gcaacttcaa gctgccgacc gacccgaagg ttccgattat catgattggt 3240

ccgggtaccg gtctggcccc ttttcgtggc tttttgcaag agcgcttggc gttgaaagag 3300

agcggtgctg aattgggtcc ggcgatcttg ttctttggtt gccgtaaccg taaaatggac 3360

tttatttacg aggatgaact gaatgatttc gtcaaagcgg gcgttgtcag cgagctgatc 3420

gtcgctttta gccgcgaagg cccgatgaaa gaatacgtgc aacacaaaat gagccaacgt 3480

gcctccgatg tgtggaacat cattagcgac ggtggttatg tttatgtttg cggtgacgcg 3540

aagggtatgg ctcgtgatgt tcaccgtacc ctgcatacca tcgcacagga gcaaggtagc 3600

atgtccagct cggaggccga aggtatggtc aaaaacctgc aaaccaccgg tcgttacctg 3660

cgtgatgtgt ggtaataaaa gcttaggagg taaaacatat ggacagcagc accgccaccg 3720

caatgaccgc accattcatc gacccgacgg atcatgtgaa tctgaaaacc gacacggatg 3780

cgagcgaaaa tcgtcgtatg ggtaactaca agccgagcat ttggaactac gattttctgc 3840

agtccctggc gacgcaccac aacattgttg aagagcgtca cctgaagctg gcagagaaac 3900

tgaaaggtca agtgaaattc atgttcggtg cgccgatgga gccattggct aagttggagc 3960

tggttgatgt ggtgcaacgc ttgggtctga accacctgtt cgagactgaa atcaaagaag 4020

ctctgttcag catctacaaa gatggcagca atggctggtg gtttggccat ctgcatgcta 4080

cctctttgcg cttccgtctg ttgcgccaat gtggcctgtt tatcccgcag gacgttttca 4140

aaacctttca aaacaagacc ggtgagtttg acatgaagct gtgggacaac gttaagggcc 4200

tgctgagcct gtacgaggcg agctacctgg gctggaaggg cgagaacatc ttggatgaag 4260

caaaggcgtt cacgaccaag tgcctgaaga gcgcatggga gaacattagc gagaagtggc 4320

tggcgaagcg tgttaaacat gcgttggcgc tgccgctgca ctggcgtgtt ccgcgtattg 4380

aagcacgctg gtttatcgag gtgtacgaac aagaggccaa tatgaatccg acgctgctga 4440

aactggcgaa actggacttc aacatggtcc aaagcattca ccagaaagaa atcggtgaac 4500

tggcccgctg gtgggttact accggcctgg acaagctgga tttcgcacgc aacaatctgt 4560

tgcagtctta tatgtggagc tgcgccatcg cgtccgaccc gaaattcaaa ctggcgcgtg 4620

aaaccattgt cgagatcggt tccgtgttga cggttgtcga cgacggctat gatgtgtacg 4680

gttctatgga tgagctggac ctgtacacca gctcggtgga gcgttggtcc tgtgtcaaaa 4740

ttgacaagct gcctaatacg ctgaagctga tctttatgtc tatgttcaac aaaaccaacg 4800

aggtgggtct gcgtgttcaa cacgagcgtg gttacaatag catcccgacc ttcattaagg 4860

cgtgggtgga acagtgtaag agctatcaaa aagaggcgcg ttggtttcat ggtggtcaca 4920

cgcctccgct ggaagaatac agcctgaacg gtctggtcag cattggtttt ccgctgttgc 4980

tgatcaccgg ctatgttgcg attgctgaga atgaagcagc cctggataaa gtccacccgc 5040

tgccggacct gctgcattat tccagcttgc tgagccgtct gattaatgat atcggcacta 5100

gcccggatga aatggcgcgt ggtgacaatc tgaagagcat tcactgctat atgaatgaaa 5160

ccggtgccag cgaagaggtc gcacgcgagc acatcaaagg cgtcatcgaa gagaattgga 5220

aaattctgaa ccagtgttgc tttgaccagt cccagttcca ggagccgttc atcacgttta 5280

acctgaacag cgtgcgcggc tcgcatttct tctatgaatt tggtgatggt tttggtgtta 5340

ccgacagctg gaccaaggtg gatatgaaaa gcgtcctgat tgatccgatt ccgctgggtg 5400

aagagtaagc ttgc 5414

<210> 25

<211> 5361

<212> DNA

<213> 人工序列

<220>

<223> 包含CYP71AV-P2O、CPRm以及在3'和5'末端包括NdeI和HindIII限制性位点的ClASS的合成操纵子的DNA序列

<400> 25

catatggcac tgttgctggc tgtcttttgg tctgctctga ttattttggt ggttacctac 60

accatctccc tgctgattaa ccagtggcgt aaaccgaaac cacagggtaa attcccgccg 120

ggtccgtggc gtctgccgat tatcggtcac atgcaccatt tgatcggcac catgccgcat 180

cgtggtgtta tggaactggc ccgtaagcat ggcagcctga tgcacctgca actgggtgaa 240

gtctctacga ttgttgtcag cagcccgcgt tgggcgaaag aggtcttgac cacctatgat 300

atcaccttcg ccaatcgccc ggaaaccctg actggcgaga tcgtcgcata ccacaacacg 360

gatatcgtcc tggcgccgta tggtgagtat tggcgtcaac tgcgtaaact gtgcacgctg 420

gagctgctga gcaacaagaa agtgaagagc ttccagagcc tgcgcgaaga agagtgttgg 480

aacctggtca aggacatccg cagcaccggc caaggtagcc caatcaatct gtcggagaac 540

attttcaaga tgattgcgac gattctgagc cgtgctgcgt tcggtaaggg tattaaggat 600

caaatgaagt ttaccgaact ggtgaaagaa atcctgcgtc tgaccggcgg ttttgatgtc 660

gctgacatct tccctagcaa gaagttgctg caccacctga gcggcaagcg tgcaaaactg 720

accaatatcc ataacaagct ggataatctg atcaataaca tcatcgcaga gcacccgggc 780

aaccgtacct cgtcctccca ggaaacgctg ctggacgttc tgctgcgcct gaaagagtct 840

gcggagtttc cgctgaccgc cgacaacgtt aaagcagtga tcctggatat gttcggcgct 900

ggtacggata ccagcagcgc gacgatcgag tgggcgatta gcgagctgat tcgctgccct 960

cgcgcgatgg agaaagtgca gacggaattg cgtcaggcac tgaatggcaa agagcgtatt 1020

caggaagagg atttgcagga gctgaattat ctgaagctgg tgattaaaga aaccctgcgc 1080

ctgcatccgc cgttgccgct ggtgatgccg cgtgagtgcc gtgaaccgtg tgttttgggc 1140

ggttacgaca ttccgagcaa aacgaagctg atcgttaatg ttttcgcgat taaccgtgac 1200

ccggaatact ggaaagacgc ggaaacgttt atgccggagc gttttgagaa tagcccgatt 1260

accgttatgg gttccgagta cgaatacctg ccatttggtg ctggtcgtcg tatgtgtcct 1320

ggtgcagcgc tgggtctggc caacgtggaa ctgccgctgg cgcacattct gtactatttc 1380

aactggaaac tgccgaacgg caagaccttc gaagatttgg acatgaccga gagctttggt 1440

gccactgtgc agcgcaaaac cgagctgctg ctggttccga ccgactttca aacgctgact 1500

gcgagcacct aatgagtcga ctaactttaa gaaggagata tatccatgga acctagctct 1560

cagaaactgt ctccgttgga atttgttgct gctatcctga agggcgacta cagcagcggt 1620

caggttgaag gtggtccacc gccaggtctg gcagctatgt tgatggaaaa taaggatttg 1680

gtgatggttc tgacgacgtc cgtggcagtc ctgatcggct gtgtcgtggt cctggcatgg 1740

cgtcgtgcgg caggtagcgg taagtacaag caacctgaac tgcctaaact ggtggtcccg 1800

aaagcagccg aaccggagga ggcagaggat gataaaacca agatcagcgt gtttttcggc 1860

acccaaaccg gtacggcaga aggtttcgcg aaggcttttg ttgaagaggc caaggcgcgt 1920

tatcagcagg cccgtttcaa agttatcgac ctggacgact atgcggcaga cgatgacgag 1980

tacgaagaga aactgaagaa ggaaaacttg gcattcttct tcttggcgtc ctacggtgac 2040

ggcgagccga cggacaacgc ggcacgcttt tacaaatggt ttacggaggg taaggaccgt 2100

ggtgaatggc tgaacaatct gcagtacggc gtttttggtc tgggtaaccg tcaatatgag 2160

catttcaata agatcgccat tgtcgtcgat gatctgatct tcgagcaagg tggcaagaag 2220

ctggttccgg tgggtctggg tgacgatgac cagtgcattg aggatgattt tgcggcgtgg 2280

cgtgaactgg tctggccgga actggataaa ctgctgcgta acgaagacga cgctaccgtg 2340

gcaaccccgt acagcgccgc tgtgctgcaa taccgcgtgg ttttccacga tcacattgac 2400

ggcctgatta gcgaaaacgg tagcccgaac ggtcatgcta atggcaatac cgtgtacgat 2460

gcgcaacacc cgtgccgtag caacgtcgcg gtcaagaagg aattgcatac tccggcgagc 2520

gatcgcagct gcacccacct ggaatttaac attagcggta ccggcctgat gtacgagacg 2580

ggtgaccacg tcggtgtgta ttgcgagaac ctgttggaaa ccgtggagga ggccgagaag 2640

ttgttgaacc tgagcccgca gacgtacttc tccgttcaca ccgacaacga ggacggtacg 2700

ccgttgagcg gcagcagcct gccgccaccg tttccgccgt gcaccttgcg cacggcattg 2760

accaaatacg cagacttgac ttctgcaccg aaaaagtcgg tgctggtggc gctggccgag 2820

tacgcatctg accagggtga agcggatcgt ttgcgtttct tggcgagccc gagcggcaaa 2880

gaggaatatg cacagtacat cttggcaagc cagcgcacgc tgctggaggt catggcggag 2940

ttcccgtcgg cgaaaccgcc gctgggtgtc tttttcgcgg gtgtcgctcc gcgcctgcag 3000

ccgcgtttct attccattag ctctagcccg aagatcgcac cgttccgtat tcacgtgacc 3060

tgcgccctgg tttatgacaa atcccctacc ggtcgcgttc ataagggcat ctgtagcacg 3120

tggatgaaaa atgcggtccc gctggaagaa agcaacgatt gttcctgggc tccgatcttc 3180

gtccgcaaca gcaacttcaa gctgccgacc gacccgaagg ttccgattat catgattggt 3240

ccgggtaccg gtctggcccc ttttcgtggc tttttgcaag agcgcttggc gttgaaagag 3300

agcggtgctg aattgggtcc ggcgatcttg ttctttggtt gccgtaaccg taaaatggac 3360

tttatttacg aggatgaact gaatgatttc gtcaaagcgg gcgttgtcag cgagctgatc 3420

gtcgctttta gccgcgaagg cccgatgaaa gaatacgtgc aacacaaaat gagccaacgt 3480

gcctccgatg tgtggaacat cattagcgac ggtggttatg tttatgtttg cggtgacgcg 3540

aagggtatgg ctcgtgatgt tcaccgtacc ctgcatacca tcgcacagga gcaaggtagc 3600

atgtccagct cggaggccga aggtatggtc aaaaacctgc aaaccaccgg tcgttacctg 3660

cgtgatgtgt ggtaataaaa gcttgaagga gatatactaa tgtctaccca gcaggttagc 3720

tccgagaata tcgttcgcaa cgcggcgaac ttccacccga atatctgggg taatcatttc 3780

ttgacgtgtc caagccagac gatcgattct tggacgcaac aacaccataa agagctgaaa 3840

gaagaggtcc gcaagatgat ggtgagcgac gcaaacaaac cggcacaacg tctgcgtctg 3900

attgacaccg ttcaacgttt gggcgtggcg tatcatttcg aaaaagaaat cgatgacgct 3960

ctggaaaaga tcggtcacga tccgtttgac gataaggatg acctgtatat cgttagcctg 4020

tgttttcgcc tgctgcgtca gcatggcatc aagattagct gcgatgtttt tgagaagttc 4080

aaagacgacg atggcaagtt taaggcttcc ctgatgaatg atgtccaagg tatgctgtcg 4140

ttgtatgaag cggcccacct ggcaattcat ggcgaggaca tcctggatga ggctattgtc 4200

tttacgacca cccacctgaa gagcaccgtt tctaactccc cggtcaattc cacctttgcg 4260

gaacagattc gccacagcct gcgtgtgccg ctgcgtaagg cagtcccgcg tttggagagc 4320

cgctacttcc tggatatcta tagccgtgac gacctgcacg acaagactct gctgaacttt 4380

gccaaactgg acttcaacat cctgcaggcg atgcaccaga aagaggcaag cgagatgacc 4440

cgttggtggc gtgatttcga tttcctgaag aagctgccgt acattcgtga tcgcgtggtt 4500

gaactgtact tttggatttt ggtcggtgtg agctaccaac cgaaattcag cacgggtcgt 4560

atctttttga gcaagattat ctgtctggaa accctggtgg acgacacgtt tgatgcgtac 4620

ggtactttcg acgaactggc cattttcacc gaggccgtta cgcgttggga cctgggtcat 4680

cgcgacgcgc tgcctgagta catgaaattc attttcaaga ccctgattga tgtgtacagc 4740

gaggcggaac aagagctggc aaaagagggc cgctcctata gcattcacta tgcgatccgt 4800

agcttccagg agttggtcat gaagtacttt tgcgaggcga aatggctgaa taagggttat 4860

gttccgagcc tggatgacta caagagcgtc agcctgcgca gcatcggctt cctgccgatc 4920

gccgtggctt cttttgtttt catgggcgac attgctacga aagaggtttt tgagtgggaa 4980

atgaataacc cgaaaatcat catcgcagcc gaaaccattt tccgctttct ggatgacatt 5040

gcaggtcatc gcttcgaaca aaaacgtgag cacagcccga gcgcaatcga gtgctacaaa 5100

aaccaacatg gtgtctcgga agaagaggca gtgaaagcgc tgagcttgga ggtcgccaat 5160

tcgtggaaag acattaacga agagctgctg ctgaacccta tggcaattcc actgccgttg 5220

ctgcaggtga tcctggattt gagccgtagc gcggacttca tgtacggtaa tgcgcaggac 5280

cgtttcacgc actccaccat gatgaaagat caagttgacc tggttctgaa agatccggtg 5340

aaactggacg attaagaatt c 5361

<210> 26

<211> 5414

<212> DNA

<213> 人工序列

<220>

<223> 包含CYP71AV-P2O、CPRm以及在3'和5'末端包括NdeI和HindIII限制性位点的SaSAS的合成操纵子的DNA序列

<400> 26

catatggcac tgttgctggc tgtcttttgg tctgctctga ttattttggt ggttacctac 60

accatctccc tgctgattaa ccagtggcgt aaaccgaaac cacagggtaa attcccgccg 120

ggtccgtggc gtctgccgat tatcggtcac atgcaccatt tgatcggcac catgccgcat 180

cgtggtgtta tggaactggc ccgtaagcat ggcagcctga tgcacctgca actgggtgaa 240

gtctctacga ttgttgtcag cagcccgcgt tgggcgaaag aggtcttgac cacctatgat 300

atcaccttcg ccaatcgccc ggaaaccctg actggcgaga tcgtcgcata ccacaacacg 360

gatatcgtcc tggcgccgta tggtgagtat tggcgtcaac tgcgtaaact gtgcacgctg 420

gagctgctga gcaacaagaa agtgaagagc ttccagagcc tgcgcgaaga agagtgttgg 480

aacctggtca aggacatccg cagcaccggc caaggtagcc caatcaatct gtcggagaac 540

attttcaaga tgattgcgac gattctgagc cgtgctgcgt tcggtaaggg tattaaggat 600

caaatgaagt ttaccgaact ggtgaaagaa atcctgcgtc tgaccggcgg ttttgatgtc 660

gctgacatct tccctagcaa gaagttgctg caccacctga gcggcaagcg tgcaaaactg 720

accaatatcc ataacaagct ggataatctg atcaataaca tcatcgcaga gcacccgggc 780

aaccgtacct cgtcctccca ggaaacgctg ctggacgttc tgctgcgcct gaaagagtct 840

gcggagtttc cgctgaccgc cgacaacgtt aaagcagtga tcctggatat gttcggcgct 900

ggtacggata ccagcagcgc gacgatcgag tgggcgatta gcgagctgat tcgctgccct 960

cgcgcgatgg agaaagtgca gacggaattg cgtcaggcac tgaatggcaa agagcgtatt 1020

caggaagagg atttgcagga gctgaattat ctgaagctgg tgattaaaga aaccctgcgc 1080

ctgcatccgc cgttgccgct ggtgatgccg cgtgagtgcc gtgaaccgtg tgttttgggc 1140

ggttacgaca ttccgagcaa aacgaagctg atcgttaatg ttttcgcgat taaccgtgac 1200

ccggaatact ggaaagacgc ggaaacgttt atgccggagc gttttgagaa tagcccgatt 1260

accgttatgg gttccgagta cgaatacctg ccatttggtg ctggtcgtcg tatgtgtcct 1320

ggtgcagcgc tgggtctggc caacgtggaa ctgccgctgg cgcacattct gtactatttc 1380

aactggaaac tgccgaacgg caagaccttc gaagatttgg acatgaccga gagctttggt 1440

gccactgtgc agcgcaaaac cgagctgctg ctggttccga ccgactttca aacgctgact 1500

gcgagcacct aatgagtcga ctaactttaa gaaggagata tatccatgga acctagctct 1560

cagaaactgt ctccgttgga atttgttgct gctatcctga agggcgacta cagcagcggt 1620

caggttgaag gtggtccacc gccaggtctg gcagctatgt tgatggaaaa taaggatttg 1680

gtgatggttc tgacgacgtc cgtggcagtc ctgatcggct gtgtcgtggt cctggcatgg 1740

cgtcgtgcgg caggtagcgg taagtacaag caacctgaac tgcctaaact ggtggtcccg 1800

aaagcagccg aaccggagga ggcagaggat gataaaacca agatcagcgt gtttttcggc 1860

acccaaaccg gtacggcaga aggtttcgcg aaggcttttg ttgaagaggc caaggcgcgt 1920

tatcagcagg cccgtttcaa agttatcgac ctggacgact atgcggcaga cgatgacgag 1980

tacgaagaga aactgaagaa ggaaaacttg gcattcttct tcttggcgtc ctacggtgac 2040

ggcgagccga cggacaacgc ggcacgcttt tacaaatggt ttacggaggg taaggaccgt 2100

ggtgaatggc tgaacaatct gcagtacggc gtttttggtc tgggtaaccg tcaatatgag 2160

catttcaata agatcgccat tgtcgtcgat gatctgatct tcgagcaagg tggcaagaag 2220

ctggttccgg tgggtctggg tgacgatgac cagtgcattg aggatgattt tgcggcgtgg 2280

cgtgaactgg tctggccgga actggataaa ctgctgcgta acgaagacga cgctaccgtg 2340

gcaaccccgt acagcgccgc tgtgctgcaa taccgcgtgg ttttccacga tcacattgac 2400

ggcctgatta gcgaaaacgg tagcccgaac ggtcatgcta atggcaatac cgtgtacgat 2460

gcgcaacacc cgtgccgtag caacgtcgcg gtcaagaagg aattgcatac tccggcgagc 2520

gatcgcagct gcacccacct ggaatttaac attagcggta ccggcctgat gtacgagacg 2580

ggtgaccacg tcggtgtgta ttgcgagaac ctgttggaaa ccgtggagga ggccgagaag 2640

ttgttgaacc tgagcccgca gacgtacttc tccgttcaca ccgacaacga ggacggtacg 2700

ccgttgagcg gcagcagcct gccgccaccg tttccgccgt gcaccttgcg cacggcattg 2760

accaaatacg cagacttgac ttctgcaccg aaaaagtcgg tgctggtggc gctggccgag 2820

tacgcatctg accagggtga agcggatcgt ttgcgtttct tggcgagccc gagcggcaaa 2880

gaggaatatg cacagtacat cttggcaagc cagcgcacgc tgctggaggt catggcggag 2940

ttcccgtcgg cgaaaccgcc gctgggtgtc tttttcgcgg gtgtcgctcc gcgcctgcag 3000

ccgcgtttct attccattag ctctagcccg aagatcgcac cgttccgtat tcacgtgacc 3060

tgcgccctgg tttatgacaa atcccctacc ggtcgcgttc ataagggcat ctgtagcacg 3120

tggatgaaaa atgcggtccc gctggaagaa agcaacgatt gttcctgggc tccgatcttc 3180

gtccgcaaca gcaacttcaa gctgccgacc gacccgaagg ttccgattat catgattggt 3240

ccgggtaccg gtctggcccc ttttcgtggc tttttgcaag agcgcttggc gttgaaagag 3300

agcggtgctg aattgggtcc ggcgatcttg ttctttggtt gccgtaaccg taaaatggac 3360

tttatttacg aggatgaact gaatgatttc gtcaaagcgg gcgttgtcag cgagctgatc 3420

gtcgctttta gccgcgaagg cccgatgaaa gaatacgtgc aacacaaaat gagccaacgt 3480

gcctccgatg tgtggaacat cattagcgac ggtggttatg tttatgtttg cggtgacgcg 3540

aagggtatgg ctcgtgatgt tcaccgtacc ctgcatacca tcgcacagga gcaaggtagc 3600

atgtccagct cggaggccga aggtatggtc aaaaacctgc aaaccaccgg tcgttacctg 3660

cgtgatgtgt ggtaataaaa gcttaggagg taaaacatat ggacagcagc accgccaccg 3720

caatgaccgc accattcatc gacccgacgg atcatgtgaa tctgaaaacc gacacggatg 3780

cgagcgaaaa tcgtcgtatg ggtaactaca agccgagcat ttggaactac gattttctgc 3840

agtccctggc gacgcaccac aacattgttg aagagcgtca cctgaagctg gcagagaaac 3900

tgaaaggtca agtgaaattc atgttcggtg cgccgatgga gccattggct aagttggagc 3960

tggttgatgt ggtgcaacgc ttgggtctga accacctgtt cgagactgaa atcaaagaag 4020

ctctgttcag catctacaaa gatggcagca atggctggtg gtttggccat ctgcatgcta 4080

cctctttgcg cttccgtctg ttgcgccaat gtggcctgtt tatcccgcag gacgttttca 4140

aaacctttca aaacaagacc ggtgagtttg acatgaagct gtgggacaac gttaagggcc 4200

tgctgagcct gtacgaggcg agctacctgg gctggaaggg cgagaacatc ttggatgaag 4260

caaaggcgtt cacgaccaag tgcctgaaga gcgcatggga gaacattagc gagaagtggc 4320

tggcgaagcg tgttaaacat gcgttggcgc tgccgctgca ctggcgtgtt ccgcgtattg 4380

aagcacgctg gtttatcgag gtgtacgaac aagaggccaa tatgaatccg acgctgctga 4440

aactggcgaa actggacttc aacatggtcc aaagcattca ccagaaagaa atcggtgaac 4500

tggcccgctg gtgggttact accggcctgg acaagctgga tttcgcacgc aacaatctgt 4560

tgcagtctta tatgtggagc tgcgccatcg cgtccgaccc gaaattcaaa ctggcgcgtg 4620

aaaccattgt cgagatcggt tccgtgttga cggttgtcga cgacggctat gatgtgtacg 4680

gttctatgga tgagctggac ctgtacacca gctcggtgga gcgttggtcc tgtgtcaaaa 4740

ttgacaagct gcctaatacg ctgaagctga tctttatgtc tatgttcaac aaaaccaacg 4800

aggtgggtct gcgtgttcaa cacgagcgtg gttacaatag catcccgacc ttcattaagg 4860

cgtgggtgga acagtgtaag agctatcaaa aagaggcgcg ttggtttcat ggtggtcaca 4920

cgcctccgct ggaagaatac agcctgaacg gtctggtcag cattggtttt ccgctgttgc 4980

tgatcaccgg ctatgttgcg attgctgaga atgaagcagc cctggataaa gtccacccgc 5040

tgccggacct gctgcattat tccagcttgc tgagccgtct gattaatgat atcggcacta 5100

gcccggatga aatggcgcgt ggtgacaatc tgaagagcat tcactgctat atgaatgaaa 5160

ccggtgccag cgaagaggtc gcacgcgagc acatcaaagg cgtcatcgaa gagaattgga 5220

aaattctgaa ccagtgttgc tttgaccagt cccagttcca ggagccgttc atcacgttta 5280

acctgaacag cgtgcgcggc tcgcatttct tctatgaatt tggtgatggt tttggtgtta 5340

ccgacagctg gaccaaggtg gatatgaaaa gcgtcctgat tgatccgatt ccgctgggtg 5400

aagagtaagc ttgc 5414

<210> 27

<211> 1512

<212> DNA

<213> 人工序列

<220>

<223> CYP71AV8-L358A DNA 序列

<400> 27

atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60

atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120

ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180

ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240

tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300

accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360

atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420

ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480

ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540

ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600

atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660

gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720

aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780

cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840

gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900

acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960

gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020

gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080

catccgccgg ctccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140

tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200

gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260

gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320

gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380

tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440

actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500

agcacctaat ga 1512

<210> 28

<211> 502

<212> PRT

<213> 人工序列

<220>

<223> CYP71AV8-L358A 氨基酸序列

<400> 28

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val

1 5 10 15

Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys

20 25 30

Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly

35 40 45

His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu

50 55 60

Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val

65 70 75 80

Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr

85 90 95

Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu

100 105 110

Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu

115 120 125

Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn

130 135 140

Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn

145 150 155 160

Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu

165 170 175

Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala

180 185 190

Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys

195 200 205

Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro

210 215 220

Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr

225 230 235 240

Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu

245 250 255

His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val

260 265 270

Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn

275 280 285

Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser

290 295 300

Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg

305 310 315 320

Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys

325 330 335

Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu

340 345 350

Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Ala Pro Leu Val Met

355 360 365

Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro

370 375 380

Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro

385 390 395 400

Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn

405 410 415

Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly

420 425 430

Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val

435 440 445

Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro

450 455 460

Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala

465 470 475 480

Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln

485 490 495

Thr Leu Thr Ala Ser Thr

500

<210> 29

<211> 1512

<212> DNA

<213> 人工序列

<220>

<223> CYP71AV8-L358F DNA 序列

<400> 29

atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60

atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120

ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180

ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240

tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300

accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360

atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420

ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480

ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540

ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600

atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660

gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720

aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780

cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840

gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900

acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960

gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020

gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080

catccgccgt ttccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140

tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200

gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260

gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320

gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380

tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440

actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac gctgactgcg 1500

agcacctaat ga 1512

<210> 30

<211> 502

<212> PRT

<213> 人工序列

<220>

<223> CYP71AV8-L358F 氨基酸序列

<400> 30

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val

1 5 10 15

Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys

20 25 30

Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly

35 40 45

His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu

50 55 60

Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val

65 70 75 80

Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr

85 90 95

Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu

100 105 110

Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu

115 120 125

Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn

130 135 140

Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn

145 150 155 160

Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu

165 170 175

Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala

180 185 190

Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys

195 200 205

Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro

210 215 220

Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr

225 230 235 240

Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu

245 250 255

His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val

260 265 270

Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn

275 280 285

Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser

290 295 300

Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg

305 310 315 320

Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys

325 330 335

Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu

340 345 350

Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Phe Pro Leu Val Met

355 360 365

Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro

370 375 380

Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro

385 390 395 400

Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn

405 410 415

Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly

420 425 430

Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val

435 440 445

Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro

450 455 460

Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala

465 470 475 480

Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln

485 490 495

Thr Leu Thr Ala Ser Thr

500

<210> 31

<211> 1512

<212> DNA

<213> 人工序列

<220>

<223> CYP71AV8-L358T DNA 序列

<400> 31

atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60

atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120

ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180

ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240

tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300

accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360

atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420

ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480

ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540

ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600

atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660

gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720

aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780

cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840

gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900

acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960

gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020

gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080

catccgccga ctccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140

tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200

gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260

gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320

gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380

tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440

actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500

agcacctaat ga 1512

<210> 32

<211> 502

<212> PRT

<213> 人工序列

<220>

<223> CYP71AV8-L358T 氨基酸序列

<400> 32

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val

1 5 10 15

Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys

20 25 30

Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly

35 40 45

His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu

50 55 60

Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val

65 70 75 80

Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr

85 90 95

Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu

100 105 110

Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu

115 120 125

Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn

130 135 140

Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn

145 150 155 160

Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu

165 170 175

Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala

180 185 190

Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys

195 200 205

Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro

210 215 220

Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr

225 230 235 240

Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu

245 250 255

His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val

260 265 270

Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn

275 280 285

Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser

290 295 300

Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg

305 310 315 320

Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys

325 330 335

Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu

340 345 350

Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Thr Pro Leu Val Met

355 360 365

Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro

370 375 380

Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro

385 390 395 400

Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn

405 410 415

Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly

420 425 430

Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val

435 440 445

Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro

450 455 460

Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala

465 470 475 480

Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln

485 490 495

Thr Leu Thr Ala Ser Thr

500

<210> 33

<211> 1512

<212> DNA

<213> 人工序列

<220>

<223> CYP71AV8-L358S DNA 序列

<400> 33

atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60

atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120

ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180

ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240

tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300

accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360

atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420

ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480

ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540

ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600

atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660

gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720

aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780

cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840

gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900

acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960

gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020

gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080

catccgccgt ctccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140

tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200

gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260

gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320

gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380

tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440

actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500

agcacctaat ga 1512

<210> 34

<211> 502

<212> PRT

<213> 人工序列

<220>

<223> CYP71AV8-L358S 氨基酸序列

<400> 34

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val

1 5 10 15

Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys

20 25 30

Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly

35 40 45

His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu

50 55 60

Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val

65 70 75 80

Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr

85 90 95

Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu

100 105 110

Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu

115 120 125

Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn

130 135 140

Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn

145 150 155 160

Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu

165 170 175

Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala

180 185 190

Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys

195 200 205

Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro

210 215 220

Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr

225 230 235 240

Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu

245 250 255

His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val

260 265 270

Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn

275 280 285

Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser

290 295 300

Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg

305 310 315 320

Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys

325 330 335

Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu

340 345 350

Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Ser Pro Leu Val Met

355 360 365

Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro

370 375 380

Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro

385 390 395 400

Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn

405 410 415

Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly

420 425 430

Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val

435 440 445

Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro

450 455 460

Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala

465 470 475 480

Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln

485 490 495

Thr Leu Thr Ala Ser Thr

500

<210> 35

<211> 1512

<212> DNA

<213> 人工序列

<220>

<223> CYP71AV8-L358V DNA 序列

<400> 35

atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60

atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120

ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180

ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240

tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300

accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360

atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420

ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480

ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540

ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600

atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660

gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720

aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780

cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840

gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900

acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960

gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020

gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080

catccgccgg ttccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140

tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200

gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260

gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320

gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380

tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440

actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500

agcacctaat ga 1512

<210> 36

<211> 502

<212> PRT

<213> 人工序列

<220>

<223> CYP71AV8-L358V 氨基酸序列

<400> 36

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val

1 5 10 15

Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys

20 25 30

Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly

35 40 45

His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu

50 55 60

Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val

65 70 75 80

Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr

85 90 95

Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu

100 105 110

Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu

115 120 125

Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn

130 135 140

Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn

145 150 155 160

Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu

165 170 175

Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala

180 185 190

Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys

195 200 205

Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro

210 215 220

Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr

225 230 235 240

Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu

245 250 255

His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val

260 265 270

Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn

275 280 285

Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser

290 295 300

Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg

305 310 315 320

Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys

325 330 335

Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu

340 345 350

Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Val Pro Leu Val Met

355 360 365

Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro

370 375 380

Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro

385 390 395 400

Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn

405 410 415

Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly

420 425 430

Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val

435 440 445

Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro

450 455 460

Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala

465 470 475 480

Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln

485 490 495

Thr Leu Thr Ala Ser Thr

500

<210> 37

<211> 1512

<212> DNA

<213> 人工序列

<220>

<223> CYP71AV8-L358G DNA 序列

<400> 37

atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60

atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120

ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180

ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240

tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300

accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360

atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420

ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480

ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540

ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600

atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660

gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720

aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780

cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840

gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900

acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960

gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020

gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080

catccgccgg ggccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140

tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200

gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260

gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320

gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380

tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440

actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500

agcacctaat ga 1512

<210> 38

<211> 502

<212> PRT

<213> 人工序列

<220>

<223> CYP71AV8-L358G 氨基酸序列

<400> 38

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val

1 5 10 15

Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys

20 25 30

Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly

35 40 45

His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu

50 55 60

Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val

65 70 75 80

Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr

85 90 95

Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu

100 105 110

Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu

115 120 125

Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn

130 135 140

Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn

145 150 155 160

Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu

165 170 175

Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala

180 185 190

Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys

195 200 205

Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro

210 215 220

Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr

225 230 235 240

Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu

245 250 255

His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val

260 265 270

Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn

275 280 285

Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser

290 295 300

Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg

305 310 315 320

Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys

325 330 335

Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu

340 345 350

Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Gly Pro Leu Val Met

355 360 365

Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro

370 375 380

Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro

385 390 395 400

Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn

405 410 415

Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly

420 425 430

Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val

435 440 445

Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro

450 455 460

Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala

465 470 475 480

Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln

485 490 495

Thr Leu Thr Ala Ser Thr

500

<210> 39

<211> 1512

<212> DNA

<213> 人工序列

<220>

<223> CYP71AV8-L358I DNA 序列

<400> 39

atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60

atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120

ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180

ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240

tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300

accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360

atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420

ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480

ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540

ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600

atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660

gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720

aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780

cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840

gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900

acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960

gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020

gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080

catccgccga ttccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140

tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200

gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260

gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320

gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380

tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440

actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500

agcacctaat ga 1512

<210> 40

<211> 502

<212> PRT

<213> 人工序列

<220>

<223> CYP71AV8-L358I 氨基酸序列

<400> 40

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val

1 5 10 15

Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys

20 25 30

Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly

35 40 45

His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu

50 55 60

Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val

65 70 75 80

Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr

85 90 95

Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu

100 105 110

Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu

115 120 125

Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn

130 135 140

Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn

145 150 155 160

Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu

165 170 175

Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala

180 185 190

Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys

195 200 205

Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro

210 215 220

Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr

225 230 235 240

Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu

245 250 255

His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val

260 265 270

Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn

275 280 285

Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser

290 295 300

Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg

305 310 315 320

Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys

325 330 335

Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu

340 345 350

Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Ile Pro Leu Val Met

355 360 365

Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro

370 375 380

Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro

385 390 395 400

Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn

405 410 415

Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly

420 425 430

Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val

435 440 445

Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro

450 455 460

Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala

465 470 475 480

Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln

485 490 495

Thr Leu Thr Ala Ser Thr

500

<210> 41

<211> 1512

<212> DNA

<213> 人工序列

<220>

<223> CYP71AV8-L358M DNA 序列

<400> 41

atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60

atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120

ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180

ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240

tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300

accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360

atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420

ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480

ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540

ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600

atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660

gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720

aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780

cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840

gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900

acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960

gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020

gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080

catccgccga tgccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140

tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200

gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260

gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320

gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380

tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440

actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500

agcacctaat ga 1512

<210> 42

<211> 502

<212> PRT

<213> 人工序列

<220>

<223> CYP71AV8-L358M 氨基酸序列

<400> 42

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val

1 5 10 15

Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys

20 25 30

Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly

35 40 45

His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu

50 55 60

Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val

65 70 75 80

Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr

85 90 95

Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu

100 105 110

Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu

115 120 125

Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn

130 135 140

Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn

145 150 155 160

Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu

165 170 175

Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala

180 185 190

Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys

195 200 205

Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro

210 215 220

Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr

225 230 235 240

Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu

245 250 255

His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val

260 265 270

Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn

275 280 285

Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser

290 295 300

Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg

305 310 315 320

Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys

325 330 335

Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu

340 345 350

Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Met Pro Leu Val Met

355 360 365

Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro

370 375 380

Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro

385 390 395 400

Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn

405 410 415

Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly

420 425 430

Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val

435 440 445

Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro

450 455 460

Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala

465 470 475 480

Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln

485 490 495

Thr Leu Thr Ala Ser Thr

500

<210> 43

<211> 1512

<212> DNA

<213> 人工序列

<220>

<223> CYP71AV8-L358P DNA 序列

<400> 43

atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60

atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120

ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180

ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240

tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300

accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360

atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420

ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480

ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540

ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600

atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660

gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720

aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780

cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840

gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900

acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960

gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020

gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080

catccgccgc ctccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140

tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200

gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260

gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320

gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380

tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440

actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500

agcacctaat ga 1512

<210> 44

<211> 502

<212> PRT

<213> 人工序列

<220>

<223> CYP71AV8-L358P 氨基酸序列

<400> 44

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val

1 5 10 15

Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys

20 25 30

Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly

35 40 45

His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu

50 55 60

Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val

65 70 75 80

Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr

85 90 95

Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu

100 105 110

Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu

115 120 125

Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn

130 135 140

Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn

145 150 155 160

Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu

165 170 175

Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala

180 185 190

Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys

195 200 205

Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro

210 215 220

Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr

225 230 235 240

Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu

245 250 255

His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val

260 265 270

Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn

275 280 285

Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser

290 295 300

Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg

305 310 315 320

Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys

325 330 335

Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu

340 345 350

Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Pro Pro Leu Val Met

355 360 365

Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro

370 375 380

Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro

385 390 395 400

Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn

405 410 415

Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly

420 425 430

Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val

435 440 445

Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro

450 455 460

Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala

465 470 475 480

Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln

485 490 495

Thr Leu Thr Ala Ser Thr

500

<210> 45

<211> 1512

<212> DNA

<213> 人工序列

<220>

<223> CYP71AV8-L358Y DNA 序列

<400> 45

atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60

atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120

ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180

ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240

tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300

accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360

atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420

ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480

ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540

ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600

atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660

gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720

aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780

cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840

gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900

acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960

gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020

gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080

catccgccgt atccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140

tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200

gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260

gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320

gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380

tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440

actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500

agcacctaat ga 1512

<210> 46

<211> 502

<212> PRT

<213> 人工序列

<220>

<223> CYP71AV8-L358Y 氨基酸序列

<400> 46

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val

1 5 10 15

Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys

20 25 30

Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly

35 40 45

His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu

50 55 60

Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val

65 70 75 80

Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr

85 90 95

Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu

100 105 110

Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu

115 120 125

Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn

130 135 140

Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn

145 150 155 160

Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu

165 170 175

Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala

180 185 190

Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys

195 200 205

Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro

210 215 220

Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr

225 230 235 240

Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu

245 250 255

His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val

260 265 270

Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn

275 280 285

Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser

290 295 300

Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg

305 310 315 320

Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys

325 330 335

Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu

340 345 350

Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Tyr Pro Leu Val Met

355 360 365

Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro

370 375 380

Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro

385 390 395 400

Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn

405 410 415

Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly

420 425 430

Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val

435 440 445

Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro

450 455 460

Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala

465 470 475 480

Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln

485 490 495

Thr Leu Thr Ala Ser Thr

500

<210> 47

<211> 1512

<212> DNA

<213> 人工序列

<220>

<223> CYP71AV8-L358W DNA 序列

<400> 47

atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60

atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120

ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180

ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240

tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300

accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360

atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420

ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480

ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540

ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600

atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660

gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720

aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780

cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840

gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900

acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960

gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020

gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080

catccgccgt ggccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140

tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200

gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260

gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320

gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380

tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440

actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500

agcacctaat ga 1512

<210> 48

<211> 502

<212> PRT

<213> 人工序列

<220>

<223> CYP71AV8-L358W 氨基酸序列

<400> 48

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val

1 5 10 15

Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys

20 25 30

Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly

35 40 45

His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu

50 55 60

Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val

65 70 75 80

Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr

85 90 95

Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu

100 105 110

Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu

115 120 125

Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn

130 135 140

Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn

145 150 155 160

Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu

165 170 175

Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala

180 185 190

Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys

195 200 205

Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro

210 215 220

Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr

225 230 235 240

Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu

245 250 255

His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val

260 265 270

Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn

275 280 285

Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser

290 295 300

Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg

305 310 315 320

Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys

325 330 335

Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu

340 345 350

Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Trp Pro Leu Val Met

355 360 365

Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro

370 375 380

Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro

385 390 395 400

Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn

405 410 415

Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly

420 425 430

Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val

435 440 445

Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro

450 455 460

Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala

465 470 475 480

Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln

485 490 495

Thr Leu Thr Ala Ser Thr

500

<210> 49

<211> 1512

<212> DNA

<213> 人工序列

<220>

<223> CYP71AV8-L358R DNA 序列

<400> 49

atggctctgt tattagcagt tttttggtcg gcgcttataa tcctcgtagt aacctacacc 60

atatccctcc taatcaacca atggcgaaaa ccgaaacccc aagggaagtt ccccccgggc 120

ccatggcgtc tgccgattat cggtcacatg caccatttga tcggcaccat gccgcatcgt 180

ggtgttatgg aactggcccg taagcatggc agcctgatgc acctgcaact gggtgaagtc 240

tctacgattg ttgtcagcag cccgcgttgg gcgaaagagg tcttgaccac ctatgatatc 300

accttcgcca atcgcccgga aaccctgact ggcgagatcg tcgcatacca caacacggat 360

atcgtcctgg cgccgtatgg tgagtattgg cgtcaactgc gtaaactgtg cacgctggag 420

ctgctgagca acaagaaagt gaagagcttc cagagcctgc gcgaagaaga gtgttggaac 480

ctggtcaagg acatccgcag caccggccaa ggtagcccaa tcaatctgtc ggagaacatt 540

ttcaagatga ttgcgacgat tctgagccgt gctgcgttcg gtaagggtat taaggatcaa 600

atgaagttta ccgaactggt gaaagaaatc ctgcgtctga ccggcggttt tgatgtcgct 660

gacatcttcc ctagcaagaa gttgctgcac cacctgagcg gcaagcgtgc aaaactgacc 720

aatatccata acaagctgga taatctgatc aataacatca tcgcagagca cccgggcaac 780

cgtacctcgt cctcccagga aacgctgctg gacgttctgc tgcgcctgaa agagtctgcg 840

gagtttccgc tgaccgccga caacgttaaa gcagtgatcc tggatatgtt cggcgctggt 900

acggatacca gcagcgcgac gatcgagtgg gcgattagcg agctgattcg ctgccctcgc 960

gcgatggaga aagtgcagac ggaattgcgt caggcactga atggcaaaga gcgtattcag 1020

gaagaggatt tgcaggagct gaattatctg aagctggtga ttaaagaaac cctgcgcctg 1080

catccgccgc gtccgctggt gatgccgcgt gagtgccgtg aaccgtgtgt tttgggcggt 1140

tacgacattc cgagcaaaac gaagctgatc gttaatgttt tcgcgattaa ccgtgacccg 1200

gaatactgga aagacgcgga aacgtttatg ccggagcgtt ttgagaatag cccgattacc 1260

gttatgggtt ccgagtacga atacctgcca tttggtgctg gtcgtcgtat gtgtcctggt 1320

gcagcgctgg gtctggccaa cgtggaactg ccgctggcgc acattctgta ctatttcaac 1380

tggaaactgc cgaacggcaa gaccttcgaa gatttggaca tgaccgagag ctttggtgcc 1440

actgtgcagc gcaaaaccga gctgctgctg gttccgaccg actttcaaac cctgactgcg 1500

agcacctaat ga 1512

<210> 50

<211> 502

<212> PRT

<213> 人工序列

<220>

<223> CYP71AV8-L358R 氨基酸序列

<400> 50

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val

1 5 10 15

Val Thr Tyr Thr Ile Ser Leu Leu Ile Asn Gln Trp Arg Lys Pro Lys

20 25 30

Pro Gln Gly Lys Phe Pro Pro Gly Pro Trp Arg Leu Pro Ile Ile Gly

35 40 45

His Met His His Leu Ile Gly Thr Met Pro His Arg Gly Val Met Glu

50 55 60

Leu Ala Arg Lys His Gly Ser Leu Met His Leu Gln Leu Gly Glu Val

65 70 75 80

Ser Thr Ile Val Val Ser Ser Pro Arg Trp Ala Lys Glu Val Leu Thr

85 90 95

Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly Glu

100 105 110

Ile Val Ala Tyr His Asn Thr Asp Ile Val Leu Ala Pro Tyr Gly Glu

115 120 125

Tyr Trp Arg Gln Leu Arg Lys Leu Cys Thr Leu Glu Leu Leu Ser Asn

130 135 140

Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp Asn

145 150 155 160

Leu Val Lys Asp Ile Arg Ser Thr Gly Gln Gly Ser Pro Ile Asn Leu

165 170 175

Ser Glu Asn Ile Phe Lys Met Ile Ala Thr Ile Leu Ser Arg Ala Ala

180 185 190

Phe Gly Lys Gly Ile Lys Asp Gln Met Lys Phe Thr Glu Leu Val Lys

195 200 205

Glu Ile Leu Arg Leu Thr Gly Gly Phe Asp Val Ala Asp Ile Phe Pro

210 215 220

Ser Lys Lys Leu Leu His His Leu Ser Gly Lys Arg Ala Lys Leu Thr

225 230 235 240

Asn Ile His Asn Lys Leu Asp Asn Leu Ile Asn Asn Ile Ile Ala Glu

245 250 255

His Pro Gly Asn Arg Thr Ser Ser Ser Gln Glu Thr Leu Leu Asp Val

260 265 270

Leu Leu Arg Leu Lys Glu Ser Ala Glu Phe Pro Leu Thr Ala Asp Asn

275 280 285

Val Lys Ala Val Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr Ser

290 295 300

Ser Ala Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Arg Cys Pro Arg

305 310 315 320

Ala Met Glu Lys Val Gln Thr Glu Leu Arg Gln Ala Leu Asn Gly Lys

325 330 335

Glu Arg Ile Gln Glu Glu Asp Leu Gln Glu Leu Asn Tyr Leu Lys Leu

340 345 350

Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Arg Pro Leu Val Met

355 360 365

Pro Arg Glu Cys Arg Glu Pro Cys Val Leu Gly Gly Tyr Asp Ile Pro

370 375 380

Ser Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp Pro

385 390 395 400

Glu Tyr Trp Lys Asp Ala Glu Thr Phe Met Pro Glu Arg Phe Glu Asn

405 410 415

Ser Pro Ile Thr Val Met Gly Ser Glu Tyr Glu Tyr Leu Pro Phe Gly

420 425 430

Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn Val

435 440 445

Glu Leu Pro Leu Ala His Ile Leu Tyr Tyr Phe Asn Trp Lys Leu Pro

450 455 460

Asn Gly Lys Thr Phe Glu Asp Leu Asp Met Thr Glu Ser Phe Gly Ala

465 470 475 480

Thr Val Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Thr Asp Phe Gln

485 490 495

Thr Leu Thr Ala Ser Thr

500

<210> 51

<211> 1488

<212> DNA

<213> 黄花蒿(Artemisia annua)

<400> 51

atgaagagta tactaaaagc aatggcactc tcactgacca cttccattgc tcttgcaacg 60

atccttttgt tcgtttacaa gttcgctact cgttccaaat ccaccaaaaa aagccttcct 120

gagccatggc ggcttcccat tattggtcac atgcatcact tgattggtac aacgccacat 180

cgtggggtta gggatttagc cagaaagtat ggatctttga tgcatttaca gcttggtgaa 240

gttccaacaa tcgtggtgtc atctccgaaa tgggctaaag agattttgac aacgtacgac 300

attacctttg ctaacaggcc cgagacttta actggtgaga ttgttttata tcacaatacg 360

gatgttgttc ttgcacctta tggtgaatac tggaggcaat tacgtaaaat ttgcacattg 420

gagcttttga gtgttaagaa agtaaagtca tttcagtcac ttcgtgaaga ggagtgttgg 480

aatttggttc aagagattaa agcttcaggt tcagggagac cggttaacct ttcagagaat 540

gttttcaagt tgattgcaac gatacttagt agagccgcat ttgggaaagg gatcaaggac 600

cagaaagagt taacggagat tgtgaaagag atactgaggc aaactggtgg ttttgatgtg 660

gcagatatct ttccttcaaa gaaatttctt catcatcttt cgggcaagag agctcggtta 720

actagccttc gcaaaaagat cgataattta atcgataacc ttgtagctga gcatactgtt 780

aacacctcca gtaaaactaa cgagacactc ctcgatgttc ttttaaggct caaagacagt 840

gctgaattcc cattaacatc tgataacatt aaagccatca ttttggatat gtttggagca 900

ggcacagaca cttcctcatc cacaatcgaa tgggcgattt cggaactcat aaagtgtccg 960

aaagcaatgg agaaagtaca agcggaattg aggaaagcat tgaacggaaa agaaaagatc 1020

catgaggaag acattcaaga actaagctac ttgaacatgg taatcaaaga aacattgagg 1080

ttgcaccctc cactaccctt ggttctgcca agagagtgcc gccaaccagt caatttggct 1140

ggatacaaca tacccaataa gaccaaactt attgtcaacg tctttgcgat aaatagggac 1200

cctgaatatt ggaaagacgc tgaagctttc atccctgaac gatttgaaaa tagttctgca 1260

actgtcatgg gtgcagaata cgagtatctt ccgtttggag ctgggagaag gatgtgtcct 1320

ggagccgcac ttggtttagc taacgtgcag ctcccgctcg ctaatatact atatcatttc 1380

aactggaaac tccccaatgg tgtgagctat gaccagatcg acatgaccga gagctctgga 1440

gccacgatgc aaagaaagac tgagttgtta ctcgttccaa gtttctag 1488

<210> 52

<211> 495

<212> PRT

<213> 黄花蒿(Artemisia annua)

<400> 52

Met Lys Ser Ile Leu Lys Ala Met Ala Leu Ser Leu Thr Thr Ser Ile

1 5 10 15

Ala Leu Ala Thr Ile Leu Leu Phe Val Tyr Lys Phe Ala Thr Arg Ser

20 25 30

Lys Ser Thr Lys Lys Ser Leu Pro Glu Pro Trp Arg Leu Pro Ile Ile

35 40 45

Gly His Met His His Leu Ile Gly Thr Thr Pro His Arg Gly Val Arg

50 55 60

Asp Leu Ala Arg Lys Tyr Gly Ser Leu Met His Leu Gln Leu Gly Glu

65 70 75 80

Val Pro Thr Ile Val Val Ser Ser Pro Lys Trp Ala Lys Glu Ile Leu

85 90 95

Thr Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu Thr Leu Thr Gly

100 105 110

Glu Ile Val Leu Tyr His Asn Thr Asp Val Val Leu Ala Pro Tyr Gly

115 120 125

Glu Tyr Trp Arg Gln Leu Arg Lys Ile Cys Thr Leu Glu Leu Leu Ser

130 135 140

Val Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu Glu Glu Cys Trp

145 150 155 160

Asn Leu Val Gln Glu Ile Lys Ala Ser Gly Ser Gly Arg Pro Val Asn

165 170 175

Leu Ser Glu Asn Val Phe Lys Leu Ile Ala Thr Ile Leu Ser Arg Ala

180 185 190

Ala Phe Gly Lys Gly Ile Lys Asp Gln Lys Glu Leu Thr Glu Ile Val

195 200 205

Lys Glu Ile Leu Arg Gln Thr Gly Gly Phe Asp Val Ala Asp Ile Phe

210 215 220

Pro Ser Lys Lys Phe Leu His His Leu Ser Gly Lys Arg Ala Arg Leu

225 230 235 240

Thr Ser Leu Arg Lys Lys Ile Asp Asn Leu Ile Asp Asn Leu Val Ala

245 250 255

Glu His Thr Val Asn Thr Ser Ser Lys Thr Asn Glu Thr Leu Leu Asp

260 265 270

Val Leu Leu Arg Leu Lys Asp Ser Ala Glu Phe Pro Leu Thr Ser Asp

275 280 285

Asn Ile Lys Ala Ile Ile Leu Asp Met Phe Gly Ala Gly Thr Asp Thr

290 295 300

Ser Ser Ser Thr Ile Glu Trp Ala Ile Ser Glu Leu Ile Lys Cys Pro

305 310 315 320

Lys Ala Met Glu Lys Val Gln Ala Glu Leu Arg Lys Ala Leu Asn Gly

325 330 335

Lys Glu Lys Ile His Glu Glu Asp Ile Gln Glu Leu Ser Tyr Leu Asn

340 345 350

Met Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro Leu Pro Leu Val

355 360 365

Leu Pro Arg Glu Cys Arg Gln Pro Val Asn Leu Ala Gly Tyr Asn Ile

370 375 380

Pro Asn Lys Thr Lys Leu Ile Val Asn Val Phe Ala Ile Asn Arg Asp

385 390 395 400

Pro Glu Tyr Trp Lys Asp Ala Glu Ala Phe Ile Pro Glu Arg Phe Glu

405 410 415

Asn Ser Ser Ala Thr Val Met Gly Ala Glu Tyr Glu Tyr Leu Pro Phe

420 425 430

Gly Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu Gly Leu Ala Asn

435 440 445

Val Gln Leu Pro Leu Ala Asn Ile Leu Tyr His Phe Asn Trp Lys Leu

450 455 460

Pro Asn Gly Val Ser Tyr Asp Gln Ile Asp Met Thr Glu Ser Ser Gly

465 470 475 480

Ala Thr Met Gln Arg Lys Thr Glu Leu Leu Leu Val Pro Ser Phe

485 490 495

<210> 53

<211> 1500

<212> DNA

<213> 人工序列

<220>

<223> CYP71AV1密码子优化的DNA 序列

<400> 53

atgaccgtac acgacatcat cgcaacgtac ttcactaaat ggtacgtaat tgtgccgctg 60

gcactgattg cgtatcgcgt gctggattat ttctacgcga cccgttctaa aagcactaag 120

aaatctctgc cggaaccgtg gcgtctgcca atcatcggtc acatgcacca cctgatcggc 180

accaccccgc accgtggcgt acgcgacctg gcgcgtaagt acggctctct gatgcatctg 240

cagctgggcg aggtacctac tatcgtcgtt tcctccccga agtgggccaa agaaatcctg 300

actacctatg acatcacttt cgccaaccgc ccggaaacgc tgaccggcga aattgtcctg 360

taccataaca cggatgtggt tctggccccg tacggtgagt actggcgcca gctgcgcaaa 420

atttgtactc tggaactgct gagcgttaaa aaggttaaat ccttccagag cctgcgtgaa 480

gaggaatgct ggaacctggt gcaggagatt aaagcgtctg gcagcggtcg tccagttaac 540

ctgtctgaga atgtttttaa actgatcgct actatcctgt ctcgcgcggc attcggtaaa 600

ggtatcaaag atcagaaaga actgaccgaa atcgttaagg aaatcctgcg ccagactggt 660

ggcttcgacg ttgcggacat cttcccgtcc aaaaagttcc tgcaccatct gtctggcaaa 720

cgcgctcgtc tgacctccct gcgtaagaaa attgataacc tgattgacaa cctggtcgct 780

gagcacactg tgaacacctc ttctaaaacc aacgaaaccc tgctggacgt actgctgcgc 840

ctgaaggact ctgccgaatt tccactgact agcgacaata tcaaagcaat catcctggac 900

atgttcggcg ccggtaccga tacgtcctct tccacgattg agtgggctat ttccgaactg 960

atcaaatgcc cgaaggcgat ggaaaaagtg caggcggaac tgcgtaaagc gctgaacggt 1020

aaagagaaaa ttcatgaaga ggacatccag gaactgtcct acctgaatat ggtaatcaaa 1080

gaaactctgc gtctgcatcc gccgctgcca ctggttctgc cgcgtgaatg ccgtcagccg 1140

gttaacctgg ccggctacaa cattccgaac aaaacgaagc tgatcgtcaa cgttttcgcg 1200

atcaaccgcg atcctgaata ctggaaagac gcggaagcgt tcattccgga acgctttgag 1260

aactcctctg ccaccgttat gggcgctgaa tacgagtacc tgccgttcgg tgcgggtcgc 1320

cgtatgtgcc cgggtgctgc actgggcctg gcgaacgttc aactgccact ggcgaacatc 1380

ctgtaccact tcaactggaa actgcctaac ggcgtatctt atgatcaaat cgacatgacc 1440

gaaagctccg gcgcgaccat gcagcgtaaa accgaactgc tgctggttcc gtccttttaa 1500

<210> 54

<211> 499

<212> PRT

<213> 人工序列

<220>

<223> CYP71AV1密码子优化的氨基酸序列

<400> 54

Met Thr Val His Asp Ile Ile Ala Thr Tyr Phe Thr Lys Trp Tyr Val

1 5 10 15

Ile Val Pro Leu Ala Leu Ile Ala Tyr Arg Val Leu Asp Tyr Phe Tyr

20 25 30

Ala Thr Arg Ser Lys Ser Thr Lys Lys Ser Leu Pro Glu Pro Trp Arg

35 40 45

Leu Pro Ile Ile Gly His Met His His Leu Ile Gly Thr Thr Pro His

50 55 60

Arg Gly Val Arg Asp Leu Ala Arg Lys Tyr Gly Ser Leu Met His Leu

65 70 75 80

Gln Leu Gly Glu Val Pro Thr Ile Val Val Ser Ser Pro Lys Trp Ala

85 90 95

Lys Glu Ile Leu Thr Thr Tyr Asp Ile Thr Phe Ala Asn Arg Pro Glu

100 105 110

Thr Leu Thr Gly Glu Ile Val Leu Tyr His Asn Thr Asp Val Val Leu

115 120 125

Ala Pro Tyr Gly Glu Tyr Trp Arg Gln Leu Arg Lys Ile Cys Thr Leu

130 135 140

Glu Leu Leu Ser Val Lys Lys Val Lys Ser Phe Gln Ser Leu Arg Glu

145 150 155 160

Glu Glu Cys Trp Asn Leu Val Gln Glu Ile Lys Ala Ser Gly Ser Gly

165 170 175

Arg Pro Val Asn Leu Ser Glu Asn Val Phe Lys Leu Ile Ala Thr Ile

180 185 190

Leu Ser Arg Ala Ala Phe Gly Lys Gly Ile Lys Asp Gln Lys Glu Leu

195 200 205

Thr Glu Ile Val Lys Glu Ile Leu Arg Gln Thr Gly Gly Phe Asp Val

210 215 220

Ala Asp Ile Phe Pro Ser Lys Lys Phe Leu His His Leu Ser Gly Lys

225 230 235 240

Arg Ala Arg Leu Thr Ser Leu Arg Lys Lys Ile Asp Asn Leu Ile Asp

245 250 255

Asn Leu Val Ala Glu His Thr Val Asn Thr Ser Ser Lys Thr Asn Glu

260 265 270

Thr Leu Leu Asp Val Leu Leu Arg Leu Lys Asp Ser Ala Glu Phe Pro

275 280 285

Leu Thr Ser Asp Asn Ile Lys Ala Ile Ile Leu Asp Met Phe Gly Ala

290 295 300

Gly Thr Asp Thr Ser Ser Ser Thr Ile Glu Trp Ala Ile Ser Glu Leu

305 310 315 320

Ile Lys Cys Pro Lys Ala Met Glu Lys Val Gln Ala Glu Leu Arg Lys

325 330 335

Ala Leu Asn Gly Lys Glu Lys Ile His Glu Glu Asp Ile Gln Glu Leu

340 345 350

Ser Tyr Leu Asn Met Val Ile Lys Glu Thr Leu Arg Leu His Pro Pro

355 360 365

Leu Pro Leu Val Leu Pro Arg Glu Cys Arg Gln Pro Val Asn Leu Ala

370 375 380

Gly Tyr Asn Ile Pro Asn Lys Thr Lys Leu Ile Val Asn Val Phe Ala

385 390 395 400

Ile Asn Arg Asp Pro Glu Tyr Trp Lys Asp Ala Glu Ala Phe Ile Pro

405 410 415

Glu Arg Phe Glu Asn Ser Ser Ala Thr Val Met Gly Ala Glu Tyr Glu

420 425 430

Tyr Leu Pro Phe Gly Ala Gly Arg Arg Met Cys Pro Gly Ala Ala Leu

435 440 445

Gly Leu Ala Asn Val Gln Leu Pro Leu Ala Asn Ile Leu Tyr His Phe

450 455 460

Asn Trp Lys Leu Pro Asn Gly Val Ser Tyr Asp Gln Ile Asp Met Thr

465 470 475 480

Glu Ser Ser Gly Ala Thr Met Gln Arg Lys Thr Glu Leu Leu Leu Val

485 490 495

Pro Ser Phe

<210> 55

<211> 3150

<212> DNA

<213> 巨大芽孢杆菌(Bacillus megaterium)

<400> 55

atgacaatta aagaaatgcc tcagccaaaa acgtttggag agcttaaaaa tttaccgtta 60

ttaaacacag ataaaccggt tcaagctttg atgaaaattg cggatgaatt aggagaaatc 120

tttaaattcg aggcgcctgg tcgtgtaacg cgctacttat caagtcagcg tctaattaaa 180

gaagcatgcg atgaatcacg ctttgataaa aacttaagtc aagcgcttaa atttgtacgt 240

gattttgcag gagacgggtt atttacaagc tggacgcatg aaaaaaattg gaaaaaagcg 300

cataatatct tacttccaag cttcagtcag caggcaatga aaggctatca tgcgatgatg 360

gtcgatatcg ccgtgcagct tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt 420

gaagtaccgg aagacatgac acgtttaacg cttgatacaa ttggtctttg cggctttaac 480

tatcgcttta acagctttta ccgagatcag cctcatccat ttattacaag tatggtccgt 540

gcactggatg aagcaatgaa caagctgcag cgagcaaatc cagacgaccc agcttatgat 600

gaaaacaagc gccagtttca agaagatatc aaggtgatga acgacctagt agataaaatt 660

attgcagatc gcaaagcaag cggtgaacaa agcgatgatt tattaacgca catgctaaac 720

ggaaaagatc cagaaacggg tgagccgctt gatgacgaga acattcgcta tcaaattatt 780

acattcttaa ttgcgggaca cgaaacaaca agtggtcttt tatcatttgc gctgtatttc 840

ttagtgaaaa atccacatgt attacaaaaa gcagcagaag aagcagcacg agttctagta 900

gatcctgttc caagctacaa acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac 960

gaagcgctgc gcttatggcc aactgctcct gcgttttccc tatatgcaaa agaagatacg 1020

gtgcttggag gagaatatcc tttagaaaaa ggcgacgaac taatggttct gattcctcag 1080

cttcaccgtg ataaaacaat ttggggagac gatgtggaag agttccgtcc agagcgtttt 1140

gaaaatccaa gtgcgattcc gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg 1200

tgtatcggtc agcagttcgc tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa 1260

cactttgact ttgaagatca tacaaactac gagctggata ttaaagaaac tttaacgtta 1320

aaacctgaag gctttgtggt aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct 1380

tcacctagca ctgaacagtc tgctaaaaaa gtacgcaaaa aggcagaaaa cgctcataat 1440

acgccgctgc ttgtgctata cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat 1500

ttagcagata ttgcaatgag caaaggattt gcaccgcagg tcgcaacgct tgattcacac 1560

gccggaaatc ttccgcgcga aggagctgta ttaattgtaa cggcgtctta taacggtcat 1620

ccgcctgata acgcaaagca atttgtcgac tggttagacc aagcgtctgc tgatgaagta 1680

aaaggcgttc gctactccgt atttggatgc ggcgataaaa actgggctac tacgtatcaa 1740

aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 1800

cgcggtgaag cagatgcaag cgacgacttt gaaggcacct atgaagaatg gcgtgaacac 1860

atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 1920

tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 1980

ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 2040

agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 2100

ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 2160

ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 2220

ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 2280

acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 2340

cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 2400

atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 2460

cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 2520

aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 2580

tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 2640

tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 2700

atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 2760

ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 2820

catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 2880

cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 2940

gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 3000

ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 3060

gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 3120

cgatacgcaa aagacgtgtg ggctgggtaa 3150

<210> 56

<211> 1049

<212> PRT

<213> 巨大芽孢杆菌(Bacillus megaterium)

<400> 56

Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys

1 5 10 15

Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys

20 25 30

Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg

35 40 45

Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp

50 55 60

Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg

65 70 75 80

Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn

85 90 95

Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala

100 105 110

Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val

115 120 125

Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu

130 135 140

Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn

145 150 155 160

Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr

165 170 175

Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala

180 185 190

Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu

195 200 205

Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg

210 215 220

Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn

225 230 235 240

Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg

245 250 255

Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly

260 265 270

Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu

275 280 285

Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro

290 295 300

Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn

305 310 315 320

Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala

325 330 335

Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp

340 345 350

Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp

355 360 365

Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser

370 375 380

Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala

385 390 395 400

Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly

405 410 415

Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu

420 425 430

Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys

435 440 445

Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr

450 455 460

Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn

465 470 475 480

Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly

485 490 495

Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro

500 505 510

Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly

515 520 525

Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn

530 535 540

Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val

545 550 555 560

Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala

565 570 575

Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala

580 585 590

Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp

595 600 605

Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp

610 615 620

Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys

625 630 635 640

Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu

645 650 655

Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu

660 665 670

Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu

675 680 685

Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile

690 695 700

Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly

705 710 715 720

Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu

725 730 735

Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln

740 745 750

Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met

755 760 765

Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu

770 775 780

Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr

785 790 795 800

Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser

805 810 815

Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile

820 825 830

Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser

835 840 845

Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile

850 855 860

Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys

865 870 875 880

Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu

885 890 895

Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg

900 905 910

Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu

915 920 925

Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr

930 935 940

Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr

945 950 955 960

Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val

965 970 975

Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp

980 985 990

Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro

995 1000 1005

Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln

1010 1015 1020

Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu

1025 1030 1035

Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly

1040 1045

<210> 57

<211> 3150

<212> DNA

<213> 人工序列

<220>

<223> P450-BM3变体7 DNA 序列

<400> 57

atgacaatta aagaaatgcc tcagccaaaa acgtttggag agcttaaaaa tttaccgtta 60

ttaaacacag ataaaccggt tcaagctttg atgaaaattg cggatgaatt aggagaaatc 120

tttaaattcg aggcgcctgg tcgtgtaacg cgctacttat caagtcagcg tctaattaaa 180

gaagcatgcg atgaatcacg ctttgataaa aacttaagtc aagcgcttaa atttgtacgt 240

gattttgcag gagacgggtt aatcacaagc tggacgcatg aaaaaaattg gaaaaaagcg 300

cataatatct tacttccaag cttcagtcag caggcaatga aaggctatca tgcgatgatg 360

gtcgatatcg ccgtgcagct tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt 420

gaagtaccgg aagacatgac acgtttaacg cttgatacaa ttggtctttg cggctttaac 480

tatcgcttta acagctttta ccgagatcag cctcatccat ttattacaag tatggtccgt 540

gcactggatg aagcaatgaa caagctgcag cgagcaaatc cagacgaccc agcttatgat 600

gaaaacaagc gccagtttca agaagatatc aaggtgatga acgacctagt agataaaatt 660

attgcagatc gcaaagcaag cggtgaacaa agcgatgatt tattaacgca catgctaaac 720

ggaaaagatc cagaaacggg tgagccgctt gatgacgaga acattcgcta tcaaattatt 780

acattcttaa ttgcgggaca cgaaacaaca agtggtcttt tatcatttgc gctgtatttc 840

ttagtgaaaa atccacatgt attacaaaaa gcagcagaag aagcagcacg agttctagta 900

gatcctgttc caagctacaa acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac 960

gaagcgctgc gcttatggcc aactatccct gcgttttccc tatatgcaaa agaagatacg 1020

gtgcttggag gagaatatcc tttagaaaaa ggcgacgaac taatggttct gattcctcag 1080

cttcaccgtg ataaaacaat ttggggagac gatgtggaag agttccgtcc agagcgtttt 1140

gaaaatccaa gtgcgattcc gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg 1200

tgtatcggtc agcagttcgc tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa 1260

cactttgact ttgaagatca tacaaactac gagctggata ttaaagaaac tttaacgtta 1320

aaacctgaag gctttgtggt aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct 1380

tcacctagca ctgaacagtc tgctaaaaaa gtacgcaaaa aggcagaaaa cgctcataat 1440

acgccgctgc ttgtgctata cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat 1500

ttagcagata ttgcaatgag caaaggattt gcaccgcagg tcgcaacgct tgattcacac 1560

gccggaaatc ttccgcgcga aggagctgta ttaattgtaa cggcgtctta taacggtcat 1620

ccgcctgata acgcaaagca atttgtcgac tggttagacc aagcgtctgc tgatgaagta 1680

aaaggcgttc gctactccgt atttggatgc ggcgataaaa actgggctac tacgtatcaa 1740

aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 1800

cgcggtgaag cagatgcaag cgacgacttt gaaggcacct atgaagaatg gcgtgaacac 1860

atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 1920

tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 1980

ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 2040

agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 2100

ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 2160

ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 2220

ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 2280

acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 2340

cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 2400

atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 2460

cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 2520

aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 2580

tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 2640

tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 2700

atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 2760

ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 2820

catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 2880

cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 2940

gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 3000

ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 3060

gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 3120

cgatacgcaa aagacgtgtg ggctgggtaa 3150

<210> 58

<211> 1049

<212> PRT

<213> 人工序列

<220>

<223> P450-BM3变体7 氨基酸序列

<400> 58

Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys

1 5 10 15

Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys

20 25 30

Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg

35 40 45

Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp

50 55 60

Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg

65 70 75 80

Asp Phe Ala Gly Asp Gly Leu Ile Thr Ser Trp Thr His Glu Lys Asn

85 90 95

Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala

100 105 110

Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val

115 120 125

Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu

130 135 140

Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn

145 150 155 160

Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr

165 170 175

Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala

180 185 190

Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu

195 200 205

Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg

210 215 220

Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn

225 230 235 240

Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg

245 250 255

Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly

260 265 270

Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu

275 280 285

Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro

290 295 300

Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn

305 310 315 320

Glu Ala Leu Arg Leu Trp Pro Thr Ile Pro Ala Phe Ser Leu Tyr Ala

325 330 335

Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp

340 345 350

Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp

355 360 365

Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser

370 375 380

Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala

385 390 395 400

Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly

405 410 415

Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu

420 425 430

Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys

435 440 445

Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr

450 455 460

Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn

465 470 475 480

Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly

485 490 495

Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro

500 505 510

Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly

515 520 525

Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn

530 535 540

Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val

545 550 555 560

Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala

565 570 575

Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala

580 585 590

Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp

595 600 605

Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp

610 615 620

Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys

625 630 635 640

Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu

645 650 655

Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu

660 665 670

Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu

675 680 685

Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile

690 695 700

Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly

705 710 715 720

Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu

725 730 735

Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln

740 745 750

Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met

755 760 765

Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu

770 775 780

Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr

785 790 795 800

Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser

805 810 815

Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile

820 825 830

Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser

835 840 845

Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile

850 855 860

Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys

865 870 875 880

Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu

885 890 895

Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg

900 905 910

Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu

915 920 925

Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr

930 935 940

Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr

945 950 955 960

Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val

965 970 975

Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp

980 985 990

Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro

995 1000 1005

Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln

1010 1015 1020

Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu

1025 1030 1035

Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly

1040 1045

<210> 59

<211> 3150

<212> DNA

<213> 人工序列

<220>

<223> P450-BM3变体17 DNA 序列

<400> 59

atgacaatta aagaaatgcc tcagccaaaa acgtttggag agcttaaaaa tttaccgtta 60

ttaaacacag ataaaccggt tcaagctttg atgaaaattg cggatgaatt aggagaaatc 120

tttaaattcg aggcgcctgg tcgtgtaacg cgctacttat caagtcagcg tctaattaaa 180

gaagcatgcg atgaatcacg ctttgataaa aacttaagtc aagcgcttaa atttgtacgt 240

gattttgcag gagacgggtt agttacaagc tggacgcatg aaaaaaattg gaaaaaagcg 300

cataatatct tacttccaag cttcagtcag caggcaatga aaggctatca tgcgatgatg 360

gtcgatatcg ccgtgcagct tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt 420

gaagtaccgg aagacatgac acgtttaacg cttgatacaa ttggtctttg cggctttaac 480

tatcgcttta acagctttta ccgagatcag cctcatccat ttattacaag tatggtccgt 540

gcactggatg aagcaatgaa caagctgcag cgagcaaatc cagacgaccc agcttatgat 600

gaaaacaagc gccagtttca agaagatatc aaggtgatga acgacctagt agataaaatt 660

attgcagatc gcaaagcaag cggtgaacaa agcgatgatt tattaacgca catgctaaac 720

ggaaaagatc cagaaacggg tgagccgctt gatgacgaga acattcgcta tcaaattatt 780

acattcttaa ttgcgggaca cgaaacaaca agtggtcttt tatcatttgc gctgtatttc 840

ttagtgaaaa atccacatgt attacaaaaa gcagcagaag aagcagcacg agttctagta 900

gatcctgttc caagctacaa acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac 960

gaagcgctgc gcttatggcc aactatccct gcgttttccc tatatgcaaa agaagatacg 1020

gtgcttggag gagaatatcc tttagaaaaa ggcgacgaac taatggttct gattcctcag 1080

cttcaccgtg ataaaacaat ttggggagac gatgtggaag agttccgtcc agagcgtttt 1140

gaaaatccaa gtgcgattcc gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg 1200

tgtatcggtc agcagttcgc tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa 1260

cactttgact ttgaagatca tacaaactac gagctggata ttaaagaaac tttaacgtta 1320

aaacctgaag gctttgtggt aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct 1380

tcacctagca ctgaacagtc tgctaaaaaa gtacgcaaaa aggcagaaaa cgctcataat 1440

acgccgctgc ttgtgctata cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat 1500

ttagcagata ttgcaatgag caaaggattt gcaccgcagg tcgcaacgct tgattcacac 1560

gccggaaatc ttccgcgcga aggagctgta ttaattgtaa cggcgtctta taacggtcat 1620

ccgcctgata acgcaaagca atttgtcgac tggttagacc aagcgtctgc tgatgaagta 1680

aaaggcgttc gctactccgt atttggatgc ggcgataaaa actgggctac tacgtatcaa 1740

aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 1800

cgcggtgaag cagatgcaag cgacgacttt gaaggcacct atgaagaatg gcgtgaacac 1860

atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 1920

tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 1980

ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 2040

agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 2100

ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 2160

ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 2220

ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 2280

acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 2340

cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 2400

atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 2460

cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 2520

aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 2580

tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 2640

tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 2700

atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 2760

ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 2820

catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 2880

cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 2940

gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 3000

ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 3060

gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 3120

cgatacgcaa aagacgtgtg ggctgggtaa 3150

<210> 60

<211> 1049

<212> PRT

<213> 人工序列

<220>

<223> P450-BM3变体17 氨基酸序列

<400> 60

Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys

1 5 10 15

Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys

20 25 30

Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg

35 40 45

Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp

50 55 60

Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg

65 70 75 80

Asp Phe Ala Gly Asp Gly Leu Val Thr Ser Trp Thr His Glu Lys Asn

85 90 95

Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala

100 105 110

Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val

115 120 125

Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu

130 135 140

Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn

145 150 155 160

Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr

165 170 175

Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala

180 185 190

Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu

195 200 205

Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg

210 215 220

Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn

225 230 235 240

Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg

245 250 255

Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly

260 265 270

Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu

275 280 285

Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro

290 295 300

Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn

305 310 315 320

Glu Ala Leu Arg Leu Trp Pro Thr Ile Pro Ala Phe Ser Leu Tyr Ala

325 330 335

Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp

340 345 350

Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp

355 360 365

Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser

370 375 380

Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala

385 390 395 400

Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly

405 410 415

Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu

420 425 430

Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys

435 440 445

Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr

450 455 460

Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn

465 470 475 480

Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly

485 490 495

Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro

500 505 510

Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly

515 520 525

Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn

530 535 540

Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val

545 550 555 560

Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala

565 570 575

Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala

580 585 590

Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp

595 600 605

Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp

610 615 620

Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys

625 630 635 640

Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu

645 650 655

Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu

660 665 670

Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu

675 680 685

Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile

690 695 700

Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly

705 710 715 720

Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu

725 730 735

Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln

740 745 750

Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met

755 760 765

Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu

770 775 780

Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr

785 790 795 800

Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser

805 810 815

Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile

820 825 830

Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser

835 840 845

Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile

850 855 860

Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys

865 870 875 880

Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu

885 890 895

Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg

900 905 910

Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu

915 920 925

Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr

930 935 940

Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr

945 950 955 960

Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val

965 970 975

Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp

980 985 990

Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro

995 1000 1005

Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln

1010 1015 1020

Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu

1025 1030 1035

Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly

1040 1045

<210> 61

<211> 3150

<212> DNA

<213> 人工序列

<220>

<223> P450-BM3变体18 DNA 序列

<400> 61

atgacaatta aagaaatgcc tcagccaaaa acgtttggag agcttaaaaa tttaccgtta 60

ttaaacacag ataaaccggt tcaagctttg atgaaaattg cggatgaatt aggagaaatc 120

tttaaattcg aggcgcctgg tcgtgtaacg cgctacttat caagtcagcg tctaattaaa 180

gaagcatgcg atgaatcacg ctttgataaa aacttaagtc aagcgcttaa atttgtacgt 240

gattttgcag gagacgggtt agttacaagc tggacgcatg aaaaaaattg gaaaaaagcg 300

cataatatct tacttccaag cttcagtcag caggcaatga aaggctatca tgcgatgatg 360

gtcgatatcg ccgtgcagct tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt 420

gaagtaccgg aagacatgac acgtttaacg cttgatacaa ttggtctttg cggctttaac 480

tatcgcttta acagctttta ccgagatcag cctcatccat ttattacaag tatggtccgt 540

gcactggatg aagcaatgaa caagctgcag cgagcaaatc cagacgaccc agcttatgat 600

gaaaacaagc gccagtttca agaagatatc aaggtgatga acgacctagt agataaaatt 660

attgcagatc gcaaagcaag cggtgaacaa agcgatgatt tattaacgca catgctaaac 720

ggaaaagatc cagaaacggg tgagccgctt gatgacgaga acattcgcta tcaaattatt 780

acattcttaa ttgcgggaca cgaaacaaca agtggtcttt tatcatttgc gctgtatttc 840

ttagtgaaaa atccacatgt attacaaaaa gcagcagaag aagcagcacg agttctagta 900

gatcctgttc caagctacaa acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac 960

gaagcgctgc gcttatggcc aactctgcct gcgttttccc tatatgcaaa agaagatacg 1020

gtgcttggag gagaatatcc tttagaaaaa ggcgacgaac taatggttct gattcctcag 1080

cttcaccgtg ataaaacaat ttggggagac gatgtggaag agttccgtcc agagcgtttt 1140

gaaaatccaa gtgcgattcc gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg 1200

tgtatcggtc agcagttcgc tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa 1260

cactttgact ttgaagatca tacaaactac gagctggata ttaaagaaac tttaacgtta 1320

aaacctgaag gctttgtggt aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct 1380

tcacctagca ctgaacagtc tgctaaaaaa gtacgcaaaa aggcagaaaa cgctcataat 1440

acgccgctgc ttgtgctata cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat 1500

ttagcagata ttgcaatgag caaaggattt gcaccgcagg tcgcaacgct tgattcacac 1560

gccggaaatc ttccgcgcga aggagctgta ttaattgtaa cggcgtctta taacggtcat 1620

ccgcctgata acgcaaagca atttgtcgac tggttagacc aagcgtctgc tgatgaagta 1680

aaaggcgttc gctactccgt atttggatgc ggcgataaaa actgggctac tacgtatcaa 1740

aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 1800

cgcggtgaag cagatgcaag cgacgacttt gaaggcacct atgaagaatg gcgtgaacac 1860

atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 1920

tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 1980

ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 2040

agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 2100

ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 2160

ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 2220

ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 2280

acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 2340

cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 2400

atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 2460

cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 2520

aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 2580

tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 2640

tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 2700

atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 2760

ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 2820

catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 2880

cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 2940

gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 3000

ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 3060

gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 3120

cgatacgcaa aagacgtgtg ggctgggtaa 3150

<210> 62

<211> 1049

<212> PRT

<213> 人工序列

<220>

<223> P450-BM3变体18 氨基酸序列

<400> 62

Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys

1 5 10 15

Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys

20 25 30

Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg

35 40 45

Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp

50 55 60

Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg

65 70 75 80

Asp Phe Ala Gly Asp Gly Leu Val Thr Ser Trp Thr His Glu Lys Asn

85 90 95

Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala

100 105 110

Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val

115 120 125

Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu

130 135 140

Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn

145 150 155 160

Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr

165 170 175

Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala

180 185 190

Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu

195 200 205

Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg

210 215 220

Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn

225 230 235 240

Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg

245 250 255

Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly

260 265 270

Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu

275 280 285

Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro

290 295 300

Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn

305 310 315 320

Glu Ala Leu Arg Leu Trp Pro Thr Leu Pro Ala Phe Ser Leu Tyr Ala

325 330 335

Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp

340 345 350

Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp

355 360 365

Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser

370 375 380

Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala

385 390 395 400

Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly

405 410 415

Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu

420 425 430

Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys

435 440 445

Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr

450 455 460

Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn

465 470 475 480

Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly

485 490 495

Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro

500 505 510

Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly

515 520 525

Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn

530 535 540

Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val

545 550 555 560

Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala

565 570 575

Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala

580 585 590

Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp

595 600 605

Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp

610 615 620

Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys

625 630 635 640

Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu

645 650 655

Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu

660 665 670

Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu

675 680 685

Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile

690 695 700

Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly

705 710 715 720

Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu

725 730 735

Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln

740 745 750

Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met

755 760 765

Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu

770 775 780

Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr

785 790 795 800

Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser

805 810 815

Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile

820 825 830

Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser

835 840 845

Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile

850 855 860

Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys

865 870 875 880

Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu

885 890 895

Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg

900 905 910

Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu

915 920 925

Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr

930 935 940

Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr

945 950 955 960

Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val

965 970 975

Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp

980 985 990

Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro

995 1000 1005

Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln

1010 1015 1020

Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu

1025 1030 1035

Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly

1040 1045

<210> 63

<211> 3150

<212> DNA

<213> 人工序列

<220>

<223> P450-BM3变体19 DNA 序列

<400> 63

atgacaatta aagaaatgcc tcagccaaaa acgtttggag agcttaaaaa tttaccgtta 60

ttaaacacag ataaaccggt tcaagctttg atgaaaattg cggatgaatt aggagaaatc 120

tttaaattcg aggcgcctgg tcgtgtaacg cgctacttat caagtcagcg tctaattaaa 180

gaagcatgcg atgaatcacg ctttgataaa aacttaagtc aagcgcttaa atttgtacgt 240

gattttgcag gagacgggtt agttacaagc tggacgcatg aaaaaaattg gaaaaaagcg 300

cataatatct tacttccaag cttcagtcag caggcaatga aaggctatca tgcgatgatg 360

gtcgatatcg ccgtgcagct tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt 420

gaagtaccgg aagacatgac acgtttaacg cttgatacaa ttggtctttg cggctttaac 480

tatcgcttta acagctttta ccgagatcag cctcatccat ttattacaag tatggtccgt 540

gcactggatg aagcaatgaa caagctgcag cgagcaaatc cagacgaccc agcttatgat 600

gaaaacaagc gccagtttca agaagatatc aaggtgatga acgacctagt agataaaatt 660

attgcagatc gcaaagcaag cggtgaacaa agcgatgatt tattaacgca catgctaaac 720

ggaaaagatc cagaaacggg tgagccgctt gatgacgaga acattcgcta tcaaattatt 780

acattcttaa ttgcgggaca cgaaacaaca agtggtcttt tatcatttgc gctgtatttc 840

ttagtgaaaa atccacatgt attacaaaaa gcagcagaag aagcagcacg agttctagta 900

gatcctgttc caagctacaa acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac 960

gaagcgctgc gcttatggcc aactgttcct gcgttttccc tatatgcaaa agaagatacg 1020

gtgcttggag gagaatatcc tttagaaaaa ggcgacgaac taatggttct gattcctcag 1080

cttcaccgtg ataaaacaat ttggggagac gatgtggaag agttccgtcc agagcgtttt 1140

gaaaatccaa gtgcgattcc gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg 1200

tgtatcggtc agcagttcgc tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa 1260

cactttgact ttgaagatca tacaaactac gagctggata ttaaagaaac tttaacgtta 1320

aaacctgaag gctttgtggt aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct 1380

tcacctagca ctgaacagtc tgctaaaaaa gtacgcaaaa aggcagaaaa cgctcataat 1440

acgccgctgc ttgtgctata cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat 1500

ttagcagata ttgcaatgag caaaggattt gcaccgcagg tcgcaacgct tgattcacac 1560

gccggaaatc ttccgcgcga aggagctgta ttaattgtaa cggcgtctta taacggtcat 1620

ccgcctgata acgcaaagca atttgtcgac tggttagacc aagcgtctgc tgatgaagta 1680

aaaggcgttc gctactccgt atttggatgc ggcgataaaa actgggctac tacgtatcaa 1740

aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 1800

cgcggtgaag cagatgcaag cgacgacttt gaaggcacct atgaagaatg gcgtgaacac 1860

atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 1920

tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 1980

ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 2040

agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 2100

ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 2160

ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 2220

ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 2280

acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 2340

cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 2400

atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 2460

cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 2520

aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 2580

tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 2640

tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 2700

atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 2760

ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 2820

catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 2880

cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 2940

gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 3000

ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 3060

gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 3120

cgatacgcaa aagacgtgtg ggctgggtaa 3150

<210> 64

<211> 1049

<212> PRT

<213> 人工序列

<220>

<223> P450-BM3变体19 氨基酸序列

<400> 64

Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys

1 5 10 15

Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys

20 25 30

Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg

35 40 45

Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp

50 55 60

Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg

65 70 75 80

Asp Phe Ala Gly Asp Gly Leu Val Thr Ser Trp Thr His Glu Lys Asn

85 90 95

Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala

100 105 110

Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val

115 120 125

Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu

130 135 140

Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn

145 150 155 160

Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr

165 170 175

Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala

180 185 190

Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu

195 200 205

Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg

210 215 220

Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn

225 230 235 240

Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg

245 250 255

Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly

260 265 270

Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu

275 280 285

Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro

290 295 300

Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn

305 310 315 320

Glu Ala Leu Arg Leu Trp Pro Thr Val Pro Ala Phe Ser Leu Tyr Ala

325 330 335

Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp

340 345 350

Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp

355 360 365

Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser

370 375 380

Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala

385 390 395 400

Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly

405 410 415

Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu

420 425 430

Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys

435 440 445

Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr

450 455 460

Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn

465 470 475 480

Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly

485 490 495

Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro

500 505 510

Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly

515 520 525

Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn

530 535 540

Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val

545 550 555 560

Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala

565 570 575

Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala

580 585 590

Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp

595 600 605

Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp

610 615 620

Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys

625 630 635 640

Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu

645 650 655

Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu

660 665 670

Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu

675 680 685

Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile

690 695 700

Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly

705 710 715 720

Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu

725 730 735

Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln

740 745 750

Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met

755 760 765

Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu

770 775 780

Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr

785 790 795 800

Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser

805 810 815

Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile

820 825 830

Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser

835 840 845

Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile

850 855 860

Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys

865 870 875 880

Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu

885 890 895

Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg

900 905 910

Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu

915 920 925

Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr

930 935 940

Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr

945 950 955 960

Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val

965 970 975

Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp

980 985 990

Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro

995 1000 1005

Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln

1010 1015 1020

Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu

1025 1030 1035

Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly

1040 1045

<210> 65

<211> 3150

<212> DNA

<213> 人工序列

<220>

<223> P450-BM3变体20 DNA 序列

<400> 65

atgacaatta aagaaatgcc tcagccaaaa acgtttggag agcttaaaaa tttaccgtta 60

ttaaacacag ataaaccggt tcaagctttg atgaaaattg cggatgaatt aggagaaatc 120

tttaaattcg aggcgcctgg tcgtgtaacg cgctacttat caagtcagcg tctaattaaa 180

gaagcatgcg atgaatcacg ctttgataaa aacttaagtc aagcgcttaa atttgtacgt 240

gattttgcag gagacgggtt agttacaagc tggacgcatg aaaaaaattg gaaaaaagcg 300

cataatatct tacttccaag cttcagtcag caggcaatga aaggctatca tgcgatgatg 360

gtcgatatcg ccgtgcagct tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt 420

gaagtaccgg aagacatgac acgtttaacg cttgatacaa ttggtctttg cggctttaac 480

tatcgcttta acagctttta ccgagatcag cctcatccat ttattacaag tatggtccgt 540

gcactggatg aagcaatgaa caagctgcag cgagcaaatc cagacgaccc agcttatgat 600

gaaaacaagc gccagtttca agaagatatc aaggtgatga acgacctagt agataaaatt 660

attgcagatc gcaaagcaag cggtgaacaa agcgatgatt tattaacgca catgctaaac 720

ggaaaagatc cagaaacggg tgagccgctt gatgacgaga acattcgcta tcaaattatt 780

acattcttaa ttgcgggaca cgaaacaaca agtggtcttt tatcatttgc gctgtatttc 840

ttagtgaaaa atccacatgt attacaaaaa gcagcagaag aagcagcacg agttctagta 900

gatcctgttc caagctacaa acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac 960

gaagcgctgc gcttatggcc aacttttcct gcgttttccc tatatgcaaa agaagatacg 1020

gtgcttggag gagaatatcc tttagaaaaa ggcgacgaac taatggttct gattcctcag 1080

cttcaccgtg ataaaacaat ttggggagac gatgtggaag agttccgtcc agagcgtttt 1140

gaaaatccaa gtgcgattcc gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg 1200

tgtatcggtc agcagttcgc tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa 1260

cactttgact ttgaagatca tacaaactac gagctggata ttaaagaaac tttaacgtta 1320

aaacctgaag gctttgtggt aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct 1380

tcacctagca ctgaacagtc tgctaaaaaa gtacgcaaaa aggcagaaaa cgctcataat 1440

acgccgctgc ttgtgctata cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat 1500

ttagcagata ttgcaatgag caaaggattt gcaccgcagg tcgcaacgct tgattcacac 1560

gccggaaatc ttccgcgcga aggagctgta ttaattgtaa cggcgtctta taacggtcat 1620

ccgcctgata acgcaaagca atttgtcgac tggttagacc aagcgtctgc tgatgaagta 1680

aaaggcgttc gctactccgt atttggatgc ggcgataaaa actgggctac tacgtatcaa 1740

aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 1800

cgcggtgaag cagatgcaag cgacgacttt gaaggcacct atgaagaatg gcgtgaacac 1860

atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 1920

tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 1980

ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 2040

agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 2100

ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 2160

ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 2220

ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 2280

acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 2340

cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 2400

atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 2460

cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 2520

aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 2580

tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 2640

tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 2700

atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 2760

ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 2820

catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 2880

cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 2940

gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 3000

ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 3060

gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 3120

cgatacgcaa aagacgtgtg ggctgggtaa 3150

<210> 66

<211> 1049

<212> PRT

<213> 人工序列

<220>

<223> P450-BM3变体20 氨基酸序列

<400> 66

Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys

1 5 10 15

Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys

20 25 30

Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg

35 40 45

Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp

50 55 60

Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg

65 70 75 80

Asp Phe Ala Gly Asp Gly Leu Val Thr Ser Trp Thr His Glu Lys Asn

85 90 95

Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala

100 105 110

Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val

115 120 125

Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu

130 135 140

Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn

145 150 155 160

Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr

165 170 175

Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala

180 185 190

Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu

195 200 205

Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg

210 215 220

Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn

225 230 235 240

Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg

245 250 255

Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly

260 265 270

Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu

275 280 285

Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro

290 295 300

Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn

305 310 315 320

Glu Ala Leu Arg Leu Trp Pro Thr Phe Pro Ala Phe Ser Leu Tyr Ala

325 330 335

Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp

340 345 350

Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp

355 360 365

Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser

370 375 380

Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala

385 390 395 400

Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly

405 410 415

Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu

420 425 430

Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys

435 440 445

Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr

450 455 460

Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn

465 470 475 480

Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly

485 490 495

Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro

500 505 510

Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly

515 520 525

Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn

530 535 540

Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val

545 550 555 560

Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala

565 570 575

Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala

580 585 590

Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp

595 600 605

Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp

610 615 620

Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys

625 630 635 640

Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu

645 650 655

Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu

660 665 670

Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu

675 680 685

Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile

690 695 700

Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly

705 710 715 720

Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu

725 730 735

Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln

740 745 750

Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met

755 760 765

Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu

770 775 780

Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr

785 790 795 800

Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser

805 810 815

Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile

820 825 830

Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser

835 840 845

Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile

850 855 860

Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys

865 870 875 880

Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu

885 890 895

Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg

900 905 910

Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu

915 920 925

Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr

930 935 940

Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr

945 950 955 960

Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val

965 970 975

Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp

980 985 990

Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro

995 1000 1005

Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln

1010 1015 1020

Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu

1025 1030 1035

Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly

1040 1045

<210> 67

<211> 3150

<212> DNA

<213> 人工序列

<220>

<223> P450-BM3变体23 DNA 序列

<400> 67

atgacaatta aagaaatgcc tcagccaaaa acgtttggag agcttaaaaa tttaccgtta 60

ttaaacacag ataaaccggt tcaagctttg atgaaaattg cggatgaatt aggagaaatc 120

tttaaattcg aggcgcctgg tcgtgtaacg cgctacttat caagtcagcg tctaattaaa 180

gaagcatgcg atgaatcacg ctttgataaa aacttaagtc aagcgcttaa atttgtacgt 240

gattttgcag gagacgggtt atttacaagc tggacgcatg aaaaaaattg gaaaaaagcg 300

cataatatct tacttccaag cttcagtcag caggcaatga aaggctatca tgcgatgatg 360

gtcgatatcg ccgtgcagct tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt 420

gaagtaccgg aagacatgac acgtttaacg cttgatacaa ttggtctttg cggctttaac 480

tatcgcttta acagctttta ccgagatcag cctcatccat ttattacaag tatggtccgt 540

gcactggatg aagcaatgaa caagctgcag cgagcaaatc cagacgaccc agcttatgat 600

gaaaacaagc gccagtttca agaagatatc aaggtgatga acgacctagt agataaaatt 660

attgcagatc gcaaagcaag cggtgaacaa agcgatgatt tattaacgca catgctaaac 720

ggaaaagatc cagaaacggg tgagccgctt gatgacgaga acattcgcta tcaaattatt 780

acattcttaa ttgcgggaca cgaaacaaca agtggtcttt tatcatttgc gctgtatttc 840

ttagtgaaaa atccacatgt attacaaaaa gcagcagaag aagcagcacg agttctagta 900

gatcctgttc caagctacaa acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac 960

gaagcgctgc gcttatggcc aactgttcct gcgttttccc tatatgcaaa agaagatacg 1020

gtgcttggag gagaatatcc tttagaaaaa ggcgacgaac taatggttct gattcctcag 1080

cttcaccgtg ataaaacaat ttggggagac gatgtggaag agttccgtcc agagcgtttt 1140

gaaaatccaa gtgcgattcc gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg 1200

tgtatcggtc agcagttcgc tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa 1260

cactttgact ttgaagatca tacaaactac gagctggata ttaaagaaac tttaacgtta 1320

aaacctgaag gctttgtggt aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct 1380

tcacctagca ctgaacagtc tgctaaaaaa gtacgcaaaa aggcagaaaa cgctcataat 1440

acgccgctgc ttgtgctata cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat 1500

ttagcagata ttgcaatgag caaaggattt gcaccgcagg tcgcaacgct tgattcacac 1560

gccggaaatc ttccgcgcga aggagctgta ttaattgtaa cggcgtctta taacggtcat 1620

ccgcctgata acgcaaagca atttgtcgac tggttagacc aagcgtctgc tgatgaagta 1680

aaaggcgttc gctactccgt atttggatgc ggcgataaaa actgggctac tacgtatcaa 1740

aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 1800

cgcggtgaag cagatgcaag cgacgacttt gaaggcacct atgaagaatg gcgtgaacac 1860

atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 1920

tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 1980

ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 2040

agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 2100

ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 2160

ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 2220

ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 2280

acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 2340

cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 2400

atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 2460

cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 2520

aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 2580

tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 2640

tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 2700

atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 2760

ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 2820

catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 2880

cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 2940

gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 3000

ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 3060

gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 3120

cgatacgcaa aagacgtgtg ggctgggtaa 3150

<210> 68

<211> 1049

<212> PRT

<213> 人工序列

<220>

<223> P450-BM3变体23 氨基酸序列

<400> 68

Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys

1 5 10 15

Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys

20 25 30

Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg

35 40 45

Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp

50 55 60

Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg

65 70 75 80

Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn

85 90 95

Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala

100 105 110

Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val

115 120 125

Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu

130 135 140

Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn

145 150 155 160

Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr

165 170 175

Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala

180 185 190

Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu

195 200 205

Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg

210 215 220

Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn

225 230 235 240

Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg

245 250 255

Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly

260 265 270

Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu

275 280 285

Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro

290 295 300

Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn

305 310 315 320

Glu Ala Leu Arg Leu Trp Pro Thr Val Pro Ala Phe Ser Leu Tyr Ala

325 330 335

Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp

340 345 350

Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp

355 360 365

Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser

370 375 380

Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala

385 390 395 400

Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly

405 410 415

Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu

420 425 430

Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys

435 440 445

Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr

450 455 460

Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn

465 470 475 480

Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly

485 490 495

Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro

500 505 510

Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly

515 520 525

Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn

530 535 540

Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val

545 550 555 560

Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala

565 570 575

Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala

580 585 590

Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp

595 600 605

Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp

610 615 620

Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys

625 630 635 640

Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu

645 650 655

Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu

660 665 670

Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu

675 680 685

Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile

690 695 700

Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly

705 710 715 720

Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu

725 730 735

Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln

740 745 750

Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met

755 760 765

Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu

770 775 780

Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr

785 790 795 800

Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser

805 810 815

Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile

820 825 830

Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser

835 840 845

Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile

850 855 860

Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys

865 870 875 880

Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu

885 890 895

Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg

900 905 910

Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu

915 920 925

Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr

930 935 940

Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr

945 950 955 960

Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val

965 970 975

Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp

980 985 990

Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro

995 1000 1005

Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln

1010 1015 1020

Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu

1025 1030 1035

Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly

1040 1045

<210> 69

<211> 2051

<212> DNA

<213> 檀香树(Santalum album)

<400> 69

atgtacgtat ccatcagcaa tgatcgacct tataaaggag ccgagacact ctcaccttca 60

atccactcat ccctacattc ttttgctaac tcctttgttg ccagcaagta tatctcttac 120

gttaaacgtt ttacttcctc aacatgtctc cggcaacagc cgttatcctc actctcctcg 180

tggccctagg gctatccatc cttttgcggc ggcgccaaaa aagaaataat ctacctcccg 240

gtccacccgc tttaccgatc atcggaaaca tccacatatt ggggaccctt cctcaccaga 300

gcctctacaa cttggccaag aagtatggtc ccatcatgtc aatgaggctg gggctcgtgc 360

cggctgttgt gatatcctct ccggaggccg ccgagctcgt cctcaagacc cacgatatcg 420

ttttcgccag ccggcccaga ctccaagttg cggactactt ccattacggg acaaagggcg 480

tcatcctgac ggagtatggt acatattggc gcaacatgcg aaggctgtgc accgtgaagc 540

ttctcaacac ggtgaaaatc gattctttcg cagggacaag gaagaaggag gtggcatcgt 600

tcgtgcagtc ccttaaggag gcttcggtgg cacacaaaat ggtgaatttg agcgcgaggg 660

tggcgaacgt cattgaaaac atggtgtgcc ttatggtgat cgggcgaagt agcgatgaga 720

ggtttaagct aaaggaggtc atccaggagg cagcgcagtt ggcgggagct ttcaatatag 780

gggattatgt tccattcctt atgccccttg acctacaggg attaactcgg cgcataaagt 840

caggaagtaa agctttcgac gacatcttgg aagtcataat cgacgagcac gtgcaagaca 900

ttaaggacca tgatgatgaa caacatggag acttcattga tgtgttgctg gcaatgatga 960

acaagcccat ggattcgcgg gagggtctta gtatcattga ccgaacaaac atcaaagcga 1020

tcctagtgga catgattgga gctgcaatgg acacttcaac aagtggcgtc gagtgggcga 1080

tttcagagct catcaagcat ccgcgggtaa tgaaaaagct ccaagacgag gtcaaaactg 1140

tcatcggaat gaataggatg gtcgaggagg ccgacttgcc taagctacca tacctcgaca 1200

tggtagtgaa agagaccatg aggttacacc ctcctggacc attgctcgtg ccccgagagt 1260

ccatggaaga catcacaatc aacggatact acatacctaa gaaatcgcga atcattgtca 1320

acgcctgggc aattgggcgt gatacaaacg cctggtctaa taacgcgcac gagttcttcc 1380

cagagaggtt tatgagtagc aatgtggact tacagggaca agatttccaa cttatcccat 1440

tcgggtcagg tcggagaggg tgccccggga tgcgcctagg cctcacaacc gttcgattag 1500

tgttagcgca gctcattcat tgtttcgact tggagcttcc taagggaacc gtggcgaccg 1560

acttggacat gagtgagaaa ttcgggttgg caatgcccag agcccagcac ttgcttgcat 1620

ttccaaccta tcgcttggag tcctaaacca ttgaggaaga tgcgtttata tttcatattg 1680

cagtgttaca ataagtagca gtcgttttca tggtgaagag gcaattcccc ctacactacc 1740

tgtcttatgc tatgcccctc cccaactttc accgtatgtg tcttgtcatc atgtatcatg 1800

tccacatcaa taagatatta tatagaaatt gtcggtacgc caagatcgga ctcaatatgt 1860

atcagctttg agctctgtac acaaaatttg atacacgaac agagaaggtc gcgaattttg 1920

ggccactcgt ctcagatata tacccttcaa gtggctaatg gggagatccc tctcctttgc 1980

atttaaagcc tctgcttccc gaaccctagc ccacaaaatt ttggccgaaa ccggataggc 2040

atacacgaca g 2051

<210> 70

<211> 1503

<212> DNA

<213> 檀香树(Santalum album)

<400> 70

atgtctccgg caacagccgt tatcctcact ctcctcgtgg ccctagggct atccatcctt 60

ttgcggcggc gccaaaaaag aaataatcta cctcccggtc cacccgcttt accgatcatc 120

ggaaacatcc acatattggg gacccttcct caccagagcc tctacaactt ggccaagaag 180

tatggtccca tcatgtcaat gaggctgggg ctcgtgccgg ctgttgtgat atcctctccg 240

gaggccgccg agctcgtcct caagacccac gatatcgttt tcgccagccg gcccagactc 300

caagttgcgg actacttcca ttacgggaca aagggcgtca tcctgacgga gtatggtaca 360

tattggcgca acatgcgaag gctgtgcacc gtgaagcttc tcaacacggt gaaaatcgat 420

tctttcgcag ggacaaggaa gaaggaggtg gcatcgttcg tgcagtccct taaggaggct 480

tcggtggcac acaaaatggt gaatttgagc gcgagggtgg cgaacgtcat tgaaaacatg 540

gtgtgcctta tggtgatcgg gcgaagtagc gatgagaggt ttaagctaaa ggaggtcatc 600

caggaggcag cgcagttggc gggagctttc aatatagggg attatgttcc attccttatg 660

ccccttgacc tacagggatt aactcggcgc ataaagtcag gaagtaaagc tttcgacgac 720

atcttggaag tcataatcga cgagcacgtg caagacatta aggaccatga tgatgaacaa 780

catggagact tcattgatgt gttgctggca atgatgaaca agcccatgga ttcgcgggag 840

ggtcttagta tcattgaccg aacaaacatc aaagcgatcc tagtggacat gattggagct 900

gcaatggaca cttcaacaag tggcgtcgag tgggcgattt cagagctcat caagcatccg 960

cgggtaatga aaaagctcca agacgaggtc aaaactgtca tcggaatgaa taggatggtc 1020

gaggaggccg acttgcctaa gctaccatac ctcgacatgg tagtgaaaga gaccatgagg 1080

ttacaccctc ctggaccatt gctcgtgccc cgagagtcca tggaagacat cacaatcaac 1140

ggatactaca tacctaagaa atcgcgaatc attgtcaacg cctgggcaat tgggcgtgat 1200

acaaacgcct ggtctaataa cgcgcacgag ttcttcccag agaggtttat gagtagcaat 1260

gtggacttac agggacaaga tttccaactt atcccattcg ggtcaggtcg gagagggtgc 1320

cccgggatgc gcctaggcct cacaaccgtt cgattagtgt tagcgcagct cattcattgt 1380

ttcgacttgg agcttcctaa gggaaccgtg gcgaccgact tggacatgag tgagaaattc 1440

gggttggcaa tgcccagagc ccagcacttg cttgcatttc caacctatcg cttggagtcc 1500

taa 1503

<210> 71

<211> 500

<212> PRT

<213> 檀香树(Santalum album)

<400> 71

Met Ser Pro Ala Thr Ala Val Ile Leu Thr Leu Leu Val Ala Leu Gly

1 5 10 15

Leu Ser Ile Leu Leu Arg Arg Arg Gln Lys Arg Asn Asn Leu Pro Pro

20 25 30

Gly Pro Pro Ala Leu Pro Ile Ile Gly Asn Ile His Ile Leu Gly Thr

35 40 45

Leu Pro His Gln Ser Leu Tyr Asn Leu Ala Lys Lys Tyr Gly Pro Ile

50 55 60

Met Ser Met Arg Leu Gly Leu Val Pro Ala Val Val Ile Ser Ser Pro

65 70 75 80

Glu Ala Ala Glu Leu Val Leu Lys Thr His Asp Ile Val Phe Ala Ser

85 90 95

Arg Pro Arg Leu Gln Val Ala Asp Tyr Phe His Tyr Gly Thr Lys Gly

100 105 110

Val Ile Leu Thr Glu Tyr Gly Thr Tyr Trp Arg Asn Met Arg Arg Leu

115 120 125

Cys Thr Val Lys Leu Leu Asn Thr Val Lys Ile Asp Ser Phe Ala Gly

130 135 140

Thr Arg Lys Lys Glu Val Ala Ser Phe Val Gln Ser Leu Lys Glu Ala

145 150 155 160

Ser Val Ala His Lys Met Val Asn Leu Ser Ala Arg Val Ala Asn Val

165 170 175

Ile Glu Asn Met Val Cys Leu Met Val Ile Gly Arg Ser Ser Asp Glu

180 185 190

Arg Phe Lys Leu Lys Glu Val Ile Gln Glu Ala Ala Gln Leu Ala Gly

195 200 205

Ala Phe Asn Ile Gly Asp Tyr Val Pro Phe Leu Met Pro Leu Asp Leu

210 215 220

Gln Gly Leu Thr Arg Arg Ile Lys Ser Gly Ser Lys Ala Phe Asp Asp

225 230 235 240

Ile Leu Glu Val Ile Ile Asp Glu His Val Gln Asp Ile Lys Asp His

245 250 255

Asp Asp Glu Gln His Gly Asp Phe Ile Asp Val Leu Leu Ala Met Met

260 265 270

Asn Lys Pro Met Asp Ser Arg Glu Gly Leu Ser Ile Ile Asp Arg Thr

275 280 285

Asn Ile Lys Ala Ile Leu Val Asp Met Ile Gly Ala Ala Met Asp Thr

290 295 300

Ser Thr Ser Gly Val Glu Trp Ala Ile Ser Glu Leu Ile Lys His Pro

305 310 315 320

Arg Val Met Lys Lys Leu Gln Asp Glu Val Lys Thr Val Ile Gly Met

325 330 335

Asn Arg Met Val Glu Glu Ala Asp Leu Pro Lys Leu Pro Tyr Leu Asp

340 345 350

Met Val Val Lys Glu Thr Met Arg Leu His Pro Pro Gly Pro Leu Leu

355 360 365

Val Pro Arg Glu Ser Met Glu Asp Ile Thr Ile Asn Gly Tyr Tyr Ile

370 375 380

Pro Lys Lys Ser Arg Ile Ile Val Asn Ala Trp Ala Ile Gly Arg Asp

385 390 395 400

Thr Asn Ala Trp Ser Asn Asn Ala His Glu Phe Phe Pro Glu Arg Phe

405 410 415

Met Ser Ser Asn Val Asp Leu Gln Gly Gln Asp Phe Gln Leu Ile Pro

420 425 430

Phe Gly Ser Gly Arg Arg Gly Cys Pro Gly Met Arg Leu Gly Leu Thr

435 440 445

Thr Val Arg Leu Val Leu Ala Gln Leu Ile His Cys Phe Asp Leu Glu

450 455 460

Leu Pro Lys Gly Thr Val Ala Thr Asp Leu Asp Met Ser Glu Lys Phe

465 470 475 480

Gly Leu Ala Met Pro Arg Ala Gln His Leu Leu Ala Phe Pro Thr Tyr

485 490 495

Arg Leu Glu Ser

500

<210> 72

<211> 1534

<212> DNA

<213> 人工序列

<220>

<223> SaCP120293, 用作SaCP816优化的DNA序列

<400> 72

aggaggtaaa acatatggca ctgttgttgg cggttttctg gagcgctttg attattctgg 60

ttagcatctt attgcgtcgt cgtcaaaaac gcaacaattt gccaccgggc ccaccggccc 120

tgccgatcat cggtaacatt cacattctgg gcaccctgcc gcaccagagc ctgtacaatc 180

tggcgaagaa gtacggtccg atcatgtcca tgcgtttggg cttggttccg gcggtggtca 240

tcagcagccc ggaagcggcc gagctggtcc tgaaaaccca cgacatcgtt tttgcttctc 300

gccctcgtct gcaagttgca gattactttc actatggcac caaaggcgtg attctgaccg 360

aatatggtac ctactggcgt aacatgcgtc gcctgtgcac ggtcaaactg ctgaacaccg 420

ttaagattga tagctttgca ggcacccgca agaaagaagt cgctagcttc gttcagagcc 480

tgaaagaagc aagcgtggcg cacaaaatgg ttaacctgtc cgcacgcgtc gctaatgtta 540

ttgagaatat ggtttgtctg atggttattg gtagatcgtc tgacgagcgt ttcaagctga 600

aagaagtgat ccaagaagcg gcacagctgg cgggtgcctt caatattggt gactatgtcc 660

cgtttctgat gccgctggat ctgcagggcc tgactcgccg tatcaagagc ggtagcaagg 720

cattcgatga catcctcgag gtcattatcg acgagcatgt gcaagacatt aaagatcatg 780

acgatgagca gcatggtgac ttcatcgacg tgctgctggc gatgatgaat aagccgatgg 840

attctcgtga gggtctgtcc atcattgatc gcacgaacat taaagcgatc ctggtggata 900

tgatcggtgc cgcgatggac acgagcacca gcggtgtgga gtgggcgatt tcggagctga 960

ttaagcatcc tcgtgtcatg aagaaactgc aagacgaagt gaaaaccgta atcggtatga 1020

accgcatggt ggaagaagcg gatctgccga aactgccgta cctggacatg gttgtcaagg 1080

aaacgatgcg tctgcatccg ccaggcccgc tgctggtgcc gcgtgaaagc atggaagata 1140

ttacgatcaa cggttactat atcccgaaga aatcccgcat tattgtgaat gcatgggcga 1200

tcggccgtga caccaacgcc tggagcaata atgcgcacga gtttttccct gagcgtttta 1260

tgagctctaa cgttgatctg caaggccagg acttccagct gatcccgttc ggtagcggtc 1320

gtcgcggttg tccgggcatg cgtctgggtc tgacgacggt ccgcttggtg ctggcccaac 1380

tgattcactg cttcgacctg gagcttccga agggcaccgt cgcgactgac ctggatatga 1440

gcgagaagtt tggtctggca atgccgcgtg cgcagcactt actggccttt ccgacctacc 1500

gtctggagag ctaagtcgac accatggaaa gctt 1534

<210> 73

<211> 499

<212> PRT

<213> 人工序列

<220>

<223> SaCP120293 氨基酸序列, N-末端经修饰的SaCP816

<400> 73

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Val

1 5 10 15

Ser Ile Leu Leu Arg Arg Arg Gln Lys Arg Asn Asn Leu Pro Pro Gly

20 25 30

Pro Pro Ala Leu Pro Ile Ile Gly Asn Ile His Ile Leu Gly Thr Leu

35 40 45

Pro His Gln Ser Leu Tyr Asn Leu Ala Lys Lys Tyr Gly Pro Ile Met

50 55 60

Ser Met Arg Leu Gly Leu Val Pro Ala Val Val Ile Ser Ser Pro Glu

65 70 75 80

Ala Ala Glu Leu Val Leu Lys Thr His Asp Ile Val Phe Ala Ser Arg

85 90 95

Pro Arg Leu Gln Val Ala Asp Tyr Phe His Tyr Gly Thr Lys Gly Val

100 105 110

Ile Leu Thr Glu Tyr Gly Thr Tyr Trp Arg Asn Met Arg Arg Leu Cys

115 120 125

Thr Val Lys Leu Leu Asn Thr Val Lys Ile Asp Ser Phe Ala Gly Thr

130 135 140

Arg Lys Lys Glu Val Ala Ser Phe Val Gln Ser Leu Lys Glu Ala Ser

145 150 155 160

Val Ala His Lys Met Val Asn Leu Ser Ala Arg Val Ala Asn Val Ile

165 170 175

Glu Asn Met Val Cys Leu Met Val Ile Gly Arg Ser Ser Asp Glu Arg

180 185 190

Phe Lys Leu Lys Glu Val Ile Gln Glu Ala Ala Gln Leu Ala Gly Ala

195 200 205

Phe Asn Ile Gly Asp Tyr Val Pro Phe Leu Met Pro Leu Asp Leu Gln

210 215 220

Gly Leu Thr Arg Arg Ile Lys Ser Gly Ser Lys Ala Phe Asp Asp Ile

225 230 235 240

Leu Glu Val Ile Ile Asp Glu His Val Gln Asp Ile Lys Asp His Asp

245 250 255

Asp Glu Gln His Gly Asp Phe Ile Asp Val Leu Leu Ala Met Met Asn

260 265 270

Lys Pro Met Asp Ser Arg Glu Gly Leu Ser Ile Ile Asp Arg Thr Asn

275 280 285

Ile Lys Ala Ile Leu Val Asp Met Ile Gly Ala Ala Met Asp Thr Ser

290 295 300

Thr Ser Gly Val Glu Trp Ala Ile Ser Glu Leu Ile Lys His Pro Arg

305 310 315 320

Val Met Lys Lys Leu Gln Asp Glu Val Lys Thr Val Ile Gly Met Asn

325 330 335

Arg Met Val Glu Glu Ala Asp Leu Pro Lys Leu Pro Tyr Leu Asp Met

340 345 350

Val Val Lys Glu Thr Met Arg Leu His Pro Pro Gly Pro Leu Leu Val

355 360 365

Pro Arg Glu Ser Met Glu Asp Ile Thr Ile Asn Gly Tyr Tyr Ile Pro

370 375 380

Lys Lys Ser Arg Ile Ile Val Asn Ala Trp Ala Ile Gly Arg Asp Thr

385 390 395 400

Asn Ala Trp Ser Asn Asn Ala His Glu Phe Phe Pro Glu Arg Phe Met

405 410 415

Ser Ser Asn Val Asp Leu Gln Gly Gln Asp Phe Gln Leu Ile Pro Phe

420 425 430

Gly Ser Gly Arg Arg Gly Cys Pro Gly Met Arg Leu Gly Leu Thr Thr

435 440 445

Val Arg Leu Val Leu Ala Gln Leu Ile His Cys Phe Asp Leu Glu Leu

450 455 460

Pro Lys Gly Thr Val Ala Thr Asp Leu Asp Met Ser Glu Lys Phe Gly

465 470 475 480

Leu Ala Met Pro Arg Ala Gln His Leu Leu Ala Phe Pro Thr Tyr Arg

485 490 495

Leu Glu Ser

<210> 74

<211> 3672

<212> DNA

<213> 人工序列

<220>

<223> 用作编码SaCP816和CPRm的合成操纵子

<400> 74

catatggcac tgttgttggc ggttttctgg agcgctttga ttattctggt tagcatctta 60

ttgcgtcgtc gtcaaaaacg caacaatttg ccaccgggcc caccggccct gccgatcatc 120

ggtaacattc acattctggg caccctgccg caccagagcc tgtacaatct ggcgaagaag 180

tacggtccga tcatgtccat gcgtttgggc ttggttccgg cggtggtcat cagcagcccg 240

gaagcggccg agctggtcct gaaaacccac gacatcgttt ttgcttctcg ccctcgtctg 300

caagttgcag attactttca ctatggcacc aaaggcgtga ttctgaccga atatggtacc 360

tactggcgta acatgcgtcg cctgtgcacg gtcaaactgc tgaacaccgt taagattgat 420

agctttgcag gcacccgcaa gaaagaagtc gctagcttcg ttcagagcct gaaagaagca 480

agcgtggcgc acaaaatggt taacctgtcc gcacgcgtcg ctaatgttat tgagaatatg 540

gtttgtctga tggttattgg tagatcgtct gacgagcgtt tcaagctgaa agaagtgatc 600

caagaagcgg cacagctggc gggtgccttc aatattggtg actatgtccc gtttctgatg 660

ccgctggatc tgcagggcct gactcgccgt atcaagagcg gtagcaaggc attcgatgac 720

atcctcgagg tcattatcga cgagcatgtg caagacatta aagatcatga cgatgagcag 780

catggtgact tcatcgacgt gctgctggcg atgatgaata agccgatgga ttctcgtgag 840

ggtctgtcca tcattgatcg cacgaacatt aaagcgatcc tggtggatat gatcggtgcc 900

gcgatggaca cgagcaccag cggtgtggag tgggcgattt cggagctgat taagcatcct 960

cgtgtcatga agaaactgca agacgaagtg aaaaccgtaa tcggtatgaa ccgcatggtg 1020

gaagaagcgg atctgccgaa actgccgtac ctggacatgg ttgtcaagga aacgatgcgt 1080

ctgcatccgc caggcccgct gctggtgccg cgtgaaagca tggaagatat tacgatcaac 1140

ggttactata tcccgaagaa atcccgcatt attgtgaatg catgggcgat cggccgtgac 1200

accaacgcct ggagcaataa tgcgcacgag tttttccctg agcgttttat gagctctaac 1260

gttgatctgc aaggccagga cttccagctg atcccgttcg gtagcggtcg tcgcggttgt 1320

ccgggcatgc gtctgggtct gacgacggtc cgcttggtgc tggcccaact gattcactgc 1380

ttcgacctgg agcttccgaa gggcaccgtc gcgactgacc tggatatgag cgagaagttt 1440

ggtctggcaa tgccgcgtgc gcagcactta ctggcctttc cgacctaccg tctggagagc 1500

taagtcgact aactttaaga aggagatata tccatggaac ctagctctca gaaactgtct 1560

ccgttggaat ttgttgctgc tatcctgaag ggcgactaca gcagcggtca ggttgaaggt 1620

ggtccaccgc caggtctggc agctatgttg atggaaaata aggatttggt gatggttctg 1680

acgacgtccg tggcagtcct gatcggctgt gtcgtggtcc tggcatggcg tcgtgcggca 1740

ggtagcggta agtacaagca acctgaactg cctaaactgg tggtcccgaa agcagccgaa 1800

ccggaggagg cagaggatga taaaaccaag atcagcgtgt ttttcggcac ccaaaccggt 1860

acggcagaag gtttcgcgaa ggcttttgtt gaagaggcca aggcgcgtta tcagcaggcc 1920

cgtttcaaag ttatcgacct ggacgactat gcggcagacg atgacgagta cgaagagaaa 1980

ctgaagaagg aaaacttggc attcttcttc ttggcgtcct acggtgacgg cgagccgacg 2040

gacaacgcgg cacgctttta caaatggttt acggagggta aggaccgtgg tgaatggctg 2100

aacaatctgc agtacggcgt ttttggtctg ggtaaccgtc aatatgagca tttcaataag 2160

atcgccattg tcgtcgatga tctgatcttc gagcaaggtg gcaagaagct ggttccggtg 2220

ggtctgggtg acgatgacca gtgcattgag gatgattttg cggcgtggcg tgaactggtc 2280

tggccggaac tggataaact gctgcgtaac gaagacgacg ctaccgtggc aaccccgtac 2340

agcgccgctg tgctgcaata ccgcgtggtt ttccacgatc acattgacgg cctgattagc 2400

gaaaacggta gcccgaacgg tcatgctaat ggcaataccg tgtacgatgc gcaacacccg 2460

tgccgtagca acgtcgcggt caagaaggaa ttgcatactc cggcgagcga tcgcagctgc 2520

acccacctgg aatttaacat tagcggtacc ggcctgatgt acgagacggg tgaccacgtc 2580

ggtgtgtatt gcgagaacct gttggaaacc gtggaggagg ccgagaagtt gttgaacctg 2640

agcccgcaga cgtacttctc cgttcacacc gacaacgagg acggtacgcc gttgagcggc 2700

agcagcctgc cgccaccgtt tccgccgtgc accttgcgca cggcattgac caaatacgca 2760

gacttgactt ctgcaccgaa aaagtcggtg ctggtggcgc tggccgagta cgcatctgac 2820

cagggtgaag cggatcgttt gcgtttcttg gcgagcccga gcggcaaaga ggaatatgca 2880

cagtacatct tggcaagcca gcgcacgctg ctggaggtca tggcggagtt cccgtcggcg 2940

aaaccgccgc tgggtgtctt tttcgcgggt gtcgctccgc gcctgcagcc gcgtttctat 3000

tccattagct ctagcccgaa gatcgcaccg ttccgtattc acgtgacctg cgccctggtt 3060

tatgacaaat cccctaccgg tcgcgttcat aagggcatct gtagcacgtg gatgaaaaat 3120

gcggtcccgc tggaagaaag caacgattgt tcctgggctc cgatcttcgt ccgcaacagc 3180

aacttcaagc tgccgaccga cccgaaggtt ccgattatca tgattggtcc gggtaccggt 3240

ctggcccctt ttcgtggctt tttgcaagag cgcttggcgt tgaaagagag cggtgctgaa 3300

ttgggtccgg cgatcttgtt ctttggttgc cgtaaccgta aaatggactt tatttacgag 3360

gatgaactga atgatttcgt caaagcgggc gttgtcagcg agctgatcgt cgcttttagc 3420

cgcgaaggcc cgatgaaaga atacgtgcaa cacaaaatga gccaacgtgc ctccgatgtg 3480

tggaacatca ttagcgacgg tggttatgtt tatgtttgcg gtgacgcgaa gggtatggct 3540

cgtgatgttc accgtaccct gcataccatc gcacaggagc aaggtagcat gtccagctcg 3600

gaggccgaag gtatggtcaa aaacctgcaa accaccggtc gttacctgcg tgatgtgtgg 3660

taataaaagc tt 3672

<210> 75

<211> 5349

<212> DNA

<213> 人工序列

<220>

<223> 用作编码SaCP816、CPRm和ClASS的合成操纵子

<400> 75

catatggcac tgttgttggc ggttttctgg agcgctttga ttattctggt tagcatctta 60

ttgcgtcgtc gtcaaaaacg caacaatttg ccaccgggcc caccggccct gccgatcatc 120

ggtaacattc acattctggg caccctgccg caccagagcc tgtacaatct ggcgaagaag 180

tacggtccga tcatgtccat gcgtttgggc ttggttccgg cggtggtcat cagcagcccg 240

gaagcggccg agctggtcct gaaaacccac gacatcgttt ttgcttctcg ccctcgtctg 300

caagttgcag attactttca ctatggcacc aaaggcgtga ttctgaccga atatggtacc 360

tactggcgta acatgcgtcg cctgtgcacg gtcaaactgc tgaacaccgt taagattgat 420

agctttgcag gcacccgcaa gaaagaagtc gctagcttcg ttcagagcct gaaagaagca 480

agcgtggcgc acaaaatggt taacctgtcc gcacgcgtcg ctaatgttat tgagaatatg 540

gtttgtctga tggttattgg tagatcgtct gacgagcgtt tcaagctgaa agaagtgatc 600

caagaagcgg cacagctggc gggtgccttc aatattggtg actatgtccc gtttctgatg 660

ccgctggatc tgcagggcct gactcgccgt atcaagagcg gtagcaaggc attcgatgac 720

atcctcgagg tcattatcga cgagcatgtg caagacatta aagatcatga cgatgagcag 780

catggtgact tcatcgacgt gctgctggcg atgatgaata agccgatgga ttctcgtgag 840

ggtctgtcca tcattgatcg cacgaacatt aaagcgatcc tggtggatat gatcggtgcc 900

gcgatggaca cgagcaccag cggtgtggag tgggcgattt cggagctgat taagcatcct 960

cgtgtcatga agaaactgca agacgaagtg aaaaccgtaa tcggtatgaa ccgcatggtg 1020

gaagaagcgg atctgccgaa actgccgtac ctggacatgg ttgtcaagga aacgatgcgt 1080

ctgcatccgc caggcccgct gctggtgccg cgtgaaagca tggaagatat tacgatcaac 1140

ggttactata tcccgaagaa atcccgcatt attgtgaatg catgggcgat cggccgtgac 1200

accaacgcct ggagcaataa tgcgcacgag tttttccctg agcgttttat gagctctaac 1260

gttgatctgc aaggccagga cttccagctg atcccgttcg gtagcggtcg tcgcggttgt 1320

ccgggcatgc gtctgggtct gacgacggtc cgcttggtgc tggcccaact gattcactgc 1380

ttcgacctgg agcttccgaa gggcaccgtc gcgactgacc tggatatgag cgagaagttt 1440

ggtctggcaa tgccgcgtgc gcagcactta ctggcctttc cgacctaccg tctggagagc 1500

taagtcgact aactttaaga aggagatata tccatggaac ctagctctca gaaactgtct 1560

ccgttggaat ttgttgctgc tatcctgaag ggcgactaca gcagcggtca ggttgaaggt 1620

ggtccaccgc caggtctggc agctatgttg atggaaaata aggatttggt gatggttctg 1680

acgacgtccg tggcagtcct gatcggctgt gtcgtggtcc tggcatggcg tcgtgcggca 1740

ggtagcggta agtacaagca acctgaactg cctaaactgg tggtcccgaa agcagccgaa 1800

ccggaggagg cagaggatga taaaaccaag atcagcgtgt ttttcggcac ccaaaccggt 1860

acggcagaag gtttcgcgaa ggcttttgtt gaagaggcca aggcgcgtta tcagcaggcc 1920

cgtttcaaag ttatcgacct ggacgactat gcggcagacg atgacgagta cgaagagaaa 1980

ctgaagaagg aaaacttggc attcttcttc ttggcgtcct acggtgacgg cgagccgacg 2040

gacaacgcgg cacgctttta caaatggttt acggagggta aggaccgtgg tgaatggctg 2100

aacaatctgc agtacggcgt ttttggtctg ggtaaccgtc aatatgagca tttcaataag 2160

atcgccattg tcgtcgatga tctgatcttc gagcaaggtg gcaagaagct ggttccggtg 2220

ggtctgggtg acgatgacca gtgcattgag gatgattttg cggcgtggcg tgaactggtc 2280

tggccggaac tggataaact gctgcgtaac gaagacgacg ctaccgtggc aaccccgtac 2340

agcgccgctg tgctgcaata ccgcgtggtt ttccacgatc acattgacgg cctgattagc 2400

gaaaacggta gcccgaacgg tcatgctaat ggcaataccg tgtacgatgc gcaacacccg 2460

tgccgtagca acgtcgcggt caagaaggaa ttgcatactc cggcgagcga tcgcagctgc 2520

acccacctgg aatttaacat tagcggtacc ggcctgatgt acgagacggg tgaccacgtc 2580

ggtgtgtatt gcgagaacct gttggaaacc gtggaggagg ccgagaagtt gttgaacctg 2640

agcccgcaga cgtacttctc cgttcacacc gacaacgagg acggtacgcc gttgagcggc 2700

agcagcctgc cgccaccgtt tccgccgtgc accttgcgca cggcattgac caaatacgca 2760

gacttgactt ctgcaccgaa aaagtcggtg ctggtggcgc tggccgagta cgcatctgac 2820

cagggtgaag cggatcgttt gcgtttcttg gcgagcccga gcggcaaaga ggaatatgca 2880

cagtacatct tggcaagcca gcgcacgctg ctggaggtca tggcggagtt cccgtcggcg 2940

aaaccgccgc tgggtgtctt tttcgcgggt gtcgctccgc gcctgcagcc gcgtttctat 3000

tccattagct ctagcccgaa gatcgcaccg ttccgtattc acgtgacctg cgccctggtt 3060

tatgacaaat cccctaccgg tcgcgttcat aagggcatct gtagcacgtg gatgaaaaat 3120

gcggtcccgc tggaagaaag caacgattgt tcctgggctc cgatcttcgt ccgcaacagc 3180

aacttcaagc tgccgaccga cccgaaggtt ccgattatca tgattggtcc gggtaccggt 3240

ctggcccctt ttcgtggctt tttgcaagag cgcttggcgt tgaaagagag cggtgctgaa 3300

ttgggtccgg cgatcttgtt ctttggttgc cgtaaccgta aaatggactt tatttacgag 3360

gatgaactga atgatttcgt caaagcgggc gttgtcagcg agctgatcgt cgcttttagc 3420

cgcgaaggcc cgatgaaaga atacgtgcaa cacaaaatga gccaacgtgc ctccgatgtg 3480

tggaacatca ttagcgacgg tggttatgtt tatgtttgcg gtgacgcgaa gggtatggct 3540

cgtgatgttc accgtaccct gcataccatc gcacaggagc aaggtagcat gtccagctcg 3600

gaggccgaag gtatggtcaa aaacctgcaa accaccggtc gttacctgcg tgatgtgtgg 3660

taataaaagc ttgaaggaga tatactaatg tctacccagc aggttagctc cgagaatatc 3720

gttcgcaacg cggcgaactt ccacccgaat atctggggta atcatttctt gacgtgtcca 3780

agccagacga tcgattcttg gacgcaacaa caccataaag agctgaaaga agaggtccgc 3840

aagatgatgg tgagcgacgc aaacaaaccg gcacaacgtc tgcgtctgat tgacaccgtt 3900

caacgtttgg gcgtggcgta tcatttcgaa aaagaaatcg atgacgctct ggaaaagatc 3960

ggtcacgatc cgtttgacga taaggatgac ctgtatatcg ttagcctgtg ttttcgcctg 4020

ctgcgtcagc atggcatcaa gattagctgc gatgtttttg agaagttcaa agacgacgat 4080

ggcaagttta aggcttccct gatgaatgat gtccaaggta tgctgtcgtt gtatgaagcg 4140

gcccacctgg caattcatgg cgaggacatc ctggatgagg ctattgtctt tacgaccacc 4200

cacctgaaga gcaccgtttc taactccccg gtcaattcca cctttgcgga acagattcgc 4260

cacagcctgc gtgtgccgct gcgtaaggca gtcccgcgtt tggagagccg ctacttcctg 4320

gatatctata gccgtgacga cctgcacgac aagactctgc tgaactttgc caaactggac 4380

ttcaacatcc tgcaggcgat gcaccagaaa gaggcaagcg agatgacccg ttggtggcgt 4440

gatttcgatt tcctgaagaa gctgccgtac attcgtgatc gcgtggttga actgtacttt 4500

tggattttgg tcggtgtgag ctaccaaccg aaattcagca cgggtcgtat ctttttgagc 4560

aagattatct gtctggaaac cctggtggac gacacgtttg atgcgtacgg tactttcgac 4620

gaactggcca ttttcaccga ggccgttacg cgttgggacc tgggtcatcg cgacgcgctg 4680

cctgagtaca tgaaattcat tttcaagacc ctgattgatg tgtacagcga ggcggaacaa 4740

gagctggcaa aagagggccg ctcctatagc attcactatg cgatccgtag cttccaggag 4800

ttggtcatga agtacttttg cgaggcgaaa tggctgaata agggttatgt tccgagcctg 4860

gatgactaca agagcgtcag cctgcgcagc atcggcttcc tgccgatcgc cgtggcttct 4920

tttgttttca tgggcgacat tgctacgaaa gaggtttttg agtgggaaat gaataacccg 4980

aaaatcatca tcgcagccga aaccattttc cgctttctgg atgacattgc aggtcatcgc 5040

ttcgaacaaa aacgtgagca cagcccgagc gcaatcgagt gctacaaaaa ccaacatggt 5100

gtctcggaag aagaggcagt gaaagcgctg agcttggagg tcgccaattc gtggaaagac 5160

attaacgaag agctgctgct gaaccctatg gcaattccac tgccgttgct gcaggtgatc 5220

ctggatttga gccgtagcgc ggacttcatg tacggtaatg cgcaggaccg tttcacgcac 5280

tccaccatga tgaaagatca agttgacctg gttctgaaag atccggtgaa actggacgat 5340

taagaattc 5349

<210> 76

<211> 5402

<212> DNA

<213> 人工序列

<220>

<223> 用作编码SaCP816、CPRm和SaSAS的合成操纵子

<400> 76

catatggcac tgttgttggc ggttttctgg agcgctttga ttattctggt tagcatctta 60

ttgcgtcgtc gtcaaaaacg caacaatttg ccaccgggcc caccggccct gccgatcatc 120

ggtaacattc acattctggg caccctgccg caccagagcc tgtacaatct ggcgaagaag 180

tacggtccga tcatgtccat gcgtttgggc ttggttccgg cggtggtcat cagcagcccg 240

gaagcggccg agctggtcct gaaaacccac gacatcgttt ttgcttctcg ccctcgtctg 300

caagttgcag attactttca ctatggcacc aaaggcgtga ttctgaccga atatggtacc 360

tactggcgta acatgcgtcg cctgtgcacg gtcaaactgc tgaacaccgt taagattgat 420

agctttgcag gcacccgcaa gaaagaagtc gctagcttcg ttcagagcct gaaagaagca 480

agcgtggcgc acaaaatggt taacctgtcc gcacgcgtcg ctaatgttat tgagaatatg 540

gtttgtctga tggttattgg tagatcgtct gacgagcgtt tcaagctgaa agaagtgatc 600

caagaagcgg cacagctggc gggtgccttc aatattggtg actatgtccc gtttctgatg 660

ccgctggatc tgcagggcct gactcgccgt atcaagagcg gtagcaaggc attcgatgac 720

atcctcgagg tcattatcga cgagcatgtg caagacatta aagatcatga cgatgagcag 780

catggtgact tcatcgacgt gctgctggcg atgatgaata agccgatgga ttctcgtgag 840

ggtctgtcca tcattgatcg cacgaacatt aaagcgatcc tggtggatat gatcggtgcc 900

gcgatggaca cgagcaccag cggtgtggag tgggcgattt cggagctgat taagcatcct 960

cgtgtcatga agaaactgca agacgaagtg aaaaccgtaa tcggtatgaa ccgcatggtg 1020

gaagaagcgg atctgccgaa actgccgtac ctggacatgg ttgtcaagga aacgatgcgt 1080

ctgcatccgc caggcccgct gctggtgccg cgtgaaagca tggaagatat tacgatcaac 1140

ggttactata tcccgaagaa atcccgcatt attgtgaatg catgggcgat cggccgtgac 1200

accaacgcct ggagcaataa tgcgcacgag tttttccctg agcgttttat gagctctaac 1260

gttgatctgc aaggccagga cttccagctg atcccgttcg gtagcggtcg tcgcggttgt 1320

ccgggcatgc gtctgggtct gacgacggtc cgcttggtgc tggcccaact gattcactgc 1380

ttcgacctgg agcttccgaa gggcaccgtc gcgactgacc tggatatgag cgagaagttt 1440

ggtctggcaa tgccgcgtgc gcagcactta ctggcctttc cgacctaccg tctggagagc 1500

taagtcgact aactttaaga aggagatata tccatggaac ctagctctca gaaactgtct 1560

ccgttggaat ttgttgctgc tatcctgaag ggcgactaca gcagcggtca ggttgaaggt 1620

ggtccaccgc caggtctggc agctatgttg atggaaaata aggatttggt gatggttctg 1680

acgacgtccg tggcagtcct gatcggctgt gtcgtggtcc tggcatggcg tcgtgcggca 1740

ggtagcggta agtacaagca acctgaactg cctaaactgg tggtcccgaa agcagccgaa 1800

ccggaggagg cagaggatga taaaaccaag atcagcgtgt ttttcggcac ccaaaccggt 1860

acggcagaag gtttcgcgaa ggcttttgtt gaagaggcca aggcgcgtta tcagcaggcc 1920

cgtttcaaag ttatcgacct ggacgactat gcggcagacg atgacgagta cgaagagaaa 1980

ctgaagaagg aaaacttggc attcttcttc ttggcgtcct acggtgacgg cgagccgacg 2040

gacaacgcgg cacgctttta caaatggttt acggagggta aggaccgtgg tgaatggctg 2100

aacaatctgc agtacggcgt ttttggtctg ggtaaccgtc aatatgagca tttcaataag 2160

atcgccattg tcgtcgatga tctgatcttc gagcaaggtg gcaagaagct ggttccggtg 2220

ggtctgggtg acgatgacca gtgcattgag gatgattttg cggcgtggcg tgaactggtc 2280

tggccggaac tggataaact gctgcgtaac gaagacgacg ctaccgtggc aaccccgtac 2340

agcgccgctg tgctgcaata ccgcgtggtt ttccacgatc acattgacgg cctgattagc 2400

gaaaacggta gcccgaacgg tcatgctaat ggcaataccg tgtacgatgc gcaacacccg 2460

tgccgtagca acgtcgcggt caagaaggaa ttgcatactc cggcgagcga tcgcagctgc 2520

acccacctgg aatttaacat tagcggtacc ggcctgatgt acgagacggg tgaccacgtc 2580

ggtgtgtatt gcgagaacct gttggaaacc gtggaggagg ccgagaagtt gttgaacctg 2640

agcccgcaga cgtacttctc cgttcacacc gacaacgagg acggtacgcc gttgagcggc 2700

agcagcctgc cgccaccgtt tccgccgtgc accttgcgca cggcattgac caaatacgca 2760

gacttgactt ctgcaccgaa aaagtcggtg ctggtggcgc tggccgagta cgcatctgac 2820

cagggtgaag cggatcgttt gcgtttcttg gcgagcccga gcggcaaaga ggaatatgca 2880

cagtacatct tggcaagcca gcgcacgctg ctggaggtca tggcggagtt cccgtcggcg 2940

aaaccgccgc tgggtgtctt tttcgcgggt gtcgctccgc gcctgcagcc gcgtttctat 3000

tccattagct ctagcccgaa gatcgcaccg ttccgtattc acgtgacctg cgccctggtt 3060

tatgacaaat cccctaccgg tcgcgttcat aagggcatct gtagcacgtg gatgaaaaat 3120

gcggtcccgc tggaagaaag caacgattgt tcctgggctc cgatcttcgt ccgcaacagc 3180

aacttcaagc tgccgaccga cccgaaggtt ccgattatca tgattggtcc gggtaccggt 3240

ctggcccctt ttcgtggctt tttgcaagag cgcttggcgt tgaaagagag cggtgctgaa 3300

ttgggtccgg cgatcttgtt ctttggttgc cgtaaccgta aaatggactt tatttacgag 3360

gatgaactga atgatttcgt caaagcgggc gttgtcagcg agctgatcgt cgcttttagc 3420

cgcgaaggcc cgatgaaaga atacgtgcaa cacaaaatga gccaacgtgc ctccgatgtg 3480

tggaacatca ttagcgacgg tggttatgtt tatgtttgcg gtgacgcgaa gggtatggct 3540

cgtgatgttc accgtaccct gcataccatc gcacaggagc aaggtagcat gtccagctcg 3600

gaggccgaag gtatggtcaa aaacctgcaa accaccggtc gttacctgcg tgatgtgtgg 3660

taataaaagc ttaggaggta aaacatatgg acagcagcac cgccaccgca atgaccgcac 3720

cattcatcga cccgacggat catgtgaatc tgaaaaccga cacggatgcg agcgaaaatc 3780

gtcgtatggg taactacaag ccgagcattt ggaactacga ttttctgcag tccctggcga 3840

cgcaccacaa cattgttgaa gagcgtcacc tgaagctggc agagaaactg aaaggtcaag 3900

tgaaattcat gttcggtgcg ccgatggagc cattggctaa gttggagctg gttgatgtgg 3960

tgcaacgctt gggtctgaac cacctgttcg agactgaaat caaagaagct ctgttcagca 4020

tctacaaaga tggcagcaat ggctggtggt ttggccatct gcatgctacc tctttgcgct 4080

tccgtctgtt gcgccaatgt ggcctgttta tcccgcagga cgttttcaaa acctttcaaa 4140

acaagaccgg tgagtttgac atgaagctgt gggacaacgt taagggcctg ctgagcctgt 4200

acgaggcgag ctacctgggc tggaagggcg agaacatctt ggatgaagca aaggcgttca 4260

cgaccaagtg cctgaagagc gcatgggaga acattagcga gaagtggctg gcgaagcgtg 4320

ttaaacatgc gttggcgctg ccgctgcact ggcgtgttcc gcgtattgaa gcacgctggt 4380

ttatcgaggt gtacgaacaa gaggccaata tgaatccgac gctgctgaaa ctggcgaaac 4440

tggacttcaa catggtccaa agcattcacc agaaagaaat cggtgaactg gcccgctggt 4500

gggttactac cggcctggac aagctggatt tcgcacgcaa caatctgttg cagtcttata 4560

tgtggagctg cgccatcgcg tccgacccga aattcaaact ggcgcgtgaa accattgtcg 4620

agatcggttc cgtgttgacg gttgtcgacg acggctatga tgtgtacggt tctatggatg 4680

agctggacct gtacaccagc tcggtggagc gttggtcctg tgtcaaaatt gacaagctgc 4740

ctaatacgct gaagctgatc tttatgtcta tgttcaacaa aaccaacgag gtgggtctgc 4800

gtgttcaaca cgagcgtggt tacaatagca tcccgacctt cattaaggcg tgggtggaac 4860

agtgtaagag ctatcaaaaa gaggcgcgtt ggtttcatgg tggtcacacg cctccgctgg 4920

aagaatacag cctgaacggt ctggtcagca ttggttttcc gctgttgctg atcaccggct 4980

atgttgcgat tgctgagaat gaagcagccc tggataaagt ccacccgctg ccggacctgc 5040

tgcattattc cagcttgctg agccgtctga ttaatgatat cggcactagc ccggatgaaa 5100

tggcgcgtgg tgacaatctg aagagcattc actgctatat gaatgaaacc ggtgccagcg 5160

aagaggtcgc acgcgagcac atcaaaggcg tcatcgaaga gaattggaaa attctgaacc 5220

agtgttgctt tgaccagtcc cagttccagg agccgttcat cacgtttaac ctgaacagcg 5280

tgcgcggctc gcatttcttc tatgaatttg gtgatggttt tggtgttacc gacagctgga 5340

ccaaggtgga tatgaaaagc gtcctgattg atccgattcc gctgggtgaa gagtaagctt 5400

gc 5402

<210> 77

<211> 1880

<212> DNA

<213> 檀香树(Santalum album)

<400> 77

atataaaagc aatagagaaa cgcactttcc cacaccatcc caccagtaag tcactttgcc 60

caagtcccta atacggtgga aagggcaaaa aaaaataacg gaaagggtaa aatatcccgc 120

aaatgtctcc gaccactgtc gccgtcgccg tcgccatcat cggagcactc tggctcctca 180

cgcgaaagcg ccggaagggg ccgggcctcc cgccaggccc acgggcctac ccgatcatcg 240

ggaacctcca catgatgggc cagctcccgc accacaacct ccgcgagctg gcccgggagt 300

acggccccat catgtcgatg cggctcggcc tcgtccccgc catcgtggtc tcctccccgg 360

aggcggcgca gctcttcctg aagacgcatg atacggtgtt cgcgagccgg ccgaagacgg 420

agacggcgaa gtacttccac tacgggatca agggtctcat cctgaccgag tacgggccgt 480

actggcgcaa catccggcgg ctgagcacgg tcaagctgct gaacgcggcg aagatcgatt 540

cgttcgcggc gatgaggcgg agcgaggtgg agaggctggt ggcgtcggtg agggggtcgg 600

cggtgcggcg ggaggtggtg gacgtgagct cgaaggtggc ggaggcaatg gagaacatgg 660

tgtgtcagat ggtgattggg aggagtgggg acgataggtt taagctgaag gagacgtttc 720

aggaggggac tcagttggcc ggagctttca attttgggga gttcgttccc tttctcctgc 780

cacttgacct tcagggaata acacggcgca taaaagaagt aagcacgagg ttcaacaaaa 840

tcttggattt aatcgtcgac gagcacatca gagacgccgc tggaaccaaa aattccggcg 900

gtcgagacag cgacaacttc ctcgacgtcc tcctttccct aatgaacacc tccatcagcg 960

actccaacga caccggcgac aacaaccgca acaacgtcat tgaacgagac aacatcaaag 1020

cgatcctcac cgatatgctc ggcgccgcca tggacacctc cgccagcacc gtcgagtgga 1080

ccatctccga gctcttccgc cacccaaaaa caatgcaaaa gctccaggcc gagattcggg 1140

gtgtcgtggg cccgacccgg aacgtgtctg aagacgacct cccaaagctc acttacctgg 1200

acatggtggt gaaggagggg atgcggcttc acccggcggt gccgctgctc ctcccccacg 1260

agtccctgga ggaggcgaca atcgatggtt attacattcc gaaggggtct cggatcctga 1320

tcaatgtgtg ggccatcggg cgcgacccga aggcctggcc tgatcgcccg gaggagttca 1380

tcccggagag gtttgagaaa agcaatgtgg atgtgctggg gagggatttc caactccttc 1440

cgttcggctc gggccgtaga gggtgcgccg ggattcggtt agggttgatt ttcgtgcgat 1500

tggtgctagc tcagctggtg cattgtttcg attgggagct cgcccgcaac atggcttcgt 1560

caccggagaa gttggacatg gaagagaagt tcgggctagc tgtgcataga gttaaccatt 1620

tgaaagcact gccgacttat cgcttggaat gctaaaagtt gctttctacc tatatatata 1680

cactcgctag gaaataaatg atgttttcaa atggaataat tttctttttt aatgaaatag 1740

cataagtatt gttggttgtt atttaccaaa aaaaaagaag tattgtcggt tgtttacgat 1800

ggtggtatta atgtgttttg atgcatgggt atatccatca ttttatttta acttagctaa 1860

tttttgagtt attgatgtat 1880

<210> 78

<211> 1533

<212> DNA

<213> 檀香树(Santalum album)

<400> 78

atgtctccga ccactgtcgc cgtcgccgtc gccatcatcg gagcactctg gctcctcacg 60

cgaaagcgcc ggaaggggcc gggcctcccg ccaggcccac gggcctaccc gatcatcggg 120

aacctccaca tgatgggcca gctcccgcac cacaacctcc gcgagctggc ccgggagtac 180

ggccccatca tgtcgatgcg gctcggcctc gtccccgcca tcgtggtctc ctccccggag 240

gcggcgcagc tcttcctgaa gacgcatgat acggtgttcg cgagccggcc gaagacggag 300

acggcgaagt acttccacta cgggatcaag ggtctcatcc tgaccgagta cgggccgtac 360

tggcgcaaca tccggcggct gagcacggtc aagctgctga acgcggcgaa gatcgattcg 420

ttcgcggcga tgaggcggag cgaggtggag aggctggtgg cgtcggtgag ggggtcggcg 480

gtgcggcggg aggtggtgga cgtgagctcg aaggtggcgg aggcaatgga gaacatggtg 540

tgtcagatgg tgattgggag gagtggggac gataggttta agctgaagga gacgtttcag 600

gaggggactc agttggccgg agctttcaat tttggggagt tcgttccctt tctcctgcca 660

cttgaccttc agggaataac acggcgcata aaagaagtaa gcacgaggtt caacaaaatc 720

ttggatttaa tcgtcgacga gcacatcaga gacgccgctg gaaccaaaaa ttccggcggt 780

cgagacagcg acaacttcct cgacgtcctc ctttccctaa tgaacacctc catcagcgac 840

tccaacgaca ccggcgacaa caaccgcaac aacgtcattg aacgagacaa catcaaagcg 900

atcctcaccg atatgctcgg cgccgccatg gacacctccg ccagcaccgt cgagtggacc 960

atctccgagc tcttccgcca cccaaaaaca atgcaaaagc tccaggccga gattcggggt 1020

gtcgtgggcc cgacccggaa cgtgtctgaa gacgacctcc caaagctcac ttacctggac 1080

atggtggtga aggaggggat gcggcttcac ccggcggtgc cgctgctcct cccccacgag 1140

tccctggagg aggcgacaat cgatggttat tacattccga aggggtctcg gatcctgatc 1200

aatgtgtggg ccatcgggcg cgacccgaag gcctggcctg atcgcccgga ggagttcatc 1260

ccggagaggt ttgagaaaag caatgtggat gtgctgggga gggatttcca actccttccg 1320

ttcggctcgg gccgtagagg gtgcgccggg attcggttag ggttgatttt cgtgcgattg 1380

gtgctagctc agctggtgca ttgtttcgat tgggagctcg cccgcaacat ggcttcgtca 1440

ccggagaagt tggacatgga agagaagttc gggctagctg tgcatagagt taaccatttg 1500

aaagcactgc cgacttatcg cttggaatgc taa 1533

<210> 79

<211> 510

<212> PRT

<213> 檀香树(Santalum album)

<400> 79

Met Ser Pro Thr Thr Val Ala Val Ala Val Ala Ile Ile Gly Ala Leu

1 5 10 15

Trp Leu Leu Thr Arg Lys Arg Arg Lys Gly Pro Gly Leu Pro Pro Gly

20 25 30

Pro Arg Ala Tyr Pro Ile Ile Gly Asn Leu His Met Met Gly Gln Leu

35 40 45

Pro His His Asn Leu Arg Glu Leu Ala Arg Glu Tyr Gly Pro Ile Met

50 55 60

Ser Met Arg Leu Gly Leu Val Pro Ala Ile Val Val Ser Ser Pro Glu

65 70 75 80

Ala Ala Gln Leu Phe Leu Lys Thr His Asp Thr Val Phe Ala Ser Arg

85 90 95

Pro Lys Thr Glu Thr Ala Lys Tyr Phe His Tyr Gly Ile Lys Gly Leu

100 105 110

Ile Leu Thr Glu Tyr Gly Pro Tyr Trp Arg Asn Ile Arg Arg Leu Ser

115 120 125

Thr Val Lys Leu Leu Asn Ala Ala Lys Ile Asp Ser Phe Ala Ala Met

130 135 140

Arg Arg Ser Glu Val Glu Arg Leu Val Ala Ser Val Arg Gly Ser Ala

145 150 155 160

Val Arg Arg Glu Val Val Asp Val Ser Ser Lys Val Ala Glu Ala Met

165 170 175

Glu Asn Met Val Cys Gln Met Val Ile Gly Arg Ser Gly Asp Asp Arg

180 185 190

Phe Lys Leu Lys Glu Thr Phe Gln Glu Gly Thr Gln Leu Ala Gly Ala

195 200 205

Phe Asn Phe Gly Glu Phe Val Pro Phe Leu Leu Pro Leu Asp Leu Gln

210 215 220

Gly Ile Thr Arg Arg Ile Lys Glu Val Ser Thr Arg Phe Asn Lys Ile

225 230 235 240

Leu Asp Leu Ile Val Asp Glu His Ile Arg Asp Ala Ala Gly Thr Lys

245 250 255

Asn Ser Gly Gly Arg Asp Ser Asp Asn Phe Leu Asp Val Leu Leu Ser

260 265 270

Leu Met Asn Thr Ser Ile Ser Asp Ser Asn Asp Thr Gly Asp Asn Asn

275 280 285

Arg Asn Asn Val Ile Glu Arg Asp Asn Ile Lys Ala Ile Leu Thr Asp

290 295 300

Met Leu Gly Ala Ala Met Asp Thr Ser Ala Ser Thr Val Glu Trp Thr

305 310 315 320

Ile Ser Glu Leu Phe Arg His Pro Lys Thr Met Gln Lys Leu Gln Ala

325 330 335

Glu Ile Arg Gly Val Val Gly Pro Thr Arg Asn Val Ser Glu Asp Asp

340 345 350

Leu Pro Lys Leu Thr Tyr Leu Asp Met Val Val Lys Glu Gly Met Arg

355 360 365

Leu His Pro Ala Val Pro Leu Leu Leu Pro His Glu Ser Leu Glu Glu

370 375 380

Ala Thr Ile Asp Gly Tyr Tyr Ile Pro Lys Gly Ser Arg Ile Leu Ile

385 390 395 400

Asn Val Trp Ala Ile Gly Arg Asp Pro Lys Ala Trp Pro Asp Arg Pro

405 410 415

Glu Glu Phe Ile Pro Glu Arg Phe Glu Lys Ser Asn Val Asp Val Leu

420 425 430

Gly Arg Asp Phe Gln Leu Leu Pro Phe Gly Ser Gly Arg Arg Gly Cys

435 440 445

Ala Gly Ile Arg Leu Gly Leu Ile Phe Val Arg Leu Val Leu Ala Gln

450 455 460

Leu Val His Cys Phe Asp Trp Glu Leu Ala Arg Asn Met Ala Ser Ser

465 470 475 480

Pro Glu Lys Leu Asp Met Glu Glu Lys Phe Gly Leu Ala Val His Arg

485 490 495

Val Asn His Leu Lys Ala Leu Pro Thr Tyr Arg Leu Glu Cys

500 505 510

<210> 80

<211> 1555

<212> DNA

<213> 人工序列

<220>

<223> SaCP120292, 用作编码N-末端经修饰的SaCP10374的优化的cDNA

<400> 80

aggaggtaaa acatatggca ctgctgctgg ctgtcttttg gagcgcactg attattctga 60

cccgcaaacg ccgcaaaggt ccgggtctgc caccgggtcc gcgtgcgtac ccgattattg 120

gcaatctgca catgatgggc cagctgccac accacaattt gcgtgagctg gcacgtgagt 180

atggtccgat tatgagcatg cgcctgggtc tggtgccggc aatcgtggtt agctctcctg 240

aggctgcgca gctgttcctc aagacgcatg ataccgtttt cgcgagccgt ccaaagaccg 300

agactgccaa atacttccat tacggtatca aaggtctgat cctgaccgag tatggcccgt 360

actggcgcaa tattcgtcgt ttgagcaccg ttaagctgtt gaatgccgcg aaaatcgata 420

gcttcgcggc tatgcgtaga agcgaagttg aacgcctggt cgcgtccgtt cgtggttcgg 480

cggttcgtcg tgaggttgtg gacgtcagca gcaaagtggc ggaagctatg gagaatatgg 540

tctgccagat ggttatcggc cgttcaggtg acgatcgttt taagctgaaa gaaacctttc 600

aagagggcac ccaactggca ggcgcgttca attttggtga gtttgtgccg tttctgctgc 660

cgctggactt gcaaggtatt acccgtcgca tcaaagaagt cagcactcgt ttcaataaga 720

ttttggacct gatcgttgac gagcacattc gcgatgccgc tggtaccaaa aacagcggcg 780

gtcgtgatag cgacaatttt ctggatgttc tgctgtcctt gatgaacacc tctattagcg 840

atagcaatga cacgggtgac aacaaccgta acaacgtgat cgagcgtgat aacattaaag 900

cgatcctgac ggacatgctg ggtgcagcga tggacacgag cgcgagcacg gtcgagtgga 960

cgatctccga actgtttcgc cacccgaaaa ccatgcagaa gctgcaagca gaaatccgtg 1020

gtgtcgtggg cccgacccgc aatgtgagcg aagatgactt gccgaagctg acctatctgg 1080

acatggtcgt taaggaaggc atgcgtttgc atccggccgt gccgctgctt ctgccgcatg 1140

agtctctgga agaagccacg atcgatggct actacattcc gaagggttcc cgcattctga 1200

tcaacgtctg ggcgattggt cgcgacccga aggcctggcc ggatcgtcct gaagagttca 1260

tcccggagcg tttcgagaaa agcaacgtgg atgtgctggg ccgtgacttc cagctgctgc 1320

cgtttggttc gggtcgtcgc ggttgtgcag gcattcgcct gggcctgatc ttcgtacgtc 1380

tggttctggc acagttagtt cactgtttcg actgggaact ggcgcgcaac atggcgagca 1440

gcccggagaa gttggatatg gaagagaagt tcggcctggc ggtgcatcgt gtcaaccacc 1500

tgaaagccct gccgacgtat cgtctggagt gctaagtcga caccatggaa agctt 1555

<210> 81

<211> 506

<212> PRT

<213> 人工序列

<220>

<223> SaCP10374opt, N-末端经修饰的氨基酸序列

<400> 81

Met Ala Leu Leu Leu Ala Val Phe Trp Ser Ala Leu Ile Ile Leu Thr

1 5 10 15

Arg Lys Arg Arg Lys Gly Pro Gly Leu Pro Pro Gly Pro Arg Ala Tyr

20 25 30

Pro Ile Ile Gly Asn Leu His Met Met Gly Gln Leu Pro His His Asn

35 40 45

Leu Arg Glu Leu Ala Arg Glu Tyr Gly Pro Ile Met Ser Met Arg Leu

50 55 60

Gly Leu Val Pro Ala Ile Val Val Ser Ser Pro Glu Ala Ala Gln Leu

65 70 75 80

Phe Leu Lys Thr His Asp Thr Val Phe Ala Ser Arg Pro Lys Thr Glu

85 90 95

Thr Ala Lys Tyr Phe His Tyr Gly Ile Lys Gly Leu Ile Leu Thr Glu

100 105 110

Tyr Gly Pro Tyr Trp Arg Asn Ile Arg Arg Leu Ser Thr Val Lys Leu

115 120 125

Leu Asn Ala Ala Lys Ile Asp Ser Phe Ala Ala Met Arg Arg Ser Glu

130 135 140

Val Glu Arg Leu Val Ala Ser Val Arg Gly Ser Ala Val Arg Arg Glu

145 150 155 160

Val Val Asp Val Ser Ser Lys Val Ala Glu Ala Met Glu Asn Met Val

165 170 175

Cys Gln Met Val Ile Gly Arg Ser Gly Asp Asp Arg Phe Lys Leu Lys

180 185 190

Glu Thr Phe Gln Glu Gly Thr Gln Leu Ala Gly Ala Phe Asn Phe Gly

195 200 205

Glu Phe Val Pro Phe Leu Leu Pro Leu Asp Leu Gln Gly Ile Thr Arg

210 215 220

Arg Ile Lys Glu Val Ser Thr Arg Phe Asn Lys Ile Leu Asp Leu Ile

225 230 235 240

Val Asp Glu His Ile Arg Asp Ala Ala Gly Thr Lys Asn Ser Gly Gly

245 250 255

Arg Asp Ser Asp Asn Phe Leu Asp Val Leu Leu Ser Leu Met Asn Thr

260 265 270

Ser Ile Ser Asp Ser Asn Asp Thr Gly Asp Asn Asn Arg Asn Asn Val

275 280 285

Ile Glu Arg Asp Asn Ile Lys Ala Ile Leu Thr Asp Met Leu Gly Ala

290 295 300

Ala Met Asp Thr Ser Ala Ser Thr Val Glu Trp Thr Ile Ser Glu Leu

305 310 315 320

Phe Arg His Pro Lys Thr Met Gln Lys Leu Gln Ala Glu Ile Arg Gly

325 330 335

Val Val Gly Pro Thr Arg Asn Val Ser Glu Asp Asp Leu Pro Lys Leu

340 345 350

Thr Tyr Leu Asp Met Val Val Lys Glu Gly Met Arg Leu His Pro Ala

355 360 365

Val Pro Leu Leu Leu Pro His Glu Ser Leu Glu Glu Ala Thr Ile Asp

370 375 380

Gly Tyr Tyr Ile Pro Lys Gly Ser Arg Ile Leu Ile Asn Val Trp Ala

385 390 395 400

Ile Gly Arg Asp Pro Lys Ala Trp Pro Asp Arg Pro Glu Glu Phe Ile

405 410 415

Pro Glu Arg Phe Glu Lys Ser Asn Val Asp Val Leu Gly Arg Asp Phe

420 425 430

Gln Leu Leu Pro Phe Gly Ser Gly Arg Arg Gly Cys Ala Gly Ile Arg

435 440 445

Leu Gly Leu Ile Phe Val Arg Leu Val Leu Ala Gln Leu Val His Cys

450 455 460

Phe Asp Trp Glu Leu Ala Arg Asn Met Ala Ser Ser Pro Glu Lys Leu

465 470 475 480

Asp Met Glu Glu Lys Phe Gly Leu Ala Val His Arg Val Asn His Leu

485 490 495

Lys Ala Leu Pro Thr Tyr Arg Leu Glu Cys

500 505

<210> 82

<211> 3693

<212> DNA

<213> 人工序列

<220>

<223> SaCP10374-CPRm, 用作编码SaCP10374和CPRm的合成操纵子

<400> 82

catatggcac tgctgctggc tgtcttttgg agcgcactga ttattctgac ccgcaaacgc 60

cgcaaaggtc cgggtctgcc accgggtccg cgtgcgtacc cgattattgg caatctgcac 120

atgatgggcc agctgccaca ccacaatttg cgtgagctgg cacgtgagta tggtccgatt 180

atgagcatgc gcctgggtct ggtgccggca atcgtggtta gctctcctga ggctgcgcag 240

ctgttcctca agacgcatga taccgttttc gcgagccgtc caaagaccga gactgccaaa 300

tacttccatt acggtatcaa aggtctgatc ctgaccgagt atggcccgta ctggcgcaat 360

attcgtcgtt tgagcaccgt taagctgttg aatgccgcga aaatcgatag cttcgcggct 420

atgcgtagaa gcgaagttga acgcctggtc gcgtccgttc gtggttcggc ggttcgtcgt 480

gaggttgtgg acgtcagcag caaagtggcg gaagctatgg agaatatggt ctgccagatg 540

gttatcggcc gttcaggtga cgatcgtttt aagctgaaag aaacctttca agagggcacc 600

caactggcag gcgcgttcaa ttttggtgag tttgtgccgt ttctgctgcc gctggacttg 660

caaggtatta cccgtcgcat caaagaagtc agcactcgtt tcaataagat tttggacctg 720

atcgttgacg agcacattcg cgatgccgct ggtaccaaaa acagcggcgg tcgtgatagc 780

gacaattttc tggatgttct gctgtccttg atgaacacct ctattagcga tagcaatgac 840

acgggtgaca acaaccgtaa caacgtgatc gagcgtgata acattaaagc gatcctgacg 900

gacatgctgg gtgcagcgat ggacacgagc gcgagcacgg tcgagtggac gatctccgaa 960

ctgtttcgcc acccgaaaac catgcagaag ctgcaagcag aaatccgtgg tgtcgtgggc 1020

ccgacccgca atgtgagcga agatgacttg ccgaagctga cctatctgga catggtcgtt 1080

aaggaaggca tgcgtttgca tccggccgtg ccgctgcttc tgccgcatga gtctctggaa 1140

gaagccacga tcgatggcta ctacattccg aagggttccc gcattctgat caacgtctgg 1200

gcgattggtc gcgacccgaa ggcctggccg gatcgtcctg aagagttcat cccggagcgt 1260

ttcgagaaaa gcaacgtgga tgtgctgggc cgtgacttcc agctgctgcc gtttggttcg 1320

ggtcgtcgcg gttgtgcagg cattcgcctg ggcctgatct tcgtacgtct ggttctggca 1380

cagttagttc actgtttcga ctgggaactg gcgcgcaaca tggcgagcag cccggagaag 1440

ttggatatgg aagagaagtt cggcctggcg gtgcatcgtg tcaaccacct gaaagccctg 1500

ccgacgtatc gtctggagtg ctaagtcgac taactttaag aaggagatat atccatggaa 1560

cctagctctc agaaactgtc tccgttggaa tttgttgctg ctatcctgaa gggcgactac 1620

agcagcggtc aggttgaagg tggtccaccg ccaggtctgg cagctatgtt gatggaaaat 1680

aaggatttgg tgatggttct gacgacgtcc gtggcagtcc tgatcggctg tgtcgtggtc 1740

ctggcatggc gtcgtgcggc aggtagcggt aagtacaagc aacctgaact gcctaaactg 1800

gtggtcccga aagcagccga accggaggag gcagaggatg ataaaaccaa gatcagcgtg 1860

tttttcggca cccaaaccgg tacggcagaa ggtttcgcga aggcttttgt tgaagaggcc 1920

aaggcgcgtt atcagcaggc ccgtttcaaa gttatcgacc tggacgacta tgcggcagac 1980

gatgacgagt acgaagagaa actgaagaag gaaaacttgg cattcttctt cttggcgtcc 2040

tacggtgacg gcgagccgac ggacaacgcg gcacgctttt acaaatggtt tacggagggt 2100

aaggaccgtg gtgaatggct gaacaatctg cagtacggcg tttttggtct gggtaaccgt 2160

caatatgagc atttcaataa gatcgccatt gtcgtcgatg atctgatctt cgagcaaggt 2220

ggcaagaagc tggttccggt gggtctgggt gacgatgacc agtgcattga ggatgatttt 2280

gcggcgtggc gtgaactggt ctggccggaa ctggataaac tgctgcgtaa cgaagacgac 2340

gctaccgtgg caaccccgta cagcgccgct gtgctgcaat accgcgtggt tttccacgat 2400

cacattgacg gcctgattag cgaaaacggt agcccgaacg gtcatgctaa tggcaatacc 2460

gtgtacgatg cgcaacaccc gtgccgtagc aacgtcgcgg tcaagaagga attgcatact 2520

ccggcgagcg atcgcagctg cacccacctg gaatttaaca ttagcggtac cggcctgatg 2580

tacgagacgg gtgaccacgt cggtgtgtat tgcgagaacc tgttggaaac cgtggaggag 2640

gccgagaagt tgttgaacct gagcccgcag acgtacttct ccgttcacac cgacaacgag 2700

gacggtacgc cgttgagcgg cagcagcctg ccgccaccgt ttccgccgtg caccttgcgc 2760

acggcattga ccaaatacgc agacttgact tctgcaccga aaaagtcggt gctggtggcg 2820

ctggccgagt acgcatctga ccagggtgaa gcggatcgtt tgcgtttctt ggcgagcccg 2880

agcggcaaag aggaatatgc acagtacatc ttggcaagcc agcgcacgct gctggaggtc 2940

atggcggagt tcccgtcggc gaaaccgccg ctgggtgtct ttttcgcggg tgtcgctccg 3000

cgcctgcagc cgcgtttcta ttccattagc tctagcccga agatcgcacc gttccgtatt 3060

cacgtgacct gcgccctggt ttatgacaaa tcccctaccg gtcgcgttca taagggcatc 3120

tgtagcacgt ggatgaaaaa tgcggtcccg ctggaagaaa gcaacgattg ttcctgggct 3180

ccgatcttcg tccgcaacag caacttcaag ctgccgaccg acccgaaggt tccgattatc 3240

atgattggtc cgggtaccgg tctggcccct tttcgtggct ttttgcaaga gcgcttggcg 3300

ttgaaagaga gcggtgctga attgggtccg gcgatcttgt tctttggttg ccgtaaccgt 3360

aaaatggact ttatttacga ggatgaactg aatgatttcg tcaaagcggg cgttgtcagc 3420

gagctgatcg tcgcttttag ccgcgaaggc ccgatgaaag aatacgtgca acacaaaatg 3480

agccaacgtg cctccgatgt gtggaacatc attagcgacg gtggttatgt ttatgtttgc 3540

ggtgacgcga agggtatggc tcgtgatgtt caccgtaccc tgcataccat cgcacaggag 3600

caaggtagca tgtccagctc ggaggccgaa ggtatggtca aaaacctgca aaccaccggt 3660

cgttacctgc gtgatgtgtg gtaataaaag ctt 3693

<210> 83

<211> 5339

<212> DNA

<213> 人工序列

<220>

<223> SaCP816-CPRm-SaTPs647, 用作编码SaCP816、CPRm和倍半香桧烯B合酶的合成

操纵子

<400> 83

catatggcac tgttgttggc ggttttctgg agcgctttga ttattctggt tagcatctta 60

ttgcgtcgtc gtcaaaaacg caacaatttg ccaccgggcc caccggccct gccgatcatc 120

ggtaacattc acattctggg caccctgccg caccagagcc tgtacaatct ggcgaagaag 180

tacggtccga tcatgtccat gcgtttgggc ttggttccgg cggtggtcat cagcagcccg 240

gaagcggccg agctggtcct gaaaacccac gacatcgttt ttgcttctcg ccctcgtctg 300

caagttgcag attactttca ctatggcacc aaaggcgtga ttctgaccga atatggtacc 360

tactggcgta acatgcgtcg cctgtgcacg gtcaaactgc tgaacaccgt taagattgat 420

agctttgcag gcacccgcaa gaaagaagtc gctagcttcg ttcagagcct gaaagaagca 480

agcgtggcgc acaaaatggt taacctgtcc gcacgcgtcg ctaatgttat tgagaatatg 540

gtttgtctga tggttattgg tagatcgtct gacgagcgtt tcaagctgaa agaagtgatc 600

caagaagcgg cacagctggc gggtgccttc aatattggtg actatgtccc gtttctgatg 660

ccgctggatc tgcagggcct gactcgccgt atcaagagcg gtagcaaggc attcgatgac 720

atcctcgagg tcattatcga cgagcatgtg caagacatta aagatcatga cgatgagcag 780

catggtgact tcatcgacgt gctgctggcg atgatgaata agccgatgga ttctcgtgag 840

ggtctgtcca tcattgatcg cacgaacatt aaagcgatcc tggtggatat gatcggtgcc 900

gcgatggaca cgagcaccag cggtgtggag tgggcgattt cggagctgat taagcatcct 960

cgtgtcatga agaaactgca agacgaagtg aaaaccgtaa tcggtatgaa ccgcatggtg 1020

gaagaagcgg atctgccgaa actgccgtac ctggacatgg ttgtcaagga aacgatgcgt 1080

ctgcatccgc caggcccgct gctggtgccg cgtgaaagca tggaagatat tacgatcaac 1140

ggttactata tcccgaagaa atcccgcatt attgtgaatg catgggcgat cggccgtgac 1200

accaacgcct ggagcaataa tgcgcacgag tttttccctg agcgttttat gagctctaac 1260

gttgatctgc aaggccagga cttccagctg atcccgttcg gtagcggtcg tcgcggttgt 1320

ccgggcatgc gtctgggtct gacgacggtc cgcttggtgc tggcccaact gattcactgc 1380

ttcgacctgg agcttccgaa gggcaccgtc gcgactgacc tggatatgag cgagaagttt 1440

ggtctggcaa tgccgcgtgc gcagcactta ctggcctttc cgacctaccg tctggagagc 1500

taagtcgact aactttaaga aggagatata tccatggaac ctagctctca gaaactgtct 1560

ccgttggaat ttgttgctgc tatcctgaag ggcgactaca gcagcggtca ggttgaaggt 1620

ggtccaccgc caggtctggc agctatgttg atggaaaata aggatttggt gatggttctg 1680

acgacgtccg tggcagtcct gatcggctgt gtcgtggtcc tggcatggcg tcgtgcggca 1740

ggtagcggta agtacaagca acctgaactg cctaaactgg tggtcccgaa agcagccgaa 1800

ccggaggagg cagaggatga taaaaccaag atcagcgtgt ttttcggcac ccaaaccggt 1860

acggcagaag gtttcgcgaa ggcttttgtt gaagaggcca aggcgcgtta tcagcaggcc 1920

cgtttcaaag ttatcgacct ggacgactat gcggcagacg atgacgagta cgaagagaaa 1980

ctgaagaagg aaaacttggc attcttcttc ttggcgtcct acggtgacgg cgagccgacg 2040

gacaacgcgg cacgctttta caaatggttt acggagggta aggaccgtgg tgaatggctg 2100

aacaatctgc agtacggcgt ttttggtctg ggtaaccgtc aatatgagca tttcaataag 2160

atcgccattg tcgtcgatga tctgatcttc gagcaaggtg gcaagaagct ggttccggtg 2220

ggtctgggtg acgatgacca gtgcattgag gatgattttg cggcgtggcg tgaactggtc 2280

tggccggaac tggataaact gctgcgtaac gaagacgacg ctaccgtggc aaccccgtac 2340

agcgccgctg tgctgcaata ccgcgtggtt ttccacgatc acattgacgg cctgattagc 2400

gaaaacggta gcccgaacgg tcatgctaat ggcaataccg tgtacgatgc gcaacacccg 2460

tgccgtagca acgtcgcggt caagaaggaa ttgcatactc cggcgagcga tcgcagctgc 2520

acccacctgg aatttaacat tagcggtacc ggcctgatgt acgagacggg tgaccacgtc 2580

ggtgtgtatt gcgagaacct gttggaaacc gtggaggagg ccgagaagtt gttgaacctg 2640

agcccgcaga cgtacttctc cgttcacacc gacaacgagg acggtacgcc gttgagcggc 2700

agcagcctgc cgccaccgtt tccgccgtgc accttgcgca cggcattgac caaatacgca 2760

gacttgactt ctgcaccgaa aaagtcggtg ctggtggcgc tggccgagta cgcatctgac 2820

cagggtgaag cggatcgttt gcgtttcttg gcgagcccga gcggcaaaga ggaatatgca 2880

cagtacatct tggcaagcca gcgcacgctg ctggaggtca tggcggagtt cccgtcggcg 2940

aaaccgccgc tgggtgtctt tttcgcgggt gtcgctccgc gcctgcagcc gcgtttctat 3000

tccattagct ctagcccgaa gatcgcaccg ttccgtattc acgtgacctg cgccctggtt 3060

tatgacaaat cccctaccgg tcgcgttcat aagggcatct gtagcacgtg gatgaaaaat 3120

gcggtcccgc tggaagaaag caacgattgt tcctgggctc cgatcttcgt ccgcaacagc 3180

aacttcaagc tgccgaccga cccgaaggtt ccgattatca tgattggtcc gggtaccggt 3240

ctggcccctt ttcgtggctt tttgcaagag cgcttggcgt tgaaagagag cggtgctgaa 3300

ttgggtccgg cgatcttgtt ctttggttgc cgtaaccgta aaatggactt tatttacgag 3360

gatgaactga atgatttcgt caaagcgggc gttgtcagcg agctgatcgt cgcttttagc 3420

cgcgaaggcc cgatgaaaga atacgtgcaa cacaaaatga gccaacgtgc ctccgatgtg 3480

tggaacatca ttagcgacgg tggttatgtt tatgtttgcg gtgacgcgaa gggtatggct 3540

cgtgatgttc accgtaccct gcataccatc gcacaggagc aaggtagcat gtccagctcg 3600

gaggccgaag gtatggtcaa aaacctgcaa accaccggtc gttacctgcg tgatgtgtgg 3660

taataaaagc ttaggaggta aaaatggcga ccgttgtgga tgattctagc gtcgttcgtc 3720

gttctgcaaa ctacccgccg aatttgtggg actatgagtt cctgcaatcc ctgggtgacc 3780

agtgtacggt cgaagaaaaa cacctgaagc tggccgacaa gttgaaagaa gaagttaaat 3840

ccctgattaa acagacgatg gagccgctgg caaaactgga gttcatcgat accgtgcgtc 3900

gtttgggttt gaaatatcag tttgagaccg aggtgaagga ggccgttgtt atggttagca 3960

aatatgagaa tgatgcgtgg tggattgata atctgcacgc taccagcctg cgtttccgca 4020

tcatgcgtga gaatggtatc ttcgtgccgc aagatgtgtt tgaacgtttc aaagataccg 4080

acggctttaa aaaccaactg tgcgaagacg tgaagggtct gttgtctctg tatgaggcga 4140

gctttctggg ttgggagggc gaggatatct tggatgaggc acgcaccttt gcgaccagca 4200

agctgaagag cattgaaggc aaaattccga gcccgagcct ggctaagaaa gtgagccacg 4260

cgctggactt gcctctgcac tggcgtacca ttcgctacga agcgcgctgg ttcatcgaca 4320

cctacggtga agaagaggac gtgaatctga cgttgctgcg ttacgccaaa ctggacttca 4380

acattgttca atctttttac caaaaagaga tcggccgtct gtcccgctgg tgggtgggta 4440

ctggcctgga taaaatgccg tttgctcgta atggtctgat tcagagctat atgtacgcaa 4500

ttggtatgct gttcgagcct aacctgggcg aggtgcgtga gatggaggcg aaggtcggcg 4560

ccttgattac cacgatcgac gacgtgtatg acgtttacgg cacgatggag gagttggagc 4620

tgttcaccga tattaccaat cgttgggaca tcagcaaagc ggatcaactg ccgcgtaaca 4680

tccgcatgcc gctgctgacg atgttcaaca ccagcaatga tatcggttat tgggctctga 4740

aagagcgtgg tttcaatggc attccgtgta ccgcaaaagt ctggtccgac caactgaaga 4800

gctacaccaa ggaggctaaa tggttccacg aaggccataa accgactctg gaggagtatc 4860

tggacaatgc gctggtcagc atcggcttcc cgaacctgct ggtcacgtct tatctgttga 4920

ccgttgagaa tccgaccaaa gaaaagctgg actatgtgaa cagcctgccg ttgttcgttc 4980

gcgcgagctg catcctgtgt cgtatcatta acgatctggg tacgagcccg gatgaaatgg 5040

agcgtggtga caatctgaaa agcatccagt gctatatgaa cgaaaccggt gcgagccaag 5100

aggttgcgcg tgagcacatc gaaggcctgg ttcgtatgtg gtggaaacgt ctgaacaagt 5160

gcctgtttga gccgagcccg ttcactgagc cgttcctgag ctttacgatt aacgtggtcc 5220

gtggtagcca ctttttctat cagtacggcg atggctacgg caacgcagag agctggacca 5280

agaaccaggg tatgtcggtg ctgatccacc cgattaccct ggatgaagag taagaattc 5339

<210> 84

<211> 5360

<212> DNA

<213> 人工序列

<220>

<223> SaCP10374-CPRm-saTPs647, 用作编码SaCP10374、CPRm和倍半香桧烯B合酶的

合成操纵子

<400> 84

catatggcac tgctgctggc tgtcttttgg agcgcactga ttattctgac ccgcaaacgc 60

cgcaaaggtc cgggtctgcc accgggtccg cgtgcgtacc cgattattgg caatctgcac 120

atgatgggcc agctgccaca ccacaatttg cgtgagctgg cacgtgagta tggtccgatt 180

atgagcatgc gcctgggtct ggtgccggca atcgtggtta gctctcctga ggctgcgcag 240

ctgttcctca agacgcatga taccgttttc gcgagccgtc caaagaccga gactgccaaa 300

tacttccatt acggtatcaa aggtctgatc ctgaccgagt atggcccgta ctggcgcaat 360

attcgtcgtt tgagcaccgt taagctgttg aatgccgcga aaatcgatag cttcgcggct 420

atgcgtagaa gcgaagttga acgcctggtc gcgtccgttc gtggttcggc ggttcgtcgt 480

gaggttgtgg acgtcagcag caaagtggcg gaagctatgg agaatatggt ctgccagatg 540

gttatcggcc gttcaggtga cgatcgtttt aagctgaaag aaacctttca agagggcacc 600

caactggcag gcgcgttcaa ttttggtgag tttgtgccgt ttctgctgcc gctggacttg 660

caaggtatta cccgtcgcat caaagaagtc agcactcgtt tcaataagat tttggacctg 720

atcgttgacg agcacattcg cgatgccgct ggtaccaaaa acagcggcgg tcgtgatagc 780

gacaattttc tggatgttct gctgtccttg atgaacacct ctattagcga tagcaatgac 840

acgggtgaca acaaccgtaa caacgtgatc gagcgtgata acattaaagc gatcctgacg 900

gacatgctgg gtgcagcgat ggacacgagc gcgagcacgg tcgagtggac gatctccgaa 960

ctgtttcgcc acccgaaaac catgcagaag ctgcaagcag aaatccgtgg tgtcgtgggc 1020

ccgacccgca atgtgagcga agatgacttg ccgaagctga cctatctgga catggtcgtt 1080

aaggaaggca tgcgtttgca tccggccgtg ccgctgcttc tgccgcatga gtctctggaa 1140

gaagccacga tcgatggcta ctacattccg aagggttccc gcattctgat caacgtctgg 1200

gcgattggtc gcgacccgaa ggcctggccg gatcgtcctg aagagttcat cccggagcgt 1260

ttcgagaaaa gcaacgtgga tgtgctgggc cgtgacttcc agctgctgcc gtttggttcg 1320

ggtcgtcgcg gttgtgcagg cattcgcctg ggcctgatct tcgtacgtct ggttctggca 1380

cagttagttc actgtttcga ctgggaactg gcgcgcaaca tggcgagcag cccggagaag 1440

ttggatatgg aagagaagtt cggcctggcg gtgcatcgtg tcaaccacct gaaagccctg 1500

ccgacgtatc gtctggagag ctaagtcgac taactttaag aaggagatat atccatggaa 1560

cctagctctc agaaactgtc tccgttggaa tttgttgctg ctatcctgaa gggcgactac 1620

agcagcggtc aggttgaagg tggtccaccg ccaggtctgg cagctatgtt gatggaaaat 1680

aaggatttgg tgatggttct gacgacgtcc gtggcagtcc tgatcggctg tgtcgtggtc 1740

ctggcatggc gtcgtgcggc aggtagcggt aagtacaagc aacctgaact gcctaaactg 1800

gtggtcccga aagcagccga accggaggag gcagaggatg ataaaaccaa gatcagcgtg 1860

tttttcggca cccaaaccgg tacggcagaa ggtttcgcga aggcttttgt tgaagaggcc 1920

aaggcgcgtt atcagcaggc ccgtttcaaa gttatcgacc tggacgacta tgcggcagac 1980

gatgacgagt acgaagagaa actgaagaag gaaaacttgg cattcttctt cttggcgtcc 2040

tacggtgacg gcgagccgac ggacaacgcg gcacgctttt acaaatggtt tacggagggt 2100

aaggaccgtg gtgaatggct gaacaatctg cagtacggcg tttttggtct gggtaaccgt 2160

caatatgagc atttcaataa gatcgccatt gtcgtcgatg atctgatctt cgagcaaggt 2220

ggcaagaagc tggttccggt gggtctgggt gacgatgacc agtgcattga ggatgatttt 2280

gcggcgtggc gtgaactggt ctggccggaa ctggataaac tgctgcgtaa cgaagacgac 2340

gctaccgtgg caaccccgta cagcgccgct gtgctgcaat accgcgtggt tttccacgat 2400

cacattgacg gcctgattag cgaaaacggt agcccgaacg gtcatgctaa tggcaatacc 2460

gtgtacgatg cgcaacaccc gtgccgtagc aacgtcgcgg tcaagaagga attgcatact 2520

ccggcgagcg atcgcagctg cacccacctg gaatttaaca ttagcggtac cggcctgatg 2580

tacgagacgg gtgaccacgt cggtgtgtat tgcgagaacc tgttggaaac cgtggaggag 2640

gccgagaagt tgttgaacct gagcccgcag acgtacttct ccgttcacac cgacaacgag 2700

gacggtacgc cgttgagcgg cagcagcctg ccgccaccgt ttccgccgtg caccttgcgc 2760

acggcattga ccaaatacgc agacttgact tctgcaccga aaaagtcggt gctggtggcg 2820

ctggccgagt acgcatctga ccagggtgaa gcggatcgtt tgcgtttctt ggcgagcccg 2880

agcggcaaag aggaatatgc acagtacatc ttggcaagcc agcgcacgct gctggaggtc 2940

atggcggagt tcccgtcggc gaaaccgccg ctgggtgtct ttttcgcggg tgtcgctccg 3000

cgcctgcagc cgcgtttcta ttccattagc tctagcccga agatcgcacc gttccgtatt 3060

cacgtgacct gcgccctggt ttatgacaaa tcccctaccg gtcgcgttca taagggcatc 3120

tgtagcacgt ggatgaaaaa tgcggtcccg ctggaagaaa gcaacgattg ttcctgggct 3180

ccgatcttcg tccgcaacag caacttcaag ctgccgaccg acccgaaggt tccgattatc 3240

atgattggtc cgggtaccgg tctggcccct tttcgtggct ttttgcaaga gcgcttggcg 3300

ttgaaagaga gcggtgctga attgggtccg gcgatcttgt tctttggttg ccgtaaccgt 3360

aaaatggact ttatttacga ggatgaactg aatgatttcg tcaaagcggg cgttgtcagc 3420

gagctgatcg tcgcttttag ccgcgaaggc ccgatgaaag aatacgtgca acacaaaatg 3480

agccaacgtg cctccgatgt gtggaacatc attagcgacg gtggttatgt ttatgtttgc 3540

ggtgacgcga agggtatggc tcgtgatgtt caccgtaccc tgcataccat cgcacaggag 3600

caaggtagca tgtccagctc ggaggccgaa ggtatggtca aaaacctgca aaccaccggt 3660

cgttacctgc gtgatgtgtg gtaataaaag cttaggaggt aaaaatggcg accgttgtgg 3720

atgattctag cgtcgttcgt cgttctgcaa actacccgcc gaatttgtgg gactatgagt 3780

tcctgcaatc cctgggtgac cagtgtacgg tcgaagaaaa acacctgaag ctggccgaca 3840

agttgaaaga agaagttaaa tccctgatta aacagacgat ggagccgctg gcaaaactgg 3900

agttcatcga taccgtgcgt cgtttgggtt tgaaatatca gtttgagacc gaggtgaagg 3960

aggccgttgt tatggttagc aaatatgaga atgatgcgtg gtggattgat aatctgcacg 4020

ctaccagcct gcgtttccgc atcatgcgtg agaatggtat cttcgtgccg caagatgtgt 4080

ttgaacgttt caaagatacc gacggcttta aaaaccaact gtgcgaagac gtgaagggtc 4140

tgttgtctct gtatgaggcg agctttctgg gttgggaggg cgaggatatc ttggatgagg 4200

cacgcacctt tgcgaccagc aagctgaaga gcattgaagg caaaattccg agcccgagcc 4260

tggctaagaa agtgagccac gcgctggact tgcctctgca ctggcgtacc attcgctacg 4320

aagcgcgctg gttcatcgac acctacggtg aagaagagga cgtgaatctg acgttgctgc 4380

gttacgccaa actggacttc aacattgttc aatcttttta ccaaaaagag atcggccgtc 4440

tgtcccgctg gtgggtgggt actggcctgg ataaaatgcc gtttgctcgt aatggtctga 4500

ttcagagcta tatgtacgca attggtatgc tgttcgagcc taacctgggc gaggtgcgtg 4560

agatggaggc gaaggtcggc gccttgatta ccacgatcga cgacgtgtat gacgtttacg 4620

gcacgatgga ggagttggag ctgttcaccg atattaccaa tcgttgggac atcagcaaag 4680

cggatcaact gccgcgtaac atccgcatgc cgctgctgac gatgttcaac accagcaatg 4740

atatcggtta ttgggctctg aaagagcgtg gtttcaatgg cattccgtgt accgcaaaag 4800

tctggtccga ccaactgaag agctacacca aggaggctaa atggttccac gaaggccata 4860

aaccgactct ggaggagtat ctggacaatg cgctggtcag catcggcttc ccgaacctgc 4920

tggtcacgtc ttatctgttg accgttgaga atccgaccaa agaaaagctg gactatgtga 4980

acagcctgcc gttgttcgtt cgcgcgagct gcatcctgtg tcgtatcatt aacgatctgg 5040

gtacgagccc ggatgaaatg gagcgtggtg acaatctgaa aagcatccag tgctatatga 5100

acgaaaccgg tgcgagccaa gaggttgcgc gtgagcacat cgaaggcctg gttcgtatgt 5160

ggtggaaacg tctgaacaag tgcctgtttg agccgagccc gttcactgag ccgttcctga 5220

gctttacgat taacgtggtc cgtggtagcc actttttcta tcagtacggc gatggctacg 5280

gcaacgcaga gagctggacc aagaaccagg gtatgtcggt gctgatccac ccgattaccc 5340

tggatgaaga gtaagaattc 5360

<210> 85

<211> 5420

<212> DNA

<213> 人工序列

<220>

<223> SaCP816-CPRm-SaTPs30, 用作编码SaCP816、CPRm和β-甜没药烯合酶的合成

操纵子

<400> 85

catatggcac tgttgttggc ggttttctgg agcgctttga ttattctggt tagcatctta 60

ttgcgtcgtc gtcaaaaacg caacaatttg ccaccgggcc caccggccct gccgatcatc 120

ggtaacattc acattctggg caccctgccg caccagagcc tgtacaatct ggcgaagaag 180

tacggtccga tcatgtccat gcgtttgggc ttggttccgg cggtggtcat cagcagcccg 240

gaagcggccg agctggtcct gaaaacccac gacatcgttt ttgcttctcg ccctcgtctg 300

caagttgcag attactttca ctatggcacc aaaggcgtga ttctgaccga atatggtacc 360

tactggcgta acatgcgtcg cctgtgcacg gtcaaactgc tgaacaccgt taagattgat 420

agctttgcag gcacccgcaa gaaagaagtc gctagcttcg ttcagagcct gaaagaagca 480

agcgtggcgc acaaaatggt taacctgtcc gcacgcgtcg ctaatgttat tgagaatatg 540

gtttgtctga tggttattgg tagatcgtct gacgagcgtt tcaagctgaa agaagtgatc 600

caagaagcgg cacagctggc gggtgccttc aatattggtg actatgtccc gtttctgatg 660

ccgctggatc tgcagggcct gactcgccgt atcaagagcg gtagcaaggc attcgatgac 720

atcctcgagg tcattatcga cgagcatgtg caagacatta aagatcatga cgatgagcag 780

catggtgact tcatcgacgt gctgctggcg atgatgaata agccgatgga ttctcgtgag 840

ggtctgtcca tcattgatcg cacgaacatt aaagcgatcc tggtggatat gatcggtgcc 900

gcgatggaca cgagcaccag cggtgtggag tgggcgattt cggagctgat taagcatcct 960

cgtgtcatga agaaactgca agacgaagtg aaaaccgtaa tcggtatgaa ccgcatggtg 1020

gaagaagcgg atctgccgaa actgccgtac ctggacatgg ttgtcaagga aacgatgcgt 1080

ctgcatccgc caggcccgct gctggtgccg cgtgaaagca tggaagatat tacgatcaac 1140

ggttactata tcccgaagaa atcccgcatt attgtgaatg catgggcgat cggccgtgac 1200

accaacgcct ggagcaataa tgcgcacgag tttttccctg agcgttttat gagctctaac 1260

gttgatctgc aaggccagga cttccagctg atcccgttcg gtagcggtcg tcgcggttgt 1320

ccgggcatgc gtctgggtct gacgacggtc cgcttggtgc tggcccaact gattcactgc 1380

ttcgacctgg agcttccgaa gggcaccgtc gcgactgacc tggatatgag cgagaagttt 1440

ggtctggcaa tgccgcgtgc gcagcactta ctggcctttc cgacctaccg tctggagagc 1500

taagtcgact aactttaaga aggagatata tccatggaac ctagctctca gaaactgtct 1560

ccgttggaat ttgttgctgc tatcctgaag ggcgactaca gcagcggtca ggttgaaggt 1620

ggtccaccgc caggtctggc agctatgttg atggaaaata aggatttggt gatggttctg 1680

acgacgtccg tggcagtcct gatcggctgt gtcgtggtcc tggcatggcg tcgtgcggca 1740

ggtagcggta agtacaagca acctgaactg cctaaactgg tggtcccgaa agcagccgaa 1800

ccggaggagg cagaggatga taaaaccaag atcagcgtgt ttttcggcac ccaaaccggt 1860

acggcagaag gtttcgcgaa ggcttttgtt gaagaggcca aggcgcgtta tcagcaggcc 1920

cgtttcaaag ttatcgacct ggacgactat gcggcagacg atgacgagta cgaagagaaa 1980

ctgaagaagg aaaacttggc attcttcttc ttggcgtcct acggtgacgg cgagccgacg 2040

gacaacgcgg cacgctttta caaatggttt acggagggta aggaccgtgg tgaatggctg 2100

aacaatctgc agtacggcgt ttttggtctg ggtaaccgtc aatatgagca tttcaataag 2160

atcgccattg tcgtcgatga tctgatcttc gagcaaggtg gcaagaagct ggttccggtg 2220

ggtctgggtg acgatgacca gtgcattgag gatgattttg cggcgtggcg tgaactggtc 2280

tggccggaac tggataaact gctgcgtaac gaagacgacg ctaccgtggc aaccccgtac 2340

agcgccgctg tgctgcaata ccgcgtggtt ttccacgatc acattgacgg cctgattagc 2400

gaaaacggta gcccgaacgg tcatgctaat ggcaataccg tgtacgatgc gcaacacccg 2460

tgccgtagca acgtcgcggt caagaaggaa ttgcatactc cggcgagcga tcgcagctgc 2520

acccacctgg aatttaacat tagcggtacc ggcctgatgt acgagacggg tgaccacgtc 2580

ggtgtgtatt gcgagaacct gttggaaacc gtggaggagg ccgagaagtt gttgaacctg 2640

agcccgcaga cgtacttctc cgttcacacc gacaacgagg acggtacgcc gttgagcggc 2700

agcagcctgc cgccaccgtt tccgccgtgc accttgcgca cggcattgac caaatacgca 2760

gacttgactt ctgcaccgaa aaagtcggtg ctggtggcgc tggccgagta cgcatctgac 2820

cagggtgaag cggatcgttt gcgtttcttg gcgagcccga gcggcaaaga ggaatatgca 2880

cagtacatct tggcaagcca gcgcacgctg ctggaggtca tggcggagtt cccgtcggcg 2940

aaaccgccgc tgggtgtctt tttcgcgggt gtcgctccgc gcctgcagcc gcgtttctat 3000

tccattagct ctagcccgaa gatcgcaccg ttccgtattc acgtgacctg cgccctggtt 3060

tatgacaaat cccctaccgg tcgcgttcat aagggcatct gtagcacgtg gatgaaaaat 3120

gcggtcccgc tggaagaaag caacgattgt tcctgggctc cgatcttcgt ccgcaacagc 3180

aacttcaagc tgccgaccga cccgaaggtt ccgattatca tgattggtcc gggtaccggt 3240

ctggcccctt ttcgtggctt tttgcaagag cgcttggcgt tgaaagagag cggtgctgaa 3300

ttgggtccgg cgatcttgtt ctttggttgc cgtaaccgta aaatggactt tatttacgag 3360

gatgaactga atgatttcgt caaagcgggc gttgtcagcg agctgatcgt cgcttttagc 3420

cgcgaaggcc cgatgaaaga atacgtgcaa cacaaaatga gccaacgtgc ctccgatgtg 3480

tggaacatca ttagcgacgg tggttatgtt tatgtttgcg gtgacgcgaa gggtatggct 3540

cgtgatgttc accgtaccct gcataccatc gcacaggagc aaggtagcat gtccagctcg 3600

gaggccgaag gtatggtcaa aaacctgcaa accaccggtc gttacctgcg tgatgtgtgg 3660

taataaaagc ttaggaggta aaaatggacg cattcgcaac gagcccgacc agcgcactga 3720

ttaaggcggt taactgcatc gcgcacgtga ccccgatggc aggtgaagat tcctccgaaa 3780

accgccgtgc atcgaactac aaaccgagca cctgggacta tgaatttctg caaagcctgg 3840

ccacgagcca taacaccgtc caggaaaagc acatgaagat ggctgagaaa ttgaaggaag 3900

aggtgaagag catgatcaag ggtcagatgg agccggtggc gaagttggaa ctgatcaaca 3960

tcctgcagcg tctgggtttg aaatatcgct ttgaatccga gatcaaggaa gagctgtttt 4020

ccctgtacaa ggacggtact gatgcgtggt gggttgataa tctgcatgca acggcgctgc 4080

gttttagact gctgcgcgag aatggtattt tcgtgccgca agaagtattc gaaactttaa 4140

aggataagag cggtaagttt aagagccagc tgtgcaagga cgttcgtggt ctgctgagct 4200

tgtacgaggc gtcctacctg ggttgggagg gtgaggactt gctggacgag gccaagaagt 4260

tcagcaccac caacctgaac aatgtgaaag aaagcatcag cagcaacact ctgggtcgct 4320

tggtcaagca cgccctgaac ctgccgctgc actggtctgc ggcacgttac gaggcgagat 4380

ggtttattga cgagtacgaa aaagaagaaa acgttaaccc gaacctgctg aagtacgcga 4440

agtttgactt taacatcgtt cagagcattc accaacgtga gctcggtaac ctcgcgcgtt 4500

ggtgggtaga aaccggcctg gataaactga gcttcgtgcg caatacgttg atgcagaatt 4560

tcatgtgggg ctgtgcgatg gtgttcgaac cgcagtacgg caaggttcgc gatgcggccg 4620

tcaagcaggc cagcctgatt gcgatggtcg acgacgtgta tgacgtttat ggcagcctgg 4680

aagaactgga aatctttacc gatatcgtgg accgttggga tatcaccggt atcgacaagc 4740

tgccgcgtaa catctctatg attctgctga cgatgttcaa taccgcgaat cagattggtt 4800

acgacttgct gcgtgaccgc ggttttaacg gcatcccgca cattgctcag gcgtgggcca 4860

ccctgtgtaa gaaatatctg aaagaggcga agtggtatca tagcggttac aagccaactc 4920

tggaggagta cctggaaaac ggtcttgttt ctattagctt tgtgctgagc cttgttaccg 4980

catatctgca gaccgaaacc ctggagaatc tgacgtatga gtccgctgcg tacgtgaata 5040

gcgtaccgcc actggtccgc tacagcggcc tgctgaatcg tctgtacaac gatctcggta 5100

cgtcaagcgc agaaattgca cgtggtgaca ccctgaaaag catccagtgt tatatgaccc 5160

aaaccggtgc aaccgaggaa gcagcgcgcg agcacattaa aggtctggtt cacgaagcgt 5220

ggaagggcat gaacaaatgc ttgttcgagc agacgccatt cgcggagccg tttgtcggtt 5280

tcaacgtcaa taccgtccgc ggttcccaat tcttctacca gcatggcgac ggctacgcgg 5340

ttacggaaag ctggacgaag gacctgagcc tgtcggtgct gattcacccg atcccgctga 5400

atgaagagga ctaagaattc 5420

<210> 86

<211> 5441

<212> DNA

<213> 人工序列

<220>

<223> SaCP10374-CPRm-SaTPs30, 用作编码SaCP10374、CPRm和β-甜没药烯合酶的

合成操纵子

<400> 86

catatggcac tgctgctggc tgtcttttgg agcgcactga ttattctgac ccgcaaacgc 60

cgcaaaggtc cgggtctgcc accgggtccg cgtgcgtacc cgattattgg caatctgcac 120

atgatgggcc agctgccaca ccacaatttg cgtgagctgg cacgtgagta tggtccgatt 180

atgagcatgc gcctgggtct ggtgccggca atcgtggtta gctctcctga ggctgcgcag 240

ctgttcctca agacgcatga taccgttttc gcgagccgtc caaagaccga gactgccaaa 300

tacttccatt acggtatcaa aggtctgatc ctgaccgagt atggcccgta ctggcgcaat 360

attcgtcgtt tgagcaccgt taagctgttg aatgccgcga aaatcgatag cttcgcggct 420

atgcgtagaa gcgaagttga acgcctggtc gcgtccgttc gtggttcggc ggttcgtcgt 480

gaggttgtgg acgtcagcag caaagtggcg gaagctatgg agaatatggt ctgccagatg 540

gttatcggcc gttcaggtga cgatcgtttt aagctgaaag aaacctttca agagggcacc 600

caactggcag gcgcgttcaa ttttggtgag tttgtgccgt ttctgctgcc gctggacttg 660

caaggtatta cccgtcgcat caaagaagtc agcactcgtt tcaataagat tttggacctg 720

atcgttgacg agcacattcg cgatgccgct ggtaccaaaa acagcggcgg tcgtgatagc 780

gacaattttc tggatgttct gctgtccttg atgaacacct ctattagcga tagcaatgac 840

acgggtgaca acaaccgtaa caacgtgatc gagcgtgata acattaaagc gatcctgacg 900

gacatgctgg gtgcagcgat ggacacgagc gcgagcacgg tcgagtggac gatctccgaa 960

ctgtttcgcc acccgaaaac catgcagaag ctgcaagcag aaatccgtgg tgtcgtgggc 1020

ccgacccgca atgtgagcga agatgacttg ccgaagctga cctatctgga catggtcgtt 1080

aaggaaggca tgcgtttgca tccggccgtg ccgctgcttc tgccgcatga gtctctggaa 1140

gaagccacga tcgatggcta ctacattccg aagggttccc gcattctgat caacgtctgg 1200

gcgattggtc gcgacccgaa ggcctggccg gatcgtcctg aagagttcat cccggagcgt 1260

ttcgagaaaa gcaacgtgga tgtgctgggc cgtgacttcc agctgctgcc gtttggttcg 1320

ggtcgtcgcg gttgtgcagg cattcgcctg ggcctgatct tcgtacgtct ggttctggca 1380

cagttagttc actgtttcga ctgggaactg gcgcgcaaca tggcgagcag cccggagaag 1440

ttggatatgg aagagaagtt cggcctggcg gtgcatcgtg tcaaccacct gaaagccctg 1500

ccgacgtatc gtctggagag ctaagtcgac taactttaag aaggagatat atccatggaa 1560

cctagctctc agaaactgtc tccgttggaa tttgttgctg ctatcctgaa gggcgactac 1620

agcagcggtc aggttgaagg tggtccaccg ccaggtctgg cagctatgtt gatggaaaat 1680

aaggatttgg tgatggttct gacgacgtcc gtggcagtcc tgatcggctg tgtcgtggtc 1740

ctggcatggc gtcgtgcggc aggtagcggt aagtacaagc aacctgaact gcctaaactg 1800

gtggtcccga aagcagccga accggaggag gcagaggatg ataaaaccaa gatcagcgtg 1860

tttttcggca cccaaaccgg tacggcagaa ggtttcgcga aggcttttgt tgaagaggcc 1920

aaggcgcgtt atcagcaggc ccgtttcaaa gttatcgacc tggacgacta tgcggcagac 1980

gatgacgagt acgaagagaa actgaagaag gaaaacttgg cattcttctt cttggcgtcc 2040

tacggtgacg gcgagccgac ggacaacgcg gcacgctttt acaaatggtt tacggagggt 2100

aaggaccgtg gtgaatggct gaacaatctg cagtacggcg tttttggtct gggtaaccgt 2160

caatatgagc atttcaataa gatcgccatt gtcgtcgatg atctgatctt cgagcaaggt 2220

ggcaagaagc tggttccggt gggtctgggt gacgatgacc agtgcattga ggatgatttt 2280

gcggcgtggc gtgaactggt ctggccggaa ctggataaac tgctgcgtaa cgaagacgac 2340

gctaccgtgg caaccccgta cagcgccgct gtgctgcaat accgcgtggt tttccacgat 2400

cacattgacg gcctgattag cgaaaacggt agcccgaacg gtcatgctaa tggcaatacc 2460

gtgtacgatg cgcaacaccc gtgccgtagc aacgtcgcgg tcaagaagga attgcatact 2520

ccggcgagcg atcgcagctg cacccacctg gaatttaaca ttagcggtac cggcctgatg 2580

tacgagacgg gtgaccacgt cggtgtgtat tgcgagaacc tgttggaaac cgtggaggag 2640

gccgagaagt tgttgaacct gagcccgcag acgtacttct ccgttcacac cgacaacgag 2700

gacggtacgc cgttgagcgg cagcagcctg ccgccaccgt ttccgccgtg caccttgcgc 2760

acggcattga ccaaatacgc agacttgact tctgcaccga aaaagtcggt gctggtggcg 2820

ctggccgagt acgcatctga ccagggtgaa gcggatcgtt tgcgtttctt ggcgagcccg 2880

agcggcaaag aggaatatgc acagtacatc ttggcaagcc agcgcacgct gctggaggtc 2940

atggcggagt tcccgtcggc gaaaccgccg ctgggtgtct ttttcgcggg tgtcgctccg 3000

cgcctgcagc cgcgtttcta ttccattagc tctagcccga agatcgcacc gttccgtatt 3060

cacgtgacct gcgccctggt ttatgacaaa tcccctaccg gtcgcgttca taagggcatc 3120

tgtagcacgt ggatgaaaaa tgcggtcccg ctggaagaaa gcaacgattg ttcctgggct 3180

ccgatcttcg tccgcaacag caacttcaag ctgccgaccg acccgaaggt tccgattatc 3240

atgattggtc cgggtaccgg tctggcccct tttcgtggct ttttgcaaga gcgcttggcg 3300

ttgaaagaga gcggtgctga attgggtccg gcgatcttgt tctttggttg ccgtaaccgt 3360

aaaatggact ttatttacga ggatgaactg aatgatttcg tcaaagcggg cgttgtcagc 3420

gagctgatcg tcgcttttag ccgcgaaggc ccgatgaaag aatacgtgca acacaaaatg 3480

agccaacgtg cctccgatgt gtggaacatc attagcgacg gtggttatgt ttatgtttgc 3540

ggtgacgcga agggtatggc tcgtgatgtt caccgtaccc tgcataccat cgcacaggag 3600

caaggtagca tgtccagctc ggaggccgaa ggtatggtca aaaacctgca aaccaccggt 3660

cgttacctgc gtgatgtgtg gtaataaaag cttaggaggt aaaaatggac gcattcgcaa 3720

cgagcccgac cagcgcactg attaaggcgg ttaactgcat cgcgcacgtg accccgatgg 3780

caggtgaaga ttcctccgaa aaccgccgtg catcgaacta caaaccgagc acctgggact 3840

atgaatttct gcaaagcctg gccacgagcc ataacaccgt ccaggaaaag cacatgaaga 3900

tggctgagaa attgaaggaa gaggtgaaga gcatgatcaa gggtcagatg gagccggtgg 3960

cgaagttgga actgatcaac atcctgcagc gtctgggttt gaaatatcgc tttgaatccg 4020

agatcaagga agagctgttt tccctgtaca aggacggtac tgatgcgtgg tgggttgata 4080

atctgcatgc aacggcgctg cgttttagac tgctgcgcga gaatggtatt ttcgtgccgc 4140

aagaagtatt cgaaacttta aaggataaga gcggtaagtt taagagccag ctgtgcaagg 4200

acgttcgtgg tctgctgagc ttgtacgagg cgtcctacct gggttgggag ggtgaggact 4260

tgctggacga ggccaagaag ttcagcacca ccaacctgaa caatgtgaaa gaaagcatca 4320

gcagcaacac tctgggtcgc ttggtcaagc acgccctgaa cctgccgctg cactggtctg 4380

cggcacgtta cgaggcgaga tggtttattg acgagtacga aaaagaagaa aacgttaacc 4440

cgaacctgct gaagtacgcg aagtttgact ttaacatcgt tcagagcatt caccaacgtg 4500

agctcggtaa cctcgcgcgt tggtgggtag aaaccggcct ggataaactg agcttcgtgc 4560

gcaatacgtt gatgcagaat ttcatgtggg gctgtgcgat ggtgttcgaa ccgcagtacg 4620

gcaaggttcg cgatgcggcc gtcaagcagg ccagcctgat tgcgatggtc gacgacgtgt 4680

atgacgttta tggcagcctg gaagaactgg aaatctttac cgatatcgtg gaccgttggg 4740

atatcaccgg tatcgacaag ctgccgcgta acatctctat gattctgctg acgatgttca 4800

ataccgcgaa tcagattggt tacgacttgc tgcgtgaccg cggttttaac ggcatcccgc 4860

acattgctca ggcgtgggcc accctgtgta agaaatatct gaaagaggcg aagtggtatc 4920

atagcggtta caagccaact ctggaggagt acctggaaaa cggtcttgtt tctattagct 4980

ttgtgctgag ccttgttacc gcatatctgc agaccgaaac cctggagaat ctgacgtatg 5040

agtccgctgc gtacgtgaat agcgtaccgc cactggtccg ctacagcggc ctgctgaatc 5100

gtctgtacaa cgatctcggt acgtcaagcg cagaaattgc acgtggtgac accctgaaaa 5160

gcatccagtg ttatatgacc caaaccggtg caaccgagga agcagcgcgc gagcacatta 5220

aaggtctggt tcacgaagcg tggaagggca tgaacaaatg cttgttcgag cagacgccat 5280

tcgcggagcc gtttgtcggt ttcaacgtca ataccgtccg cggttcccaa ttcttctacc 5340

agcatggcga cggctacgcg gttacggaaa gctggacgaa ggacctgagc ctgtcggtgc 5400

tgattcaccc gatcccgctg aatgaagagg actaagaatt c 5441

<210> 87

<211> 5414

<212> DNA

<213> 人工序列

<220>

<223> SaCP816-CPRm-AaBFS, 用作编码SaCP816、CPRm和β-法呢烯合酶的合成操纵子

<400> 87

catatggcac tgttgttggc ggttttctgg agcgctttga ttattctggt tagcatctta 60

ttgcgtcgtc gtcaaaaacg caacaatttg ccaccgggcc caccggccct gccgatcatc 120

ggtaacattc acattctggg caccctgccg caccagagcc tgtacaatct ggcgaagaag 180

tacggtccga tcatgtccat gcgtttgggc ttggttccgg cggtggtcat cagcagcccg 240

gaagcggccg agctggtcct gaaaacccac gacatcgttt ttgcttctcg ccctcgtctg 300

caagttgcag attactttca ctatggcacc aaaggcgtga ttctgaccga atatggtacc 360

tactggcgta acatgcgtcg cctgtgcacg gtcaaactgc tgaacaccgt taagattgat 420

agctttgcag gcacccgcaa gaaagaagtc gctagcttcg ttcagagcct gaaagaagca 480

agcgtggcgc acaaaatggt taacctgtcc gcacgcgtcg ctaatgttat tgagaatatg 540

gtttgtctga tggttattgg tagatcgtct gacgagcgtt tcaagctgaa agaagtgatc 600

caagaagcgg cacagctggc gggtgccttc aatattggtg actatgtccc gtttctgatg 660

ccgctggatc tgcagggcct gactcgccgt atcaagagcg gtagcaaggc attcgatgac 720

atcctcgagg tcattatcga cgagcatgtg caagacatta aagatcatga cgatgagcag 780

catggtgact tcatcgacgt gctgctggcg atgatgaata agccgatgga ttctcgtgag 840

ggtctgtcca tcattgatcg cacgaacatt aaagcgatcc tggtggatat gatcggtgcc 900

gcgatggaca cgagcaccag cggtgtggag tgggcgattt cggagctgat taagcatcct 960

cgtgtcatga agaaactgca agacgaagtg aaaaccgtaa tcggtatgaa ccgcatggtg 1020

gaagaagcgg atctgccgaa actgccgtac ctggacatgg ttgtcaagga aacgatgcgt 1080

ctgcatccgc caggcccgct gctggtgccg cgtgaaagca tggaagatat tacgatcaac 1140

ggttactata tcccgaagaa atcccgcatt attgtgaatg catgggcgat cggccgtgac 1200

accaacgcct ggagcaataa tgcgcacgag tttttccctg agcgttttat gagctctaac 1260

gttgatctgc aaggccagga cttccagctg atcccgttcg gtagcggtcg tcgcggttgt 1320

ccgggcatgc gtctgggtct gacgacggtc cgcttggtgc tggcccaact gattcactgc 1380

ttcgacctgg agcttccgaa gggcaccgtc gcgactgacc tggatatgag cgagaagttt 1440

ggtctggcaa tgccgcgtgc gcagcactta ctggcctttc cgacctaccg tctggagagc 1500

taagtcgact aactttaaga aggagatata tccatggaac ctagctctca gaaactgtct 1560

ccgttggaat ttgttgctgc tatcctgaag ggcgactaca gcagcggtca ggttgaaggt 1620

ggtccaccgc caggtctggc agctatgttg atggaaaata aggatttggt gatggttctg 1680

acgacgtccg tggcagtcct gatcggctgt gtcgtggtcc tggcatggcg tcgtgcggca 1740

ggtagcggta agtacaagca acctgaactg cctaaactgg tggtcccgaa agcagccgaa 1800

ccggaggagg cagaggatga taaaaccaag atcagcgtgt ttttcggcac ccaaaccggt 1860

acggcagaag gtttcgcgaa ggcttttgtt gaagaggcca aggcgcgtta tcagcaggcc 1920

cgtttcaaag ttatcgacct ggacgactat gcggcagacg atgacgagta cgaagagaaa 1980

ctgaagaagg aaaacttggc attcttcttc ttggcgtcct acggtgacgg cgagccgacg 2040

gacaacgcgg cacgctttta caaatggttt acggagggta aggaccgtgg tgaatggctg 2100

aacaatctgc agtacggcgt ttttggtctg ggtaaccgtc aatatgagca tttcaataag 2160

atcgccattg tcgtcgatga tctgatcttc gagcaaggtg gcaagaagct ggttccggtg 2220

ggtctgggtg acgatgacca gtgcattgag gatgattttg cggcgtggcg tgaactggtc 2280

tggccggaac tggataaact gctgcgtaac gaagacgacg ctaccgtggc aaccccgtac 2340

agcgccgctg tgctgcaata ccgcgtggtt ttccacgatc acattgacgg cctgattagc 2400

gaaaacggta gcccgaacgg tcatgctaat ggcaataccg tgtacgatgc gcaacacccg 2460

tgccgtagca acgtcgcggt caagaaggaa ttgcatactc cggcgagcga tcgcagctgc 2520

acccacctgg aatttaacat tagcggtacc ggcctgatgt acgagacggg tgaccacgtc 2580

ggtgtgtatt gcgagaacct gttggaaacc gtggaggagg ccgagaagtt gttgaacctg 2640

agcccgcaga cgtacttctc cgttcacacc gacaacgagg acggtacgcc gttgagcggc 2700

agcagcctgc cgccaccgtt tccgccgtgc accttgcgca cggcattgac caaatacgca 2760

gacttgactt ctgcaccgaa aaagtcggtg ctggtggcgc tggccgagta cgcatctgac 2820

cagggtgaag cggatcgttt gcgtttcttg gcgagcccga gcggcaaaga ggaatatgca 2880

cagtacatct tggcaagcca gcgcacgctg ctggaggtca tggcggagtt cccgtcggcg 2940

aaaccgccgc tgggtgtctt tttcgcgggt gtcgctccgc gcctgcagcc gcgtttctat 3000

tccattagct ctagcccgaa gatcgcaccg ttccgtattc acgtgacctg cgccctggtt 3060

tatgacaaat cccctaccgg tcgcgttcat aagggcatct gtagcacgtg gatgaaaaat 3120

gcggtcccgc tggaagaaag caacgattgt tcctgggctc cgatcttcgt ccgcaacagc 3180

aacttcaagc tgccgaccga cccgaaggtt ccgattatca tgattggtcc gggtaccggt 3240

ctggcccctt ttcgtggctt tttgcaagag cgcttggcgt tgaaagagag cggtgctgaa 3300

ttgggtccgg cgatcttgtt ctttggttgc cgtaaccgta aaatggactt tatttacgag 3360

gatgaactga atgatttcgt caaagcgggc gttgtcagcg agctgatcgt cgcttttagc 3420

cgcgaaggcc cgatgaaaga atacgtgcaa cacaaaatga gccaacgtgc ctccgatgtg 3480

tggaacatca ttagcgacgg tggttatgtt tatgtttgcg gtgacgcgaa gggtatggct 3540

cgtgatgttc accgtaccct gcataccatc gcacaggagc aaggtagcat gtccagctcg 3600

gaggccgaag gtatggtcaa aaacctgcaa accaccggtc gttacctgcg tgatgtgtgg 3660

taataaaagc ttaggaggta aaaatgtcta ccctgccaat ttcttctgtg tcctttagct 3720

ccagcacttc gccactggtt gtcgatgaca aggtgagcac gaaaccggat gtgatccgtc 3780

acacgatgaa cttcaacgcg agcatttggg gcgatcaatt cctgacctat gacgagccgg 3840

aagatctggt aatgaagaaa caactggttg aggaacttaa agaagaagtg aagaaagaat 3900

tgatcaccat caagggtagc aacgagccga tgcaacatgt caagctgatc gagttgatcg 3960

acgcagttca acgcctgggc attgcctacc actttgaaga agagattgaa gaggccctgc 4020

agcacattca tgtcacctac ggtgagcagt gggtggacaa agagaatttg caatccatca 4080

gcctgtggtt tcgtctgctg cgtcaacagg gcttcaacgt gagcagcggt gtgtttaaag 4140

atttcatgga cgaaaagggt aagtttaaag agtccctgtg caatgatgca cagggtattt 4200

tggcgctgta tgaggccgca ttcatgcgcg ttgaagatga aaccattctg gacaacgctc 4260

tggagttcac caaggtgcat ctggacatca tcgctaagga cccgagctgt gattctagcc 4320

tgcgcacgca gattcaccag gctctgaagc agccgctgcg ccgtcgcctg gcacgtattg 4380

aggcgttaca ctatatgccg atctatcagc aagagactag ccatgacgaa gttctgctga 4440

aactggcaaa gctggacttt agcgttctgc agagcatgca caagaaagaa ctcagccata 4500

tttgcaagtg gtggaaagat ctggatctgc agaataagct gccgtacgtt cgtgaccgtg 4560

tcgttgaggg ctatttctgg atcttgagca tttactacga gccgcaacat gcgcgtaccc 4620

gtatgttcct gatgaaaacc tgtatgtggt tggttgtgct ggacgacacg tttgataact 4680

acggcacgta cgaagagttg gagattttca cccaagcggt agaacgttgg agcatctcgt 4740

gtctggacat gctgcctgag tatatgaagc tgatctacca ggaactggtc aatttacacg 4800

tcgagatgga agagagcctg gagaaagaag gcaagaccta tcagattcac tatgtgaaag 4860

aaatggcgaa agaactggtc cgcaactacc tggttgaggc gcgctggctg aaagagggct 4920

acatgccgac cctggaagag tatatgagcg tgagcatggt cacgggtacg tacggtctga 4980

tgatcgcccg cagctacgtc ggtcgtggcg acatcgttac cgaagatacc ttcaaatggg 5040

tttctagcta cccgccgatt atcaaggcaa gctgcgttat cgtgcgtttg atggatgata 5100

ttgttagcca caaagaagaa caagagcgtg gtcatgttgc tagcagcatt gagtgctaca 5160

gcaaagagtc cggtgcaagc gaagaagaag cgtgcgagta tatcagccgt aaggtcgagg 5220

acgcgtggaa agtcattaat cgcgagtccc tgcgtccgac cgcggttccg tttccgctgc 5280

tgatgcctgc gattaatctg gcgcgtatgt gtgaggtcct gtacagcgtg aatgacggtt 5340

ttacgcacgc cgagggtgat atgaaaagct atatgaagtc attctttgtg cacccgatgg 5400

ttgtgtaaga attc 5414

<210> 88

<211> 5435

<212> DNA

<213> 人工序列

<220>

<223> SaCP10374-CPRm-AaBFS, 用作编码SaCP10374、CPRm和β-法呢烯合酶的合成

操纵子

<400> 88

catatggcac tgctgctggc tgtcttttgg agcgcactga ttattctgac ccgcaaacgc 60

cgcaaaggtc cgggtctgcc accgggtccg cgtgcgtacc cgattattgg caatctgcac 120

atgatgggcc agctgccaca ccacaatttg cgtgagctgg cacgtgagta tggtccgatt 180

atgagcatgc gcctgggtct ggtgccggca atcgtggtta gctctcctga ggctgcgcag 240

ctgttcctca agacgcatga taccgttttc gcgagccgtc caaagaccga gactgccaaa 300

tacttccatt acggtatcaa aggtctgatc ctgaccgagt atggcccgta ctggcgcaat 360

attcgtcgtt tgagcaccgt taagctgttg aatgccgcga aaatcgatag cttcgcggct 420

atgcgtagaa gcgaagttga acgcctggtc gcgtccgttc gtggttcggc ggttcgtcgt 480

gaggttgtgg acgtcagcag caaagtggcg gaagctatgg agaatatggt ctgccagatg 540

gttatcggcc gttcaggtga cgatcgtttt aagctgaaag aaacctttca agagggcacc 600

caactggcag gcgcgttcaa ttttggtgag tttgtgccgt ttctgctgcc gctggacttg 660

caaggtatta cccgtcgcat caaagaagtc agcactcgtt tcaataagat tttggacctg 720

atcgttgacg agcacattcg cgatgccgct ggtaccaaaa acagcggcgg tcgtgatagc 780

gacaattttc tggatgttct gctgtccttg atgaacacct ctattagcga tagcaatgac 840

acgggtgaca acaaccgtaa caacgtgatc gagcgtgata acattaaagc gatcctgacg 900

gacatgctgg gtgcagcgat ggacacgagc gcgagcacgg tcgagtggac gatctccgaa 960

ctgtttcgcc acccgaaaac catgcagaag ctgcaagcag aaatccgtgg tgtcgtgggc 1020

ccgacccgca atgtgagcga agatgacttg ccgaagctga cctatctgga catggtcgtt 1080

aaggaaggca tgcgtttgca tccggccgtg ccgctgcttc tgccgcatga gtctctggaa 1140

gaagccacga tcgatggcta ctacattccg aagggttccc gcattctgat caacgtctgg 1200

gcgattggtc gcgacccgaa ggcctggccg gatcgtcctg aagagttcat cccggagcgt 1260

ttcgagaaaa gcaacgtgga tgtgctgggc cgtgacttcc agctgctgcc gtttggttcg 1320

ggtcgtcgcg gttgtgcagg cattcgcctg ggcctgatct tcgtacgtct ggttctggca 1380

cagttagttc actgtttcga ctgggaactg gcgcgcaaca tggcgagcag cccggagaag 1440

ttggatatgg aagagaagtt cggcctggcg gtgcatcgtg tcaaccacct gaaagccctg 1500

ccgacgtatc gtctggagag ctaagtcgac taactttaag aaggagatat atccatggaa 1560

cctagctctc agaaactgtc tccgttggaa tttgttgctg ctatcctgaa gggcgactac 1620

agcagcggtc aggttgaagg tggtccaccg ccaggtctgg cagctatgtt gatggaaaat 1680

aaggatttgg tgatggttct gacgacgtcc gtggcagtcc tgatcggctg tgtcgtggtc 1740

ctggcatggc gtcgtgcggc aggtagcggt aagtacaagc aacctgaact gcctaaactg 1800

gtggtcccga aagcagccga accggaggag gcagaggatg ataaaaccaa gatcagcgtg 1860

tttttcggca cccaaaccgg tacggcagaa ggtttcgcga aggcttttgt tgaagaggcc 1920

aaggcgcgtt atcagcaggc ccgtttcaaa gttatcgacc tggacgacta tgcggcagac 1980

gatgacgagt acgaagagaa actgaagaag gaaaacttgg cattcttctt cttggcgtcc 2040

tacggtgacg gcgagccgac ggacaacgcg gcacgctttt acaaatggtt tacggagggt 2100

aaggaccgtg gtgaatggct gaacaatctg cagtacggcg tttttggtct gggtaaccgt 2160

caatatgagc atttcaataa gatcgccatt gtcgtcgatg atctgatctt cgagcaaggt 2220

ggcaagaagc tggttccggt gggtctgggt gacgatgacc agtgcattga ggatgatttt 2280

gcggcgtggc gtgaactggt ctggccggaa ctggataaac tgctgcgtaa cgaagacgac 2340

gctaccgtgg caaccccgta cagcgccgct gtgctgcaat accgcgtggt tttccacgat 2400

cacattgacg gcctgattag cgaaaacggt agcccgaacg gtcatgctaa tggcaatacc 2460

gtgtacgatg cgcaacaccc gtgccgtagc aacgtcgcgg tcaagaagga attgcatact 2520

ccggcgagcg atcgcagctg cacccacctg gaatttaaca ttagcggtac cggcctgatg 2580

tacgagacgg gtgaccacgt cggtgtgtat tgcgagaacc tgttggaaac cgtggaggag 2640

gccgagaagt tgttgaacct gagcccgcag acgtacttct ccgttcacac cgacaacgag 2700

gacggtacgc cgttgagcgg cagcagcctg ccgccaccgt ttccgccgtg caccttgcgc 2760

acggcattga ccaaatacgc agacttgact tctgcaccga aaaagtcggt gctggtggcg 2820

ctggccgagt acgcatctga ccagggtgaa gcggatcgtt tgcgtttctt ggcgagcccg 2880

agcggcaaag aggaatatgc acagtacatc ttggcaagcc agcgcacgct gctggaggtc 2940

atggcggagt tcccgtcggc gaaaccgccg ctgggtgtct ttttcgcggg tgtcgctccg 3000

cgcctgcagc cgcgtttcta ttccattagc tctagcccga agatcgcacc gttccgtatt 3060

cacgtgacct gcgccctggt ttatgacaaa tcccctaccg gtcgcgttca taagggcatc 3120

tgtagcacgt ggatgaaaaa tgcggtcccg ctggaagaaa gcaacgattg ttcctgggct 3180

ccgatcttcg tccgcaacag caacttcaag ctgccgaccg acccgaaggt tccgattatc 3240

atgattggtc cgggtaccgg tctggcccct tttcgtggct ttttgcaaga gcgcttggcg 3300

ttgaaagaga gcggtgctga attgggtccg gcgatcttgt tctttggttg ccgtaaccgt 3360

aaaatggact ttatttacga ggatgaactg aatgatttcg tcaaagcggg cgttgtcagc 3420

gagctgatcg tcgcttttag ccgcgaaggc ccgatgaaag aatacgtgca acacaaaatg 3480

agccaacgtg cctccgatgt gtggaacatc attagcgacg gtggttatgt ttatgtttgc 3540

ggtgacgcga agggtatggc tcgtgatgtt caccgtaccc tgcataccat cgcacaggag 3600

caaggtagca tgtccagctc ggaggccgaa ggtatggtca aaaacctgca aaccaccggt 3660

cgttacctgc gtgatgtgtg gtaataaaag cttaggaggt aaaaatgtct accctgccaa 3720

tttcttctgt gtcctttagc tccagcactt cgccactggt tgtcgatgac aaggtgagca 3780

cgaaaccgga tgtgatccgt cacacgatga acttcaacgc gagcatttgg ggcgatcaat 3840

tcctgaccta tgacgagccg gaagatctgg taatgaagaa acaactggtt gaggaactta 3900

aagaagaagt gaagaaagaa ttgatcacca tcaagggtag caacgagccg atgcaacatg 3960

tcaagctgat cgagttgatc gacgcagttc aacgcctggg cattgcctac cactttgaag 4020

aagagattga agaggccctg cagcacattc atgtcaccta cggtgagcag tgggtggaca 4080

aagagaattt gcaatccatc agcctgtggt ttcgtctgct gcgtcaacag ggcttcaacg 4140

tgagcagcgg tgtgtttaaa gatttcatgg acgaaaaggg taagtttaaa gagtccctgt 4200

gcaatgatgc acagggtatt ttggcgctgt atgaggccgc attcatgcgc gttgaagatg 4260

aaaccattct ggacaacgct ctggagttca ccaaggtgca tctggacatc atcgctaagg 4320

acccgagctg tgattctagc ctgcgcacgc agattcacca ggctctgaag cagccgctgc 4380

gccgtcgcct ggcacgtatt gaggcgttac actatatgcc gatctatcag caagagacta 4440

gccatgacga agttctgctg aaactggcaa agctggactt tagcgttctg cagagcatgc 4500

acaagaaaga actcagccat atttgcaagt ggtggaaaga tctggatctg cagaataagc 4560

tgccgtacgt tcgtgaccgt gtcgttgagg gctatttctg gatcttgagc atttactacg 4620

agccgcaaca tgcgcgtacc cgtatgttcc tgatgaaaac ctgtatgtgg ttggttgtgc 4680

tggacgacac gtttgataac tacggcacgt acgaagagtt ggagattttc acccaagcgg 4740

tagaacgttg gagcatctcg tgtctggaca tgctgcctga gtatatgaag ctgatctacc 4800

aggaactggt caatttacac gtcgagatgg aagagagcct ggagaaagaa ggcaagacct 4860

atcagattca ctatgtgaaa gaaatggcga aagaactggt ccgcaactac ctggttgagg 4920

cgcgctggct gaaagagggc tacatgccga ccctggaaga gtatatgagc gtgagcatgg 4980

tcacgggtac gtacggtctg atgatcgccc gcagctacgt cggtcgtggc gacatcgtta 5040

ccgaagatac cttcaaatgg gtttctagct acccgccgat tatcaaggca agctgcgtta 5100

tcgtgcgttt gatggatgat attgttagcc acaaagaaga acaagagcgt ggtcatgttg 5160

ctagcagcat tgagtgctac agcaaagagt ccggtgcaag cgaagaagaa gcgtgcgagt 5220

atatcagccg taaggtcgag gacgcgtgga aagtcattaa tcgcgagtcc ctgcgtccga 5280

ccgcggttcc gtttccgctg ctgatgcctg cgattaatct ggcgcgtatg tgtgaggtcc 5340

tgtacagcgt gaatgacggt tttacgcacg ccgagggtga tatgaaaagc tatatgaagt 5400

cattctttgt gcacccgatg gttgtgtaag aattc 5435

<210> 89

<211> 5432

<212> DNA

<213> 人工序列

<220>

<223> SaCP816-CPRm-PaAFS, 用作编码SaCP816、CPRm和α-法呢烯合酶的合成操纵子

<400> 89

catatggcac tgttgttggc ggttttctgg agcgctttga ttattctggt tagcatctta 60

ttgcgtcgtc gtcaaaaacg caacaatttg ccaccgggcc caccggccct gccgatcatc 120

ggtaacattc acattctggg caccctgccg caccagagcc tgtacaatct ggcgaagaag 180

tacggtccga tcatgtccat gcgtttgggc ttggttccgg cggtggtcat cagcagcccg 240

gaagcggccg agctggtcct gaaaacccac gacatcgttt ttgcttctcg ccctcgtctg 300

caagttgcag attactttca ctatggcacc aaaggcgtga ttctgaccga atatggtacc 360

tactggcgta acatgcgtcg cctgtgcacg gtcaaactgc tgaacaccgt taagattgat 420

agctttgcag gcacccgcaa gaaagaagtc gctagcttcg ttcagagcct gaaagaagca 480

agcgtggcgc acaaaatggt taacctgtcc gcacgcgtcg ctaatgttat tgagaatatg 540

gtttgtctga tggttattgg tagatcgtct gacgagcgtt tcaagctgaa agaagtgatc 600

caagaagcgg cacagctggc gggtgccttc aatattggtg actatgtccc gtttctgatg 660

ccgctggatc tgcagggcct gactcgccgt atcaagagcg gtagcaaggc attcgatgac 720

atcctcgagg tcattatcga cgagcatgtg caagacatta aagatcatga cgatgagcag 780

catggtgact tcatcgacgt gctgctggcg atgatgaata agccgatgga ttctcgtgag 840

ggtctgtcca tcattgatcg cacgaacatt aaagcgatcc tggtggatat gatcggtgcc 900

gcgatggaca cgagcaccag cggtgtggag tgggcgattt cggagctgat taagcatcct 960

cgtgtcatga agaaactgca agacgaagtg aaaaccgtaa tcggtatgaa ccgcatggtg 1020

gaagaagcgg atctgccgaa actgccgtac ctggacatgg ttgtcaagga aacgatgcgt 1080

ctgcatccgc caggcccgct gctggtgccg cgtgaaagca tggaagatat tacgatcaac 1140

ggttactata tcccgaagaa atcccgcatt attgtgaatg catgggcgat cggccgtgac 1200

accaacgcct ggagcaataa tgcgcacgag tttttccctg agcgttttat gagctctaac 1260

gttgatctgc aaggccagga cttccagctg atcccgttcg gtagcggtcg tcgcggttgt 1320

ccgggcatgc gtctgggtct gacgacggtc cgcttggtgc tggcccaact gattcactgc 1380

ttcgacctgg agcttccgaa gggcaccgtc gcgactgacc tggatatgag cgagaagttt 1440

ggtctggcaa tgccgcgtgc gcagcactta ctggcctttc cgacctaccg tctggagagc 1500

taagtcgact aactttaaga aggagatata tccatggaac ctagctctca gaaactgtct 1560

ccgttggaat ttgttgctgc tatcctgaag ggcgactaca gcagcggtca ggttgaaggt 1620

ggtccaccgc caggtctggc agctatgttg atggaaaata aggatttggt gatggttctg 1680

acgacgtccg tggcagtcct gatcggctgt gtcgtggtcc tggcatggcg tcgtgcggca 1740

ggtagcggta agtacaagca acctgaactg cctaaactgg tggtcccgaa agcagccgaa 1800

ccggaggagg cagaggatga taaaaccaag atcagcgtgt ttttcggcac ccaaaccggt 1860

acggcagaag gtttcgcgaa ggcttttgtt gaagaggcca aggcgcgtta tcagcaggcc 1920

cgtttcaaag ttatcgacct ggacgactat gcggcagacg atgacgagta cgaagagaaa 1980

ctgaagaagg aaaacttggc attcttcttc ttggcgtcct acggtgacgg cgagccgacg 2040

gacaacgcgg cacgctttta caaatggttt acggagggta aggaccgtgg tgaatggctg 2100

aacaatctgc agtacggcgt ttttggtctg ggtaaccgtc aatatgagca tttcaataag 2160

atcgccattg tcgtcgatga tctgatcttc gagcaaggtg gcaagaagct ggttccggtg 2220

ggtctgggtg acgatgacca gtgcattgag gatgattttg cggcgtggcg tgaactggtc 2280

tggccggaac tggataaact gctgcgtaac gaagacgacg ctaccgtggc aaccccgtac 2340

agcgccgctg tgctgcaata ccgcgtggtt ttccacgatc acattgacgg cctgattagc 2400

gaaaacggta gcccgaacgg tcatgctaat ggcaataccg tgtacgatgc gcaacacccg 2460

tgccgtagca acgtcgcggt caagaaggaa ttgcatactc cggcgagcga tcgcagctgc 2520

acccacctgg aatttaacat tagcggtacc ggcctgatgt acgagacggg tgaccacgtc 2580

ggtgtgtatt gcgagaacct gttggaaacc gtggaggagg ccgagaagtt gttgaacctg 2640

agcccgcaga cgtacttctc cgttcacacc gacaacgagg acggtacgcc gttgagcggc 2700

agcagcctgc cgccaccgtt tccgccgtgc accttgcgca cggcattgac caaatacgca 2760

gacttgactt ctgcaccgaa aaagtcggtg ctggtggcgc tggccgagta cgcatctgac 2820

cagggtgaag cggatcgttt gcgtttcttg gcgagcccga gcggcaaaga ggaatatgca 2880

cagtacatct tggcaagcca gcgcacgctg ctggaggtca tggcggagtt cccgtcggcg 2940

aaaccgccgc tgggtgtctt tttcgcgggt gtcgctccgc gcctgcagcc gcgtttctat 3000

tccattagct ctagcccgaa gatcgcaccg ttccgtattc acgtgacctg cgccctggtt 3060

tatgacaaat cccctaccgg tcgcgttcat aagggcatct gtagcacgtg gatgaaaaat 3120

gcggtcccgc tggaagaaag caacgattgt tcctgggctc cgatcttcgt ccgcaacagc 3180

aacttcaagc tgccgaccga cccgaaggtt ccgattatca tgattggtcc gggtaccggt 3240

ctggcccctt ttcgtggctt tttgcaagag cgcttggcgt tgaaagagag cggtgctgaa 3300

ttgggtccgg cgatcttgtt ctttggttgc cgtaaccgta aaatggactt tatttacgag 3360

gatgaactga atgatttcgt caaagcgggc gttgtcagcg agctgatcgt cgcttttagc 3420

cgcgaaggcc cgatgaaaga atacgtgcaa cacaaaatga gccaacgtgc ctccgatgtg 3480

tggaacatca ttagcgacgg tggttatgtt tatgtttgcg gtgacgcgaa gggtatggct 3540

cgtgatgttc accgtaccct gcataccatc gcacaggagc aaggtagcat gtccagctcg 3600

gaggccgaag gtatggtcaa aaacctgcaa accaccggtc gttacctgcg tgatgtgtgg 3660

taataaaagc ttaggaggta aaaatggatc tggcagtgga aatcgcaatg gacctggcag 3720

tggatgatgt tgaacgccgt gtgggtgact atcattccaa tctgtgggac gacgacttca 3780

tccaaagcct gagcaccccg tatggcgcca gctcttaccg cgagcgtgcg gagcgcttgg 3840

tcggcgaggt caaagaaatg tttacgagca tcagcatcga ggatggtgag ctgacctctg 3900

acttgcttca acgcctgtgg atggttgaca acgttgagcg cctgggcatt agccgtcact 3960

tcgagaatga aatcaaggct gcaattgatt acgtctacag ctattggtcc gacaagggta 4020

ttgtccgtgg tagagatagc gccgtgccgg atctgaacag cattgctctg ggtttccgta 4080

cgttgcgtct gcatggttac accgttagca gcgatgtttt caaggtcttt caagaccgca 4140

aaggtgagtt tgcatgtagc gcgattccga cggaaggtga catcaaaggc gtactgaatc 4200

tgctgcgtgc aagctacatc gcgttccctg gtgagaaagt gatggagaaa gcgcagacct 4260

ttgccgcaac ttatctgaaa gaagcactgc agaagatcca agtgtctagc ctgagccgcg 4320

agatcgaata cgttctggag tatggctggc tgaccaattt tccgcgcctg gaagcgcgta 4380

actacatcga cgttttcggt gaggaaattt gtccgtactt caagaaaccg tgtattatgg 4440

ttgataagct gctggaactg gcgaaactgg agttcaattt gtttcactcg ctgcaacaga 4500

ccgagctgaa acacgtttcc cgttggtgga aggatagcgg ctttagccag ctgaccttca 4560

cgcgtcatcg tcacgtggag ttttacaccc tggctagctg tattgcaatt gaaccgaaac 4620

attctgcgtt tcgtttgggt ttcgcgaagg tctgctacct gggcattgtg ctggacgata 4680

tctatgacac gttcggtaaa atgaaagaac tggagttatt cacggcggca atcaagcgtt 4740

gggacccgag cacgaccgag tgcctgcctg agtatatgaa aggtgtctac atggcgtttt 4800

acaactgcgt taatgaactg gcgctgcaag ccgagaaaac ccagggccgt gacatgttga 4860

actatgcacg taaggcgtgg gaagccctgt tcgatgcgtt cctggaagaa gcgaagtgga 4920

ttagctccgg ctatctgccg acctttgagg aatacctgga gaacggcaaa gtgtccttcg 4980

gttatcgtgc tgccactctg cagccaatcc tgaccctgga cattccgttg ccgctgcaca 5040

tcttgcagca gatcgatttc ccgagccgct ttaatgacct ggccagctca attttgcgtc 5100

tgcgcggtga tatctgcggt tatcaagccg agcgttctcg tggcgaagag gcgagcagca 5160

ttagctgcta catgaaggac aatccgggtt ccaccgagga agatgcgctg agccacatta 5220

acgcgatgat ttcggacaac atcaacgaac tgaattggga gctgctgaag ccgaacagca 5280

atgttccaat cagcagcaaa aagcacgctt tcgatatcct gcgtgcgttt taccatctct 5340

ataagtaccg tgatggtttt agcattgcga agattgaaac gaagaacctg gtgatgcgca 5400

ccgtcctgga gccggtcccg atgtaagaat tc 5432

<210> 90

<211> 5453

<212> DNA

<213> 人工序列

<220>

<223> SaCP10374-CPRm-PaAFS, 用作编码SaCP10374、CPRm和α-法呢烯合酶的合成

操纵子

<400> 90

catatggcac tgctgctggc tgtcttttgg agcgcactga ttattctgac ccgcaaacgc 60

cgcaaaggtc cgggtctgcc accgggtccg cgtgcgtacc cgattattgg caatctgcac 120

atgatgggcc agctgccaca ccacaatttg cgtgagctgg cacgtgagta tggtccgatt 180

atgagcatgc gcctgggtct ggtgccggca atcgtggtta gctctcctga ggctgcgcag 240

ctgttcctca agacgcatga taccgttttc gcgagccgtc caaagaccga gactgccaaa 300

tacttccatt acggtatcaa aggtctgatc ctgaccgagt atggcccgta ctggcgcaat 360

attcgtcgtt tgagcaccgt taagctgttg aatgccgcga aaatcgatag cttcgcggct 420

atgcgtagaa gcgaagttga acgcctggtc gcgtccgttc gtggttcggc ggttcgtcgt 480

gaggttgtgg acgtcagcag caaagtggcg gaagctatgg agaatatggt ctgccagatg 540

gttatcggcc gttcaggtga cgatcgtttt aagctgaaag aaacctttca agagggcacc 600

caactggcag gcgcgttcaa ttttggtgag tttgtgccgt ttctgctgcc gctggacttg 660

caaggtatta cccgtcgcat caaagaagtc agcactcgtt tcaataagat tttggacctg 720

atcgttgacg agcacattcg cgatgccgct ggtaccaaaa acagcggcgg tcgtgatagc 780

gacaattttc tggatgttct gctgtccttg atgaacacct ctattagcga tagcaatgac 840

acgggtgaca acaaccgtaa caacgtgatc gagcgtgata acattaaagc gatcctgacg 900

gacatgctgg gtgcagcgat ggacacgagc gcgagcacgg tcgagtggac gatctccgaa 960

ctgtttcgcc acccgaaaac catgcagaag ctgcaagcag aaatccgtgg tgtcgtgggc 1020

ccgacccgca atgtgagcga agatgacttg ccgaagctga cctatctgga catggtcgtt 1080

aaggaaggca tgcgtttgca tccggccgtg ccgctgcttc tgccgcatga gtctctggaa 1140

gaagccacga tcgatggcta ctacattccg aagggttccc gcattctgat caacgtctgg 1200

gcgattggtc gcgacccgaa ggcctggccg gatcgtcctg aagagttcat cccggagcgt 1260

ttcgagaaaa gcaacgtgga tgtgctgggc cgtgacttcc agctgctgcc gtttggttcg 1320

ggtcgtcgcg gttgtgcagg cattcgcctg ggcctgatct tcgtacgtct ggttctggca 1380

cagttagttc actgtttcga ctgggaactg gcgcgcaaca tggcgagcag cccggagaag 1440

ttggatatgg aagagaagtt cggcctggcg gtgcatcgtg tcaaccacct gaaagccctg 1500

ccgacgtatc gtctggagtg ctaagtcgac taactttaag aaggagatat atccatggaa 1560

cctagctctc agaaactgtc tccgttggaa tttgttgctg ctatcctgaa gggcgactac 1620

agcagcggtc aggttgaagg tggtccaccg ccaggtctgg cagctatgtt gatggaaaat 1680

aaggatttgg tgatggttct gacgacgtcc gtggcagtcc tgatcggctg tgtcgtggtc 1740

ctggcatggc gtcgtgcggc aggtagcggt aagtacaagc aacctgaact gcctaaactg 1800

gtggtcccga aagcagccga accggaggag gcagaggatg ataaaaccaa gatcagcgtg 1860

tttttcggca cccaaaccgg tacggcagaa ggtttcgcga aggcttttgt tgaagaggcc 1920

aaggcgcgtt atcagcaggc ccgtttcaaa gttatcgacc tggacgacta tgcggcagac 1980

gatgacgagt acgaagagaa actgaagaag gaaaacttgg cattcttctt cttggcgtcc 2040

tacggtgacg gcgagccgac ggacaacgcg gcacgctttt acaaatggtt tacggagggt 2100

aaggaccgtg gtgaatggct gaacaatctg cagtacggcg tttttggtct gggtaaccgt 2160

caatatgagc atttcaataa gatcgccatt gtcgtcgatg atctgatctt cgagcaaggt 2220

ggcaagaagc tggttccggt gggtctgggt gacgatgacc agtgcattga ggatgatttt 2280

gcggcgtggc gtgaactggt ctggccggaa ctggataaac tgctgcgtaa cgaagacgac 2340

gctaccgtgg caaccccgta cagcgccgct gtgctgcaat accgcgtggt tttccacgat 2400

cacattgacg gcctgattag cgaaaacggt agcccgaacg gtcatgctaa tggcaatacc 2460

gtgtacgatg cgcaacaccc gtgccgtagc aacgtcgcgg tcaagaagga attgcatact 2520

ccggcgagcg atcgcagctg cacccacctg gaatttaaca ttagcggtac cggcctgatg 2580

tacgagacgg gtgaccacgt cggtgtgtat tgcgagaacc tgttggaaac cgtggaggag 2640

gccgagaagt tgttgaacct gagcccgcag acgtacttct ccgttcacac cgacaacgag 2700

gacggtacgc cgttgagcgg cagcagcctg ccgccaccgt ttccgccgtg caccttgcgc 2760

acggcattga ccaaatacgc agacttgact tctgcaccga aaaagtcggt gctggtggcg 2820

ctggccgagt acgcatctga ccagggtgaa gcggatcgtt tgcgtttctt ggcgagcccg 2880

agcggcaaag aggaatatgc acagtacatc ttggcaagcc agcgcacgct gctggaggtc 2940

atggcggagt tcccgtcggc gaaaccgccg ctgggtgtct ttttcgcggg tgtcgctccg 3000

cgcctgcagc cgcgtttcta ttccattagc tctagcccga agatcgcacc gttccgtatt 3060

cacgtgacct gcgccctggt ttatgacaaa tcccctaccg gtcgcgttca taagggcatc 3120

tgtagcacgt ggatgaaaaa tgcggtcccg ctggaagaaa gcaacgattg ttcctgggct 3180

ccgatcttcg tccgcaacag caacttcaag ctgccgaccg acccgaaggt tccgattatc 3240

atgattggtc cgggtaccgg tctggcccct tttcgtggct ttttgcaaga gcgcttggcg 3300

ttgaaagaga gcggtgctga attgggtccg gcgatcttgt tctttggttg ccgtaaccgt 3360

aaaatggact ttatttacga ggatgaactg aatgatttcg tcaaagcggg cgttgtcagc 3420

gagctgatcg tcgcttttag ccgcgaaggc ccgatgaaag aatacgtgca acacaaaatg 3480

agccaacgtg cctccgatgt gtggaacatc attagcgacg gtggttatgt ttatgtttgc 3540

ggtgacgcga agggtatggc tcgtgatgtt caccgtaccc tgcataccat cgcacaggag 3600

caaggtagca tgtccagctc ggaggccgaa ggtatggtca aaaacctgca aaccaccggt 3660

cgttacctgc gtgatgtgtg gtaataaaag cttaggaggt aaaaatggat ctggcagtgg 3720

aaatcgcaat ggacctggca gtggatgatg ttgaacgccg tgtgggtgac tatcattcca 3780

atctgtggga cgacgacttc atccaaagcc tgagcacccc gtatggcgcc agctcttacc 3840

gcgagcgtgc ggagcgcttg gtcggcgagg tcaaagaaat gtttacgagc atcagcatcg 3900

aggatggtga gctgacctct gacttgcttc aacgcctgtg gatggttgac aacgttgagc 3960

gcctgggcat tagccgtcac ttcgagaatg aaatcaaggc tgcaattgat tacgtctaca 4020

gctattggtc cgacaagggt attgtccgtg gtagagatag cgccgtgccg gatctgaaca 4080

gcattgctct gggtttccgt acgttgcgtc tgcatggtta caccgttagc agcgatgttt 4140

tcaaggtctt tcaagaccgc aaaggtgagt ttgcatgtag cgcgattccg acggaaggtg 4200

acatcaaagg cgtactgaat ctgctgcgtg caagctacat cgcgttccct ggtgagaaag 4260

tgatggagaa agcgcagacc tttgccgcaa cttatctgaa agaagcactg cagaagatcc 4320

aagtgtctag cctgagccgc gagatcgaat acgttctgga gtatggctgg ctgaccaatt 4380

ttccgcgcct ggaagcgcgt aactacatcg acgttttcgg tgaggaaatt tgtccgtact 4440

tcaagaaacc gtgtattatg gttgataagc tgctggaact ggcgaaactg gagttcaatt 4500

tgtttcactc gctgcaacag accgagctga aacacgtttc ccgttggtgg aaggatagcg 4560

gctttagcca gctgaccttc acgcgtcatc gtcacgtgga gttttacacc ctggctagct 4620

gtattgcaat tgaaccgaaa cattctgcgt ttcgtttggg tttcgcgaag gtctgctacc 4680

tgggcattgt gctggacgat atctatgaca cgttcggtaa aatgaaagaa ctggagttat 4740

tcacggcggc aatcaagcgt tgggacccga gcacgaccga gtgcctgcct gagtatatga 4800

aaggtgtcta catggcgttt tacaactgcg ttaatgaact ggcgctgcaa gccgagaaaa 4860

cccagggccg tgacatgttg aactatgcac gtaaggcgtg ggaagccctg ttcgatgcgt 4920

tcctggaaga agcgaagtgg attagctccg gctatctgcc gacctttgag gaatacctgg 4980

agaacggcaa agtgtccttc ggttatcgtg ctgccactct gcagccaatc ctgaccctgg 5040

acattccgtt gccgctgcac atcttgcagc agatcgattt cccgagccgc tttaatgacc 5100

tggccagctc aattttgcgt ctgcgcggtg atatctgcgg ttatcaagcc gagcgttctc 5160

gtggcgaaga ggcgagcagc attagctgct acatgaagga caatccgggt tccaccgagg 5220

aagatgcgct gagccacatt aacgcgatga tttcggacaa catcaacgaa ctgaattggg 5280

agctgctgaa gccgaacagc aatgttccaa tcagcagcaa aaagcacgct ttcgatatcc 5340

tgcgtgcgtt ttaccatctc tataagtacc gtgatggttt tagcattgcg aagattgaaa 5400

cgaagaacct ggtgatgcgc accgtcctgg agccggtccc gatgtaagaa ttc 5453

<210> 91

<211> 5370

<212> DNA

<213> 人工序列

<220>

<223> SaCP10374-CPRm-ClTps2, 用作编码SaCP10374、CPRm和α-檀香萜

合酶的合成操纵子

<400> 91

catatggcac tgctgctggc tgtcttttgg agcgcactga ttattctgac ccgcaaacgc 60

cgcaaaggtc cgggtctgcc accgggtccg cgtgcgtacc cgattattgg caatctgcac 120

atgatgggcc agctgccaca ccacaatttg cgtgagctgg cacgtgagta tggtccgatt 180

atgagcatgc gcctgggtct ggtgccggca atcgtggtta gctctcctga ggctgcgcag 240

ctgttcctca agacgcatga taccgttttc gcgagccgtc caaagaccga gactgccaaa 300

tacttccatt acggtatcaa aggtctgatc ctgaccgagt atggcccgta ctggcgcaat 360

attcgtcgtt tgagcaccgt taagctgttg aatgccgcga aaatcgatag cttcgcggct 420

atgcgtagaa gcgaagttga acgcctggtc gcgtccgttc gtggttcggc ggttcgtcgt 480

gaggttgtgg acgtcagcag caaagtggcg gaagctatgg agaatatggt ctgccagatg 540

gttatcggcc gttcaggtga cgatcgtttt aagctgaaag aaacctttca agagggcacc 600

caactggcag gcgcgttcaa ttttggtgag tttgtgccgt ttctgctgcc gctggacttg 660

caaggtatta cccgtcgcat caaagaagtc agcactcgtt tcaataagat tttggacctg 720

atcgttgacg agcacattcg cgatgccgct ggtaccaaaa acagcggcgg tcgtgatagc 780

gacaattttc tggatgttct gctgtccttg atgaacacct ctattagcga tagcaatgac 840

acgggtgaca acaaccgtaa caacgtgatc gagcgtgata acattaaagc gatcctgacg 900

gacatgctgg gtgcagcgat ggacacgagc gcgagcacgg tcgagtggac gatctccgaa 960

ctgtttcgcc acccgaaaac catgcagaag ctgcaagcag aaatccgtgg tgtcgtgggc 1020

ccgacccgca atgtgagcga agatgacttg ccgaagctga cctatctgga catggtcgtt 1080

aaggaaggca tgcgtttgca tccggccgtg ccgctgcttc tgccgcatga gtctctggaa 1140

gaagccacga tcgatggcta ctacattccg aagggttccc gcattctgat caacgtctgg 1200

gcgattggtc gcgacccgaa ggcctggccg gatcgtcctg aagagttcat cccggagcgt 1260

ttcgagaaaa gcaacgtgga tgtgctgggc cgtgacttcc agctgctgcc gtttggttcg 1320

ggtcgtcgcg gttgtgcagg cattcgcctg ggcctgatct tcgtacgtct ggttctggca 1380

cagttagttc actgtttcga ctgggaactg gcgcgcaaca tggcgagcag cccggagaag 1440

ttggatatgg aagagaagtt cggcctggcg gtgcatcgtg tcaaccacct gaaagccctg 1500

ccgacgtatc gtctggagtg ctaagtcgac taactttaag aaggagatat atccatggaa 1560

cctagctctc agaaactgtc tccgttggaa tttgttgctg ctatcctgaa gggcgactac 1620

agcagcggtc aggttgaagg tggtccaccg ccaggtctgg cagctatgtt gatggaaaat 1680

aaggatttgg tgatggttct gacgacgtcc gtggcagtcc tgatcggctg tgtcgtggtc 1740

ctggcatggc gtcgtgcggc aggtagcggt aagtacaagc aacctgaact gcctaaactg 1800

gtggtcccga aagcagccga accggaggag gcagaggatg ataaaaccaa gatcagcgtg 1860

tttttcggca cccaaaccgg tacggcagaa ggtttcgcga aggcttttgt tgaagaggcc 1920

aaggcgcgtt atcagcaggc ccgtttcaaa gttatcgacc tggacgacta tgcggcagac 1980

gatgacgagt acgaagagaa actgaagaag gaaaacttgg cattcttctt cttggcgtcc 2040

tacggtgacg gcgagccgac ggacaacgcg gcacgctttt acaaatggtt tacggagggt 2100

aaggaccgtg gtgaatggct gaacaatctg cagtacggcg tttttggtct gggtaaccgt 2160

caatatgagc atttcaataa gatcgccatt gtcgtcgatg atctgatctt cgagcaaggt 2220

ggcaagaagc tggttccggt gggtctgggt gacgatgacc agtgcattga ggatgatttt 2280

gcggcgtggc gtgaactggt ctggccggaa ctggataaac tgctgcgtaa cgaagacgac 2340

gctaccgtgg caaccccgta cagcgccgct gtgctgcaat accgcgtggt tttccacgat 2400

cacattgacg gcctgattag cgaaaacggt agcccgaacg gtcatgctaa tggcaatacc 2460

gtgtacgatg cgcaacaccc gtgccgtagc aacgtcgcgg tcaagaagga attgcatact 2520

ccggcgagcg atcgcagctg cacccacctg gaatttaaca ttagcggtac cggcctgatg 2580

tacgagacgg gtgaccacgt cggtgtgtat tgcgagaacc tgttggaaac cgtggaggag 2640

gccgagaagt tgttgaacct gagcccgcag acgtacttct ccgttcacac cgacaacgag 2700

gacggtacgc cgttgagcgg cagcagcctg ccgccaccgt ttccgccgtg caccttgcgc 2760

acggcattga ccaaatacgc agacttgact tctgcaccga aaaagtcggt gctggtggcg 2820

ctggccgagt acgcatctga ccagggtgaa gcggatcgtt tgcgtttctt ggcgagcccg 2880

agcggcaaag aggaatatgc acagtacatc ttggcaagcc agcgcacgct gctggaggtc 2940

atggcggagt tcccgtcggc gaaaccgccg ctgggtgtct ttttcgcggg tgtcgctccg 3000

cgcctgcagc cgcgtttcta ttccattagc tctagcccga agatcgcacc gttccgtatt 3060

cacgtgacct gcgccctggt ttatgacaaa tcccctaccg gtcgcgttca taagggcatc 3120

tgtagcacgt ggatgaaaaa tgcggtcccg ctggaagaaa gcaacgattg ttcctgggct 3180

ccgatcttcg tccgcaacag caacttcaag ctgccgaccg acccgaaggt tccgattatc 3240

atgattggtc cgggtaccgg tctggcccct tttcgtggct ttttgcaaga gcgcttggcg 3300

ttgaaagaga gcggtgctga attgggtccg gcgatcttgt tctttggttg ccgtaaccgt 3360

aaaatggact ttatttacga ggatgaactg aatgatttcg tcaaagcggg cgttgtcagc 3420

gagctgatcg tcgcttttag ccgcgaaggc ccgatgaaag aatacgtgca acacaaaatg 3480

agccaacgtg cctccgatgt gtggaacatc attagcgacg gtggttatgt ttatgtttgc 3540

ggtgacgcga agggtatggc tcgtgatgtt caccgtaccc tgcataccat cgcacaggag 3600

caaggtagca tgtccagctc ggaggccgaa ggtatggtca aaaacctgca aaccaccggt 3660

cgttacctgc gtgatgtgtg gtaataaaag cttgaaggag atatactaat gtctacccag 3720

caggttagct ccgagaatat cgttcgcaac gcggcgaact tccacccgaa tatctggggt 3780

aatcatttct tgacgtgtcc aagccagacg atcgattctt ggacgcaaca acaccataaa 3840

gagctgaaag aagaggtccg caagatgatg gtgagcgacg caaacaaacc ggcacaacgt 3900

ctgcgtctga ttgacaccgt tcaacgtttg ggcgtggcgt atcatttcga aaaagaaatc 3960

gatgacgctc tggaaaagat cggtcacgat ccgtttgacg ataaggatga cctgtatatc 4020

gttagcctgt gttttcgcct gctgcgtcag catggcatca agattagctg cgatgttttt 4080

gagaagttca aagacgacga tggcaagttt aaggcttccc tgatgaatga tgtccaaggt 4140

atgctgtcgt tgtatgaagc ggcccacctg gcaattcatg gcgaggacat cctggatgag 4200

gctattgtct ttacgaccac ccacctgaag agcaccgttt ctaactcccc ggtcaattcc 4260

acctttgcgg aacagattcg ccacagcctg cgtgtgccgc tgcgtaaggc agtcccgcgt 4320

ttggagagcc gctacttcct ggatatctat agccgtgacg acctgcacga caagactctg 4380

ctgaactttg ccaaactgga cttcaacatc ctgcaggcga tgcaccagaa agaggcaagc 4440

gagatgaccc gttggtggcg tgatttcgat ttcctgaaga agctgccgta cattcgtgat 4500

cgcgtggttg aactgtactt ttggattttg gtcggtgtga gctaccaacc gaaattcagc 4560

acgggtcgta tctttttgag caagattatc tgtctggaaa ccctggtgga cgacacgttt 4620

gatgcgtacg gtactttcga cgaactggcc attttcaccg aggccgttac gcgttgggac 4680

ctgggtcatc gcgacgcgct gcctgagtac atgaaattca ttttcaagac cctgattgat 4740

gtgtacagcg aggcggaaca agagctggca aaagagggcc gctcctatag cattcactat 4800

gcgatccgta gcttccagga gttggtcatg aagtactttt gcgaggcgaa atggctgaat 4860

aagggttatg ttccgagcct ggatgactac aagagcgtca gcctgcgcag catcggcttc 4920

ctgccgatcg ccgtggcttc ttttgttttc atgggcgaca ttgctacgaa agaggttttt 4980

gagtgggaaa tgaataaccc gaaaatcatc atcgcagccg aaaccatttt ccgctttctg 5040

gatgacattg caggtcatcg cttcgaacaa aaacgtgagc acagcccgag cgcaatcgag 5100

tgctacaaaa accaacatgg tgtctcggaa gaagaggcag tgaaagcgct gagcttggag 5160

gtcgccaatt cgtggaaaga cattaacgaa gagctgctgc tgaaccctat ggcaattcca 5220

ctgccgttgc tgcaggtgat cctggatttg agccgtagcg cggacttcat gtacggtaat 5280

gcgcaggacc gtttcacgca ctccaccatg atgaaagatc aagttgacct ggttctgaaa 5340

gatccggtga aactggacga ttaagaattc 5370

<210> 92

<211> 5423

<212> DNA

<213> 人工序列

<220>

<223> SaCP10374-CPRm-SaTps8201, 用作编码SaCP10374, CPRm和α-/β-檀香萜合酶

的合成操纵子

<400> 92

catatggcac tgctgctggc tgtcttttgg agcgcactga ttattctgac ccgcaaacgc 60

cgcaaaggtc cgggtctgcc accgggtccg cgtgcgtacc cgattattgg caatctgcac 120

atgatgggcc agctgccaca ccacaatttg cgtgagctgg cacgtgagta tggtccgatt 180

atgagcatgc gcctgggtct ggtgccggca atcgtggtta gctctcctga ggctgcgcag 240

ctgttcctca agacgcatga taccgttttc gcgagccgtc caaagaccga gactgccaaa 300

tacttccatt acggtatcaa aggtctgatc ctgaccgagt atggcccgta ctggcgcaat 360

attcgtcgtt tgagcaccgt taagctgttg aatgccgcga aaatcgatag cttcgcggct 420

atgcgtagaa gcgaagttga acgcctggtc gcgtccgttc gtggttcggc ggttcgtcgt 480

gaggttgtgg acgtcagcag caaagtggcg gaagctatgg agaatatggt ctgccagatg 540

gttatcggcc gttcaggtga cgatcgtttt aagctgaaag aaacctttca agagggcacc 600

caactggcag gcgcgttcaa ttttggtgag tttgtgccgt ttctgctgcc gctggacttg 660

caaggtatta cccgtcgcat caaagaagtc agcactcgtt tcaataagat tttggacctg 720

atcgttgacg agcacattcg cgatgccgct ggtaccaaaa acagcggcgg tcgtgatagc 780

gacaattttc tggatgttct gctgtccttg atgaacacct ctattagcga tagcaatgac 840

acgggtgaca acaaccgtaa caacgtgatc gagcgtgata acattaaagc gatcctgacg 900

gacatgctgg gtgcagcgat ggacacgagc gcgagcacgg tcgagtggac gatctccgaa 960

ctgtttcgcc acccgaaaac catgcagaag ctgcaagcag aaatccgtgg tgtcgtgggc 1020

ccgacccgca atgtgagcga agatgacttg ccgaagctga cctatctgga catggtcgtt 1080

aaggaaggca tgcgtttgca tccggccgtg ccgctgcttc tgccgcatga gtctctggaa 1140

gaagccacga tcgatggcta ctacattccg aagggttccc gcattctgat caacgtctgg 1200

gcgattggtc gcgacccgaa ggcctggccg gatcgtcctg aagagttcat cccggagcgt 1260

ttcgagaaaa gcaacgtgga tgtgctgggc cgtgacttcc agctgctgcc gtttggttcg 1320

ggtcgtcgcg gttgtgcagg cattcgcctg ggcctgatct tcgtacgtct ggttctggca 1380

cagttagttc actgtttcga ctgggaactg gcgcgcaaca tggcgagcag cccggagaag 1440

ttggatatgg aagagaagtt cggcctggcg gtgcatcgtg tcaaccacct gaaagccctg 1500

ccgacgtatc gtctggagtg ctaagtcgac taactttaag aaggagatat atccatggaa 1560

cctagctctc agaaactgtc tccgttggaa tttgttgctg ctatcctgaa gggcgactac 1620

agcagcggtc aggttgaagg tggtccaccg ccaggtctgg cagctatgtt gatggaaaat 1680

aaggatttgg tgatggttct gacgacgtcc gtggcagtcc tgatcggctg tgtcgtggtc 1740

ctggcatggc gtcgtgcggc aggtagcggt aagtacaagc aacctgaact gcctaaactg 1800

gtggtcccga aagcagccga accggaggag gcagaggatg ataaaaccaa gatcagcgtg 1860

tttttcggca cccaaaccgg tacggcagaa ggtttcgcga aggcttttgt tgaagaggcc 1920

aaggcgcgtt atcagcaggc ccgtttcaaa gttatcgacc tggacgacta tgcggcagac 1980

gatgacgagt acgaagagaa actgaagaag gaaaacttgg cattcttctt cttggcgtcc 2040

tacggtgacg gcgagccgac ggacaacgcg gcacgctttt acaaatggtt tacggagggt 2100

aaggaccgtg gtgaatggct gaacaatctg cagtacggcg tttttggtct gggtaaccgt 2160

caatatgagc atttcaataa gatcgccatt gtcgtcgatg atctgatctt cgagcaaggt 2220

ggcaagaagc tggttccggt gggtctgggt gacgatgacc agtgcattga ggatgatttt 2280

gcggcgtggc gtgaactggt ctggccggaa ctggataaac tgctgcgtaa cgaagacgac 2340

gctaccgtgg caaccccgta cagcgccgct gtgctgcaat accgcgtggt tttccacgat 2400

cacattgacg gcctgattag cgaaaacggt agcccgaacg gtcatgctaa tggcaatacc 2460

gtgtacgatg cgcaacaccc gtgccgtagc aacgtcgcgg tcaagaagga attgcatact 2520

ccggcgagcg atcgcagctg cacccacctg gaatttaaca ttagcggtac cggcctgatg 2580

tacgagacgg gtgaccacgt cggtgtgtat tgcgagaacc tgttggaaac cgtggaggag 2640

gccgagaagt tgttgaacct gagcccgcag acgtacttct ccgttcacac cgacaacgag 2700

gacggtacgc cgttgagcgg cagcagcctg ccgccaccgt ttccgccgtg caccttgcgc 2760

acggcattga ccaaatacgc agacttgact tctgcaccga aaaagtcggt gctggtggcg 2820

ctggccgagt acgcatctga ccagggtgaa gcggatcgtt tgcgtttctt ggcgagcccg 2880

agcggcaaag aggaatatgc acagtacatc ttggcaagcc agcgcacgct gctggaggtc 2940

atggcggagt tcccgtcggc gaaaccgccg ctgggtgtct ttttcgcggg tgtcgctccg 3000

cgcctgcagc cgcgtttcta ttccattagc tctagcccga agatcgcacc gttccgtatt 3060

cacgtgacct gcgccctggt ttatgacaaa tcccctaccg gtcgcgttca taagggcatc 3120

tgtagcacgt ggatgaaaaa tgcggtcccg ctggaagaaa gcaacgattg ttcctgggct 3180

ccgatcttcg tccgcaacag caacttcaag ctgccgaccg acccgaaggt tccgattatc 3240

atgattggtc cgggtaccgg tctggcccct tttcgtggct ttttgcaaga gcgcttggcg 3300

ttgaaagaga gcggtgctga attgggtccg gcgatcttgt tctttggttg ccgtaaccgt 3360

aaaatggact ttatttacga ggatgaactg aatgatttcg tcaaagcggg cgttgtcagc 3420

gagctgatcg tcgcttttag ccgcgaaggc ccgatgaaag aatacgtgca acacaaaatg 3480

agccaacgtg cctccgatgt gtggaacatc attagcgacg gtggttatgt ttatgtttgc 3540

ggtgacgcga agggtatggc tcgtgatgtt caccgtaccc tgcataccat cgcacaggag 3600

caaggtagca tgtccagctc ggaggccgaa ggtatggtca aaaacctgca aaccaccggt 3660

cgttacctgc gtgatgtgtg gtaataaaag cttaggaggt aaaacatatg gacagcagca 3720

ccgccaccgc aatgaccgca ccattcatcg acccgacgga tcatgtgaat ctgaaaaccg 3780

acacggatgc gagcgaaaat cgtcgtatgg gtaactacaa gccgagcatt tggaactacg 3840

attttctgca gtccctggcg acgcaccaca acattgttga agagcgtcac ctgaagctgg 3900

cagagaaact gaaaggtcaa gtgaaattca tgttcggtgc gccgatggag ccattggcta 3960

agttggagct ggttgatgtg gtgcaacgct tgggtctgaa ccacctgttc gagactgaaa 4020

tcaaagaagc tctgttcagc atctacaaag atggcagcaa tggctggtgg tttggccatc 4080

tgcatgctac ctctttgcgc ttccgtctgt tgcgccaatg tggcctgttt atcccgcagg 4140

acgttttcaa aacctttcaa aacaagaccg gtgagtttga catgaagctg tgcgacaacg 4200

ttaagggcct gctgagcctg tacgaggcga gctacctggg ctggaagggc gagaacatct 4260

tggatgaagc aaaggcgttc acgaccaagt gcctgaagag cgcatgggag aacattagcg 4320

agaagtggct ggcgaagcgt gttaaacatg cgttggcgct gccgctgcac tggcgtgttc 4380

cgcgtattga agcacgctgg tttatcgagg cctacgaaca agaggccaat atgaatccga 4440

cgctgctgaa actggcgaaa ctggacttca acatggtcca aagcattcac cagaaagaaa 4500

tcggtgaact ggcccgctgg tgggttacta ccggcctgga caagctggcg ttcgcacgca 4560

acaatctgtt gcagtcttat atgtggagct gcgccatcgc gtccgacccg aaattcaaac 4620

tggcgcgtga aaccattgtc gagatcggtt ccgtgttgac ggttgtcgac gacggctatg 4680

atgtgtacgg ttctatcgat gagctggacc tgtacaccag ctcggtggag cgttggtcct 4740

gtgtcgagat tgacaagctg cctaatacgc tgaagctgat ctttatgtct atgttcaaca 4800

aaaccaacga ggtgggtctg cgtgttcaac acgagcgtgg ttacaatagc atcccgacct 4860

tcattaaggc gtgggtggaa cagtgtaaga gctatcaaaa agaggcgcgt tggtttcatg 4920

gtggtcacac gcctccgctg gaagaataca gcctgaacgg tctggtcagc attggttttc 4980

cgctgttgct gatcaccggc tatgttgcga ttgctgagaa tgaagcagcc ctggataaag 5040

tccacccgct gccggacctg ctgcattatt ccagcttgct gagccgtctg attaatgata 5100

tcggcactag cccggatgaa atggcgcgtg gtgacaatct gaagagcatt cactgctata 5160

tgaatgaaac cggtgccagc gaagaggtcg cacgcgagca catcaaaggc gtcatcgaag 5220

agaattggaa aattctgaac cagtgttgct ttgaccagtc ccagttccag gagccgttca 5280

tcacgtttaa cctgaacagc gtgcgcggct cgcatttctt ctatgaattt ggtgatggtt 5340

ttggtgttac cgacagctgg accaaggtgg atatgaaaag cgtcctgatt gatccgattc 5400

cgctgggtga agagtaagaa ttc 5423

<210> 93

<211> 53

<212> DNA

<213> 人工序列

<220>

<223> 诱变反向引物AV8-L358-rev

<220>

<221> misc_feature

<222> (22)..(22)

<223> n为a, c, g或t

<400> 93

cacgcggcat caccagcgga vncggcggat gcaggcgcag ggtttcttta atc 53

<210> 94

<211> 40

<212> DNA

<213> 人工序列

<220>

<223> 引物AV8-pcw-fw

<400> 94

catcgatgct taggaggtca tatggctctg ttattagcag 40

<210> 95

<211> 25

<212> DNA

<213> 人工序列

<220>

<223> 引物AV8-L358-fw

<400> 95

tccgctggtg atgccgcgtg agtgc 25

<210> 96

<211> 38

<212> DNA

<213> 人工序列

<220>

<223> 引物AV8-CPR-rev

<400> 96

atatatctcc ttcttaaagt tagtcgactc attaggtg 38

<210> 97

<211> 65

<212> DNA

<213> 人工序列

<220>

<223> 正向引物CPRm_aaBFS_Inf1

<400> 97

ttacctgcgt gatgtgtggt aataaaagct taggaggtaa aaatgtctac cctgccaatt 60

tcttc 65

<210> 98

<211> 63

<212> DNA

<213> 人工序列

<220>

<223> 反向引物AaBFS_Inf2

<400> 98

atgtttgaca gcttatcatc gataagctga attcttacac aaccatcggg tgcacaaaga 60

atg 63

<210> 99

<211> 65

<212> DNA

<213> 人工序列

<220>

<223> 正向引物CPRm_PaAFS_Inf1

<400> 99

ttacctgcgt gatgtgtggt aataaaagct taggaggtaa aaatggatct ggcagtggaa 60

atcgc 65

<210> 100

<211> 66

<212> DNA

<213> 人工序列

<220>

<223> 反向引物PaAFS_Inf2

<400> 100

ctcatgtttg acagcttatc atcgataagc tgaattctta catcgggacc ggctccagga 60

cggtgc 66

<210> 101

<211> 60

<212> DNA

<213> 人工序列

<220>

<223> 正向引物CPRm_Tps647_inf1

<400> 101

gcgtgatgtg tggtaataaa agcttaggag gtaaaaatgg cgaccgttgt ggatgattct 60

<210> 102

<211> 53

<212> DNA

<213> 人工序列

<220>

<223> 反向引物Tps647_Inf2

<400> 102

gcttatcatc gataagctga attcttactc ttcatccagg gtaatcgggt gga 53

<210> 103

<211> 58

<212> DNA

<213> 人工序列

<220>

<223> 正向引物CPRm_Tps30_Inf1

<400> 103

gcgtgatgtg tggtaataaa agcttaggag gtaaaaatgg acgcattcgc aacgagcc 58

<210> 104

<211> 59

<212> DNA

<213> 人工序列

<220>

<223> 反向引物Tps30_Inf2

<400> 104

gtgatgtgtg gtaataaaaa gctgaattct tagtcctctt cattcagcgg gatcgggtg 59

技术分类

06120112157475