<rt id="bn8ez"></rt>
<label id="bn8ez"></label>

  • <span id="bn8ez"></span>

    <label id="bn8ez"><meter id="bn8ez"></meter></label>

    無為

    無為則可為,無為則至深!

      BlogJava :: 首頁 :: 聯系 :: 聚合  :: 管理
      190 Posts :: 291 Stories :: 258 Comments :: 0 Trackbacks

    There is every reason to do capacity planning for the data warehouse, DSS environment. Data warehouses grow at a tremendous rate. Data warehouse cost money. Data warehouses operate on a variety of technologies.Data warehouses have performance problems just like other technological environments.Therefore it only makes sense to plan your data warehouse environment.In doing so, you place your corporation in a proactive position, not a reactive one.

    Who Does Data Warehouse Capacity Planning?

    But come on. How many corporations do capacity planning for the data warehouse, DSS environment? And of those few corporations that actually do capacity planning, how many have done capacity planning for the data warehouse environment well and effectively?

    The answer is that there are at best a handful of companies that have done data warehouse, DSS capacity planning effectively.

    Does that mean that data warehouse DSS capacity planning should not be done? Not at all. There is every good reason to do capacity planning for the data warehouse environment. It just that capacity planning for the data warehouse, DSS environment is ... gulp...hard to do. It is ...gulp, gulp ... "different" than anything that most capacity planners have ever done before. therefore capacity planning for the data warehouse DSS environment is an oxymoron.

    Three Approaches

    This article will describe three approaches to doing capacity planning for the data warehouse DSS environment. After reading about the three approaches, you can decide why capacity planning for the data warehouse, DSS environment is so difficult. And if you are one of the hardy souls who likes to be proactive, you might even choose one of the approaches.

    What Needs To Be Planned?

    What is it that needs to be planned in the data warehouse DSS environment?While there are many facets to the data warehouse DSS environment, the two most important aspects of capacity planning are planning for storage and planning for processors.

    It is noteworthy that there are plenty of other things that come with the territory:

    • ???????? what dbms to use,
    • ???????? what form of networking to be used,
    • ???????? what end user access and analysis tools should be used,
    • ???????? what kind of integration and transformation tools should be used, and so forth.

    But the heart of the matter is how much storage and how many and what kind of processors should be used for the data warehouse.

    Key Factors

    The two key factors the capacity planner looks at are the amount of data there will be and the workload that will be run against the data warehouse.

    Unfortunately, both of these factors in the data warehouse DSS environment are very difficult to ascertain.

    The Analytical Approach

    The first approach to capacity planning is the analytical approach.The analytical approach is one where the capacity planner attempts to calculate and/or predict capacity needs before the equipment is purchased.In the analytical approach the analyst attempts to quantify such things as:

    • ???????? how many customers will be in the warehouse?,
    • ???????? at what rate will the customers grow?,
    • ???????? how many transactions will be in the warehouse?,
    • ???????? at what rate will the transactions grow?,
    • ???????? what other data will be in the warehouse?,
    • ???????? at what rate will the other data grow?,
    • ???????? what is the proper level of granularity for data in the warehouse?,
    • ???????? can the level of granularity be changed if needed?,
    • ???????? what amount of history is needed in the warehouse?,
    • ???????? will the user decide to add more history than anticipated?, and so forth.

    Each of these interrelated questions must be answered in order for the analyst to determine how much data there will be in the warehouse. And if you have ever struggled through an exercise of trying to accurately predict these questions, you know that accuracy is very difficult to come by. In all honesty, a good guess is about the best that can be achieved.

    But volumes of data are only one aspect of capacity planning. The other side of capacity planning in the data warehouse DSS environment is that of workload projection. And if you thought trying to predict volumes of data was difficult, wait until you try to predict what the workload for the DSS environment is going to look like.

    There are many factors that must be considered when trying to profile the data warehouse DSS workload. Some of the more interesting factors are:

    • ???????? how many farmers will you have?
    • ???????? how many explorers do you have?
    • ???????? what does the average farmer query look like?
    • ???????? what does the pattern of submission for the farmers look like?
    • ???????? what does the explorer query look like?
    • ???????? is there any pattern to the submission of analysis by the explorer community?
    • ???????? has the explorer community ever been addressed or graced with an infrastructure before?
    • ???????? are there predictable peaks and valleys of processing:
      • throughout the day?
      • throughout the month?
      • ?throughout the quarter?
    • ???????? will there be an attempt to use a resource governor? And so on.

    There are then many questions that need to be answered in order to portray the data warehouse DSS workload. As in the case of volumes of data, an accurate picture simply cannot be painted.

    But perhaps the biggest enemy of the analytical approach is that of the attitude of the community of people using the data warehouse DSS environment. In most cases the data warehouse DSS environment is one of discovery. People simply don't know what is going to happen until they get there. People don't know what they will do until they know what the possibilities are. And where people really don't know what they will do, trying to look into a crystal ball and predict what will happen is black magic.

    The Calibrated Extrapolation Approach

    Which leads to the second approach to capacity planning. That approach is the calibrated extrapolation approach.The calibrated extrapolation approach is one where there is at best a rudimentary attempt at analytical capacity planning. But after the first or second iteration of the warehouse is created and after the first few users have become enamored of the data warehouse, then careful track is kept for the warehouse and its usage. Over calibrated periods of time, the growth of the warehouse is tracked. Based on the incremental growth that is being measured, an extrapolation of future capacity needs is made.The extrapolation of capacity needs then becomes an educated guess.Of course the educated guess can be refined.The analyst can factor in known growth factors such as addition of new subject areas, addition of history, and the like. In doing so the analyst combines the best of the calibrated extrapolation approach and the analytical approach.

    But even when the calibrated extrapolation approach is used wisely and well, the calibrated extrapolation approach has only a short time horizon for effectiveness. In other words, trying to project outward into the long term future using the calibrated extrapolation approach is a dicey venture. Extrapolation can be done for three months or maybe even for six months. But anything beyond that is questionable.

    The Copycat Approach

    The third approach is the "copycat" approach. In the copycat approach you go and find some company with roughly the same characteristics as your company but where the company has advanced into data warehousing further than your company. In this case you simply ask what environment they are operating in and ask how things are going. The copycat approach is by far the easiest approach. When the copycat approach to capacity planning works well, nothing beats it.

    But there are pitfalls with the copycat approach. Some of the pitfalls are:

    • ???????? the company being examined has not fed you accurate information,
    • ???????? the company being examined has fundamental business and technological differences which you are not aware of,
    • ???????? the company being examined is affected by and is responding to business pressures which you are not aware of, and so forth,

    All of these factors mean that the comparison between your company and the examined company may produce very misleading results.

    ?

    Enter The Vendor

    There is of course a fourth alternative. That alternative is to let a hardware vendor come in and do capacity planning for you. This is surely the laziest way to go. But don't be surprised when the vendor discovers that the only way to meet your capacity needs is to buy the vendors hardware. In short, the capacity planning done by hardware and dbms vendors is an exercise in subtle hard selling. You may actually get some useful capacity projections. You will certainly get a hard sale for the vendors products.



    凡是有該標志的文章,都是該blog博主Caoer(草兒)原創,凡是索引、收藏
    、轉載請注明來處和原文作者。非常感謝。

    posted on 2006-09-07 13:41 草兒 閱讀(448) 評論(0)  編輯  收藏 所屬分類: BI and DM
    主站蜘蛛池模板: 亚洲v高清理论电影| 亚洲综合精品网站| 亚洲性色高清完整版在线观看| 免费国产叼嘿视频大全网站| 亚洲大成色www永久网站| 中文字幕乱理片免费完整的| 久久亚洲色一区二区三区| 久久国产乱子伦精品免费午夜 | 精品国产日韩久久亚洲| 最近最好的中文字幕2019免费| 亚洲国产成人综合| 免费A级毛片无码免费视| 亚洲成av人片在线天堂无| 日本不卡高清中文字幕免费| 久久无码av亚洲精品色午夜| 亚洲国产综合久久天堂| 国产精品hd免费观看| 久久精品国产亚洲AV网站| 日韩视频在线精品视频免费观看 | 免费人成在线观看播放a| 亚洲女同成av人片在线观看 | 在线观看免费a∨网站| 西西人体大胆免费视频| 亚洲午夜久久久影院| 最近2019中文字幕免费大全5| 亚洲人成图片网站| 亚洲中文字幕视频国产| 亚洲视频在线观看免费| 亚洲人成未满十八禁网站| 三上悠亚亚洲一区高清| 最近中文字幕mv免费高清视频8| 最新国产精品亚洲| 亚洲线精品一区二区三区 | 香蕉成人免费看片视频app下载| 亚洲色欲色欲www| 亚洲国产精品碰碰| 黄+色+性+人免费| 一区二区三区精品高清视频免费在线播放| 亚洲国产精品高清久久久| 成人激情免费视频| 国产麻豆一精品一AV一免费 |