What sort of workloads would be appropriate for use on Amazon EC2 Spot Instances?

2022-12-13 20:15 Asked by:

Amazon just announced "Spot Instances" for their EC2-based infrastructure. I was wondering what sort of workloads would be appropriate for such a service?

Spot Instances enable you to bid for unused Amazon EC2 capacity. Instances are charged the Spot Price set by Amazon EC2, which fluctuates periodically depending on the supply of and demand for Spot Instance capacity.

Interesting quote:

The philosophy of JIT is simple: inventory is waste.

EDITED:

I wonder if there are applications that could sustain themselves (read: be viable) just by leveraging a large volume of those Spot Instances. Think about it: imagine you get 10 instances for the price of 1 on average. Of course there wouldn't be any guarantees, but if no Spot Instances were available, a number of "normal" instances could be kicked off instead.
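The fallback idea above can be sketched as a small planning function. This is only an illustration under assumptions: `plan_capacity`, `LaunchPlan` and the `min_guaranteed` floor are hypothetical names, and no real EC2 API is called.

```python
from dataclasses import dataclass


@dataclass
class LaunchPlan:
    spot: int        # instances to request on the spot market
    on_demand: int   # "normal" instances to start as a fallback


def plan_capacity(wanted: int, spot_available: int, min_guaranteed: int) -> LaunchPlan:
    """Take as many spot instances as possible, then cover only the
    guaranteed minimum with on-demand instances."""
    spot = min(wanted, max(spot_available, 0))
    # On-demand instances fill the gap up to the floor, never beyond it.
    on_demand = max(min_guaranteed - spot, 0)
    return LaunchPlan(spot=spot, on_demand=on_demand)


print(plan_capacity(wanted=10, spot_available=10, min_guaranteed=2))
print(plan_capacity(wanted=10, spot_available=0, min_guaranteed=2))
```

The floor keeps the job moving when spot capacity disappears, while the bulk of the work still waits for cheap instances.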

Obviously this is for any workload that doesn't need to be real-time.

On a smaller scale, how could this apply to Stack Overflow? For example, many badges on this site are not calculated in real time. A periodic process evaluates eligibility, and it doesn't matter whether it runs at 4am or 4pm every day, as long as it runs. Doing it at 4am could be 5 cents cheaper. (Obviously they don't use EC2 at all for this.)

On a larger scale? A search engine over a large data set might need huge computing capacity to build its indexes. If you index new data once a day and it takes 2 hours to index it on hundreds of servers, you can do it overnight and save perhaps thousands of dollars every day.
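To put rough numbers on a nightly indexing run, here is a back-of-the-envelope sketch. All prices and server counts below are made up for illustration, the function name is hypothetical, and a flat spot rate for the whole run is assumed, which real spot prices do not guarantee.

```python
def daily_saving_cents(servers: int, hours: int,
                       on_demand_cents: int, spot_cents: int) -> int:
    """Saving for one daily run, in cents, from paying the spot rate
    instead of the on-demand rate (both in cents per instance-hour)."""
    return servers * hours * (on_demand_cents - spot_cents)


# Hypothetical figures: 300 servers for 2 hours, 40 vs 15 cents per instance-hour.
saving = daily_saving_cents(servers=300, hours=2, on_demand_cents=40, spot_cents=15)
print(f"${saving / 100:.2f} saved per run")
```

Working in whole cents avoids floating-point rounding in the multiplication. The actual saving scales linearly with fleet size, run length and the price gap, so plug in your own numbers.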

Spreading workloads around the clock helps Amazon maximize utilization of its resources and therefore offer the cheapest prices on the market.

Amazon could only think of these workloads:

Image and video processing, conversion and rendering
Scientific research data processing
Financial modeling and analysis

Spot Instances remind me of "double tariff electricity meters", where you pay less for energy when demand is lower. I think it is a very interesting concept, and quite an unexpected introduction to the cloud, but it will probably be difficult to apply to conventional business problems.

I am considering setting up a flexible cluster (say Hadoop) with a backbone that runs on regular instances and a few sets of additional instances bid at decreasing spot prices. As the price drops, additional instances become available to process work units. If the price increases, nodes will be shut down. The cluster handles this by re-issuing the work units to other nodes, just as it would in case of node failure.
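A minimal sketch of how the tiered setup behaves as the spot price moves. The `Tier` class, the bid values and the node counts are assumptions for illustration only; the rule that a tier runs while the spot price is at or below its bid follows the bidding model described in the question.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Tier:
    max_bid: float  # highest spot price this tier will pay (USD per hour, hypothetical)
    nodes: int      # number of extra worker nodes in this tier


def active_nodes(core_nodes: int, tiers: Sequence[Tier], spot_price: float) -> int:
    """Core (regular) nodes always run; a spot tier runs only while the
    current spot price is at or below that tier's bid."""
    return core_nodes + sum(t.nodes for t in tiers if spot_price <= t.max_bid)


# Hypothetical tiers: the lower the bid, the more nodes are attached to it.
TIERS = [Tier(0.10, 5), Tier(0.07, 10), Tier(0.04, 20)]

for price in (0.12, 0.08, 0.05, 0.03):
    print(f"spot price {price:.2f}: {active_nodes(4, TIERS, price)} nodes")
```

With these numbers the cluster grows from the 4 core nodes at a price of 0.12 to 39 nodes at 0.03, and only one tier drops out at a time as the price climbs back up.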

Obviously this is a rather hostile environment so some adjustments need to be made. If you work with standard 3-fold replication for the global filesystem and the three nodes containing the block are shut down at the same time, you lose. Spreading the spot instance prices decreases the likelihood of losing many in one fell swoop. Increasing the replication factor will reduce the impact, and disk space is free with the instance anyway so that won't be a factor. Is this enough? We'll see.
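To get a feel for the replication question, here is a rough estimate of the chance that a single block loses all of its replicas when several nodes vanish at once. It assumes replicas sit on distinct nodes picked uniformly at random, which is a simplification (real HDFS placement is rack-aware), and the function name is hypothetical.

```python
from math import comb


def p_block_lost(total_nodes: int, lost_nodes: int, replicas: int) -> float:
    """Probability that every replica of one block was on a node that was
    shut down, with replicas on distinct nodes chosen uniformly at random."""
    # comb() returns 0 when replicas > lost_nodes, so the result is 0.0 then.
    return comb(lost_nodes, replicas) / comb(total_nodes, replicas)


# Hypothetical cluster: 40 nodes, 20 of them spot nodes terminated together.
for r in (3, 4, 5):
    print(f"replication {r}: {p_block_lost(40, 20, r):.4f}")
```

This is the per-block figure. With many blocks in the filesystem, the chance that at least one block is lost is much higher, which is why spreading the bids so that fewer nodes disappear together matters as much as the replication factor.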

There are obvious use cases such as batch processing or tasks that don't need to be running 24/7.

Other interesting uses are for extra capacity. You could use a mix of on-demand and spot instances to run your website. The on-demand instances would serve as your 'core'. If your spot instances are down for a few hours here or there, your on-demand instances may work a little harder, but your website would still be accessible.
