There is one thing computers do really well, and that is iterating through lists.<br>I would suggest that getting the list from memcache and then processing it is not going to be any slower than getting a list of results from a heavy database query and iterating through that. Especially if you use get_multi instead of thousands of individual gets (that takes a little extra programming to detect an element that wasn't in the cache, but you may get better overall response times).
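A minimal sketch of the get_multi pattern described above, assuming a python-memcached-style client and an illustrative "friend:&lt;id&gt;" key scheme (the function and key names here are hypothetical, not from the original post):

```python
def fetch_friends(mc, friend_ids, load_from_db):
    """Fetch many cached values in one round trip, falling back to the DB."""
    keys = ["friend:%d" % fid for fid in friend_ids]
    cached = mc.get_multi(keys)  # one network round trip for all keys

    results = {}
    misses = []
    for fid, key in zip(friend_ids, keys):
        if key in cached:
            results[fid] = cached[key]
        else:
            misses.append(fid)  # this is the "detect an element that
                                # wasn't in the cache" bookkeeping

    # Only the misses hit the database; repopulate the cache as we go.
    for fid in misses:
        value = load_from_db(fid)
        mc.set("friend:%d" % fid, value)
        results[fid] = value
    return results
```

The extra bookkeeping is just comparing the keys you asked for against the keys that came back, which is usually well worth one round trip instead of thousands.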
<br><br>Of course, it's always useful to then store your PROCESSED data in memcache. So after you've iterated through your list and formatted or ordered the data into whatever form is useful to you, cache that result. Next time, use the cached version and you don't have to iterate again. If your data changes frequently, figure out how often it changes and use that interval as your expiry time.
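For example, caching the processed output with an expiry could look like this (a sketch, assuming a python-memcached-style client whose `set` takes an expiry in seconds as the `time` argument; the key scheme and function names are illustrative):

```python
def get_profile_html(mc, user_id, build_profile, ttl=300):
    """Return cached processed output, rebuilding it only on a cache miss.

    ttl: expiry in seconds, chosen from how often the underlying
    data actually changes.
    """
    key = "profile-html:%d" % user_id
    cached = mc.get(key)
    if cached is not None:
        return cached  # already-processed data; no re-iterating

    html = build_profile(user_id)   # the expensive iterate/format/order step
    mc.set(key, html, time=ttl)     # cache the finished result, not the raw list
    return html
```

The point is that the expensive iteration runs at most once per expiry window, and everyone else gets the finished result.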
<br><br><div><span class="gmail_quote">On 9/21/07, <b class="gmail_sendername">K J</b> <<a href="mailto:sanbat@gmail.com">sanbat@gmail.com</a>> wrote:</span><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
<div><span class="q">
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0px 0px 0px 0.8ex; padding-left: 1ex;">
<div style="">
<div>I just did some hand-wavy math on that:</div>
<div><br> </div>
<div>
<div>>>> import random</div>
<div>>>> r=random.Random()</div>
<div>>>> import sets</div>
<div>>>> s=sets.Set([r.randint(1, 10000000) for x in range(30000)])</div>
<div>>>> len(s)</div>
<div>29950</div>
<div>>>> len(','.join([str(i) for i in s]))</div>
<div>236301</div>
<div>>>> import zlib</div>
<div>
<div>>>> compressed=zlib.compress(','.join([str(i) for i in s]), 9)</div>
<div>>>> len(compressed)</div>
<div>109568</div><br> </div></div>
<div><span style="white-space: pre;"></span>So, a user with about 30k friends with numeric IDs fairly evenly distributed from 1 to 10,000,000 (my average ID was 4,979,961, min was 236, max was 9,999,931) would take a little over 100 KB of cache when comma-separated and compressed.
</div>
<div><br> </div>
<div><span style="white-space: pre;"></span>I would expect the normal case to be much smaller.</div></div></blockquote>
<div> </div></span>
<div>Great analysis!</div>
<div> </div>
<div>Thinking further on this issue though, gives me more questions...</div>
<div> </div>
<div>Storing a list of a user's friends in memcache, then fetching it back and searching through it... while it avoids the database altogether, wouldn't it be sort of slow? For instance, every time a user visits user B's profile page, the system would need to iterate through that list to see if the two are connected.
</div>
<div> </div>In this case, would it be better to use a MySQL HEAP (in-memory) table and query that instead?<br> </div>
</blockquote></div><br><br clear="all"><br>-- <br>"Be excellent to each other"