We're evaluating an F540 and I'm wondering how many of you compared them against the Suns. Specifically we currently run 2 Ultra 170's plus arrays which both see about 1200-1400 nfs ops/sec average with frequent peaks of 2000 on one. The one box holds common areas, the other builds. I'm considering upgrading these to 2 Ultra 2's with dual 300Mhz chips (for performance and arrays), or one F630 (200GB).
From what I can see of the LADDIS numbers, the NetApps respond faster
than the Ultra 2 range and its not until you get to the Ultra 3000 that the response times are comparable, even though the Ultra 2's would sustain higher nfs ops. Is this something people have seen in practice?
I'm initially considering one F630 for price reasons and since the one box sees higher nfs/ops but the other box is the one needing the extra storage. The 'limit' of 4500 nfs/ops on the F630 concerns me in that combining the current nfs ops can easily put use near the upper limit of the box. What I don't have a feel for is how many nfs ops the current workload would actually put on a filer. If it responds faster to clients, then I'd actually expect the nfs ops to be even higher, putting me even nearer the limit or above it (although I know its not limited to 4500 nfs/ops, I assume its response would drop significantly). Have people found this to be the case?
Two F540's are a possibility, but we seem to be approaching their upper limit and if we go over there isn't room for expandability, whereas we could (eventually) add a second 630 and split the serving again if we choose to go this route. What kind of loading are people seeing on their netapps, e.g. 500nfs ops, 2000 nfs ops, any over the limit of the F540?
Dave
+--- In our lifetime, Dave Heiland dheiland@lucent.com wrote: | | We're evaluating an F540 and I'm wondering how many of you compared them | against the Suns. Specifically we currently run 2 Ultra 170's plus | arrays which both see about 1200-1400 nfs ops/sec average with frequent
We compared them briefly. It depends on what you are looking for. I have 100GB on my f540. If I put that much space on a Sun, it would take forever to boot, esp after a crash.
Since fast recovery was one of the main criteria, the Sun's were discounted. Even with the Veritas FS, it still has some pretty serious flaws.
There are folks out there running big Suns as NFS servers. Not sure how they like it.
Alexei
The 'limit' of 4500 nfs/ops on the F630 concerns me in that combining the current nfs ops can easily put use near the upper limit of the box. What I don't have a feel for is how many nfs ops the current workload would actually put on a filer.
It is important for people to understand that LADDIS is a very hard load, and in real-world environments people will often see much higher throughput than LADDIS would seem to predict. I recently got a note from a customer "complaining" that his F230 was running at a sustained load of over 3300 ops per second, which is over double the LADDIS "limit" for that system.
How can that be? Here are some reasons:
(1) Some NFS operations are much harder than others. READ and WRITE, for instance, are hard, because they move data. GETATTR and LOOKUP are easy, because they do simple lookups.
LADDIS does 22% READs and 15% WRITEs. LADDIS does 50% of GETATTRs plus LOOKUPs. Run nfsstat to see what your current servers are doing. Most sites have fewer hard ops and more easy ones.
Also, look at the network load to your server. LADDIS generates about 1.5 MB of reads and 1 MB of writes per thousand LADDIS ops. So 4500 LADDIS ops represent a total of 11 MB/sec. (That's mega-bytes, not mega-bits.) If the network load to your server is lighter, then your load is probably lighter than LADDIS.
(2) LADDIS has a very poor cache-hit rate, so disk I/O becomes a bottleneck. (LADDIS goes out of it's way to access a very large working set.)
If your working set fits reasonably well in memory, your system will perform much better. Look at the cache age or cache hit statistics on your existing server.
My point isn't that LADDIS is a bad benchmark! To test a server to it's breaking point an I/O intense load is a good thing. And a few real-world loads are as hard as LADDIS.
But don't assume that LADDIS results will limit the performance that you see in a particular real-world environment.
Dave