minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	e019f21bda	fix: trigger heal if one of the parts is not found (#11358 ) Previously we added a heal trigger when bit-rot checks failed; now extend that to trigger a heal when parts are not found either. This healing is only triggered if we can successfully decode the object, i.e. read quorum is still satisfied for the object.	4 years ago
Harshavardhana	f21d650ed4	fix: readData in bulk call using messagepack byte wrappers (#11228 ) This PR refactors the way we use buffers for O_DIRECT and re-uses those buffers for the messagepack reader and writer. Extensive benchmarking found that not all objects benefit; only objects smaller than 64KiB see a benefit overall. Benefits are seen for almost all objects from 1KiB - 32KiB. Beyond this no objects benefit from the bulk call approach, as the latency of bytes sent over the wire v/s streaming content directly from disk negate each other with no remarkable benefits. All other optimizations include reuse of msgp.Reader, msgp.Writer using sync.Pool's for all internode calls.	4 years ago
Harshavardhana	d0027c3c41	do not use large buffers if not necessary (#11220 ) Without this change, there is a performance regression for small object GETs; this brings the overall speed back to pre '59d363' commit days.	4 years ago
Harshavardhana	c4131c2798	feat: Small object optimization read data in single bulk call (#11207 )	4 years ago
Harshavardhana	029758cb20	fix: retain the previous UUID for newly replaced drives (#10759 ) Only newly replaced drives get the new `format.json`; this avoids disks reloading their in-memory reference format and ensures that drives are online without that reload. Keeping the reference format intact means UUIDs never change once drives are formatted.	4 years ago
Klaus Post	2d58a8d861	Add storage layer contexts (#10321 ) Add context to all (non-trivial) calls to the storage layer. Contexts are propagated through the REST client. - `context.TODO()` is left in place for the places where it needs to be added to the caller. - `endWalkCh` could probably be removed from the walkers, but no changes so far. The "dangerous" part is that now a caller disconnecting will propagate down, so a "delete" operation will now be interrupted. In some cases we might want to disconnect this functionality so the operation completes if it has started, leaving the system in a cleaner state.	4 years ago
Harshavardhana	4915433bd2	Support bucket versioning (#9377 ) - Implement a new xl.json 2.0.0 format to support versioning; this moves the entire marshaling logic to the POSIX layer, and the top layer always consumes a common FileInfo construct, which simplifies the metadata reads. - Implement list object versions - Migrate to siphash from crchash for new deployments for object placements. Fixes #2111	4 years ago
Klaus Post	4a007e3767	Prefer local disks when fetching data blocks (#9563 ) If the requested server is part of the set this will always read from the local disk, even if the disk contains a parity shard. In default setup there is a 50% chance that at least one shard that otherwise would have been fetched remotely will be read locally instead. It basically trades RPC call overhead for reed-solomon. On distributed localhost this seems to be fairly break-even, with a very small gain in throughput and latency. However on networked servers this should be a bigger 1MB objects, before: ``` Operation: GET. Concurrency: 32. Hosts: 4. Requests considered: 76257: * Avg: 25ms 50%: 24ms 90%: 32ms 99%: 42ms Fastest: 7ms Slowest: 67ms * First Byte: Average: 23ms, Median: 22ms, Best: 5ms, Worst: 65ms Throughput: * Average: 1213.68 MiB/s, 1272.63 obj/s (59.948s, starting 14:45:44 CEST) ``` After: ``` Operation: GET. Concurrency: 32. Hosts: 4. Requests considered: 78845: * Avg: 24ms 50%: 24ms 90%: 31ms 99%: 39ms Fastest: 8ms Slowest: 62ms * First Byte: Average: 22ms, Median: 21ms, Best: 6ms, Worst: 57ms Throughput: * Average: 1255.11 MiB/s, 1316.08 obj/s (59.938s, starting 14:43:58 CEST) ``` Bonus fix: Only ask for heal once on an object.	5 years ago
kannappanr	5ecac91a55	Replace Minio refs in docs with MinIO and links (#7494 )	6 years ago
Harshavardhana	df35d7db9d	Introduce staticcheck for stricter builds (#7035 )	6 years ago
Krishna Srinivas	98c950aacd	Streaming bitrot verification support (#7004 )	6 years ago
Krishna Srinivas	52f6d5aafc	Rename of structs and methods (#6230 ) Rename of ErasureStorage to Erasure (and rename of related variables and methods)	6 years ago
Krishna Srinivas	ce02ab613d	Simplify erasure code by separating bitrot from erasure code (#5959 )	6 years ago
kannappanr	f8a3fd0c2a	Create logger package and rename errorIf to LogIf (#5678 ) Removing message from error logging Replace errors.Trace with LogIf	7 years ago
Harshavardhana	c0721164be	Automatically set goroutines based on shardSize (#5346 ) Update reedsolomon library to enable feature to automatically set number of go-routines based on the input shard size, since shard size is sort of a constant in Minio for objects > 10MiB (default blocksize) klauspost reported around 15-20% improvement in performance numbers on older systems such as AVX and SSE3 ``` name old speed new speed delta Encode10x2x10000-8 5.45GB/s ± 1% 6.22GB/s ± 1% +14.20% (p=0.000 n=9+9) Encode100x20x10000-8 1.44GB/s ± 1% 1.64GB/s ± 1% +13.77% (p=0.000 n=10+10) Encode17x3x1M-8 10.0GB/s ± 5% 12.0GB/s ± 1% +19.88% (p=0.000 n=10+10) Encode10x4x16M-8 7.81GB/s ± 5% 8.56GB/s ± 5% +9.58% (p=0.000 n=10+9) Encode5x2x1M-8 15.3GB/s ± 2% 19.6GB/s ± 2% +28.57% (p=0.000 n=9+10) Encode10x2x1M-8 12.2GB/s ± 5% 15.0GB/s ± 5% +22.45% (p=0.000 n=10+10) Encode10x4x1M-8 7.84GB/s ± 1% 9.03GB/s ± 1% +15.19% (p=0.000 n=9+9) Encode50x20x1M-8 1.73GB/s ± 4% 2.09GB/s ± 4% +20.59% (p=0.000 n=10+9) Encode17x3x16M-8 10.6GB/s ± 1% 11.7GB/s ± 4% +10.12% (p=0.000 n=8+10) ```	7 years ago
Andreas Auernhammer	b97f99766f	add benchmarks for erasure backend (#5084 ) This change adds benchmarks for erasure read/write in different setups.	7 years ago
Andreas Auernhammer	02af37a394	optimize memory allocs during reconstruct (#4964 ) The reedsolomon library now avoids allocations during reconstruction. This change exploits that to reduce memory allocs and GC pressure during healing and reading.	7 years ago
Andreas Auernhammer	7e6b5bdbb7	remove ReadFileWithVerify from StorageAPI (#4947 ) This change removes the ReadFileWithVerify function from the StorageAPI. The ReadFile was basically a redirection to ReadFileWithVerify. This change removes the redirection and moves the logic of ReadFileWithVerify directly into ReadFile. This removes a lot of unnecessary code in all StorageAPI implementations. Fixes #4946 * review: fix doc and typos	7 years ago
Andreas Auernhammer	85fcee1919	erasure: simplify XL backend operations (#4649 ) (#4758 ) This change provides new implementations of the XL backend operations: - create file - read file - heal file Further this change adds table based tests for all three operations. This affects also the bitrot algorithm integration. Algorithms are now integrated in an idiomatic way (like crypto.Hash). Fixes #4696 Fixes #4649 Fixes #4359	7 years ago
Anis Elleuch	af8071c86a	xl: Fix rare freeze after many disk/network errors (#4438 ) xl.storageDisks is sometimes passed to some low-level XL functions. Some disks in xl.storageDisks are set to nil when they encounter errors. This means all elements in xl.storageDisks will be nil after some time, which leads to an unusable XL.	8 years ago
Aditya Manthramurthy	8975da4e84	Add new ReadFileWithVerify storage-layer API (#4349 ) This is an enhancement to the XL/distributed-XL mode. FS mode is unaffected. The ReadFileWithVerify storage-layer call is similar to ReadFile with the additional functionality of performing bit-rot checking. It accepts additional parameters for a hashing algorithm to use and the expected hex-encoded hash string. This patch provides significant performance improvement because: 1. combines the step of reading the file (during erasure-decoding/reconstruction) with bit-rot verification; 2. limits the number of file-reads; and 3. avoids transferring the file over the network for bit-rot verification. ReadFile API is implemented as ReadFileWithVerify with empty hashing arguments. Credits to AB and Harsha for the algorithmic improvement. Fixes #4236.	8 years ago
Bala FA	de204a0a52	Add extensive endpoints validation (#4019 )	8 years ago
Bala FA	1c97dcb10a	Add UTCNow() function. (#3931 ) This patch adds a UTCNow() function which returns the current UTC time. UTCNow() is equivalent to time.Now().UTC()	8 years ago
Harshavardhana	bcc5b6e1ef	xl: Rename getOrderedDisks as shuffleDisks appropriately. (#3796 ) This PR is a readability cleanup - getOrderedDisks as shuffleDisks - getOrderedPartsMetadata as shufflePartsMetadata Distribution is now a second argument instead of being the primary input argument, for brevity. Also change the usage of type casted int64(0); instead rely on direct type reference as `var variable int64` everywhere.	8 years ago
Anis Elleuch	e9394dc22d	xl PutObject: Split object into parts (#3651 ) For faster time-to-first-byte when we try to download a big object	8 years ago
Bala FA	825000bc34	Use humanize constants for KiB, MiB and GiB units. (#3322 )	8 years ago
Harshavardhana	c91d3791f9	heal: Add healing support for bucket, bucket metadata files. (#3252 ) This patch implements healing in general but it is only used as part of quickHeal(). Fixes #3237	8 years ago
Harshavardhana	716316f711	Reduce number of envs and options from command line. (#3230 ) Ref #3229 After review with @abperiasamy we decided to remove all the unnecessary options - MINIO_BROWSER (Implemented as a security feature but now deemed obsolete since even if blocking access to MINIO_BROWSER, s3 API port is open) - MINIO_CACHE_EXPIRY (Defaults to 72h) - MINIO_MAXCONN (No one used this option and we don't test this) - MINIO_ENABLE_FSMETA (Enable FSMETA all the time) Remove --ignore-disks option - this option was introduced when the XL layer would initialize the backend disks and heal them automatically, to keep XL from accidentally using the root partition itself. This behavior has changed: XL no longer automatically initializes `format.json`, and a HEAL is a controlled activity, so ignore-disks is not useful anymore. This change also addresses the problems of our documentation going forward and keeps things simple. This patch reduces the number of options and defaults them to valid known inputs. This patch also serves as a guideline for limiting the many ways to do the same thing.	8 years ago
Harshavardhana	9e2d0ac50b	Move to URL based syntax formatting. (#3092 ) For command line arguments we are currently following - <node-1>:/path ... <node-n>:/path This patch changes this to - http://<node-1>/path ... http://<node-n>/path	8 years ago
Krishna Srinivas	32c3a558e9	distributed-XL: Support to run one minio process per export even on the same machine. (#2999 ) fixes #2983	8 years ago
Harshavardhana	6494b77d41	server: Add more elaborate startup messages. (#2731 ) These messages are based on our prep stage during XL and print more informative messages regarding drive information. This change also does a much needed refactoring.	8 years ago
Harshavardhana	113b93346b	lock: Make some cleanup and move the code around. (#2718 ) This patch just avoids a lot of ifs and inverts some logic.	8 years ago
Harshavardhana	03430d0db8	tests: Add ListBucketHandler tests. (#2701 ) part-3 final fix for #2412	8 years ago
Krishna Srinivas	9358ee011b	logging: Print stack trace in case of errors. fixes #1827	8 years ago
Krishnan Parthasarathi	de67bca211	Move formatting of disks out of object layer initialization (#2572 )	8 years ago
Mohit Agarwal	418921de89	minor cleanup - Reused contains() from utils.go at a couple of places - Cleanup in return statements and boolean checks	8 years ago
Harshavardhana	bccf549463	server: Move all the top level files into cmd folder. (#2490 ) This change mirrors what was done for the 'mc' package to allow for a clean repo and a cleaner github drop in experience.	8 years ago
karthic rao	a3592228f5	bug-fix: fix for tests failure when cache is disabled (#2439 )	8 years ago
karthic rao	70fd38818e	clean up: ineffassign fixes (#2411 )	8 years ago
Harshavardhana	f503ac3db8	XL/Erasure: Make bit-rot verification based on xl.json algo. (#2299 ) Currently `xl.json` saves algorithm information for bit-rot verification. Since the bit-rot algo's can change in the future make sure the erasureReadFile doesn't default to a particular algo. Instead use the checkSumInfo.	8 years ago
Krishna Srinivas	043ddbd834	optimize memory allocation during erasure-read by using temporary buffer pool. (#2259 ) * XL/erasure-read: optimize memory allocation during erasure-read by using temporary buffer pool. With the change the buffer needed during GetObject by erasureReadFile is allocated only once.	8 years ago
Krishna Srinivas	18728a0b59	XL/erasure-read: refactor erasure read and add tests (#2232 )	8 years ago
Krishna Srinivas	897d78d113	erasureReadFile and erasureCreateFile testcases. (#2229 ) * unit-tests: Unit tests for erasureCreateFile and erasureReadFile. * appendFile() should return errXLWriteQuorum. * TestErasureReadFileOffsetLength() tests erasureReadFile() for different offset and lengths. * Fix for the failure seen in the erasure read unit test case. Issue #2227 * Move common erasure setup code to newErasureTestSetup() * Review fixes. Add few more test cases for erasureReadFile.	8 years ago
Krishna Srinivas	b090c7112e	Refactor of xl.PutObjectPart and erasureCreateFile. (#2193 ) * XL: Refactor of xl.PutObjectPart and erasureCreateFile. * GetCheckSum and AddCheckSum methods for xlMetaV1 * Simple unit test case for erasureCreateFile()	8 years ago
Harshavardhana	a4a55bf134	tests: Fix erasure-readfile test formatting.	8 years ago
Krishna Srinivas	d6dfcd0ba7	unit-tests: Unit tests for functions in erasure-readfile.go (#2090 )	8 years ago
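
Note on buffer reuse (commits f21d650ed4 and 043ddbd834): both messages describe reusing buffers through a pool instead of allocating one per call. The following is only a minimal sketch of that general pattern with the standard library's `sync.Pool`; `bufPool` and `encodeMessage` are hypothetical names chosen for illustration and are not taken from the MinIO code base, which pools msgp readers and writers rather than `bytes.Buffer`.

```go
package main

import (
	"bytes"
	"fmt"
	"sync"
)

// bufPool hands out reusable buffers so that each call does not have to
// allocate a fresh one. (Hypothetical name, for illustration only.)
var bufPool = sync.Pool{
	New: func() interface{} { return new(bytes.Buffer) },
}

// encodeMessage is a hypothetical stand-in for serializing an internode
// message into a pooled buffer.
func encodeMessage(payload string) string {
	buf := bufPool.Get().(*bytes.Buffer)
	buf.Reset() // a pooled buffer may still hold data from its previous use
	defer bufPool.Put(buf)

	buf.WriteString("msg:")
	buf.WriteString(payload)
	// String copies the bytes, so the result stays valid after Put.
	return buf.String()
}

func main() {
	fmt.Println(encodeMessage("hello"))
	fmt.Println(encodeMessage("world"))
}
```

The `Reset` call before use is the important design point: a pool gives no guarantee about the state of the object it returns.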

12 Commits (f9c5636c2dfc45dced8b8bc0c856800c7093f06b)
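
Note on ReadFileWithVerify (commit 8975da4e84): the message describes combining the file read with bit-rot verification, given a hashing algorithm and an expected hex-encoded hash, and treating empty hashing arguments as a plain read. A rough sketch of that idea follows. It assumes SHA-256 as the hash, and `readAndVerify`, `readStringVerified`, `hexSum` and `errHashMismatch` are hypothetical names; none of this is the actual storage-layer API.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"errors"
	"fmt"
	"io"
	"strings"
)

// errHashMismatch is a hypothetical error for a failed bit-rot check.
var errHashMismatch = errors.New("bit-rot check failed: hash mismatch")

// readAndVerify reads r to the end while hashing the same bytes in a single
// pass (SHA-256 is an assumption made for this sketch). An empty expectedHex
// skips verification, mirroring "ReadFile is ReadFileWithVerify with empty
// hashing arguments".
func readAndVerify(r io.Reader, expectedHex string) ([]byte, error) {
	h := sha256.New()
	data, err := io.ReadAll(io.TeeReader(r, h))
	if err != nil {
		return nil, err
	}
	if expectedHex == "" {
		return data, nil
	}
	if hex.EncodeToString(h.Sum(nil)) != expectedHex {
		return nil, errHashMismatch
	}
	return data, nil
}

// readStringVerified is a convenience wrapper used by the demo below.
func readStringVerified(s, expectedHex string) ([]byte, error) {
	return readAndVerify(strings.NewReader(s), expectedHex)
}

// hexSum returns the hex-encoded SHA-256 of b.
func hexSum(b []byte) string {
	sum := sha256.Sum256(b)
	return hex.EncodeToString(sum[:])
}

func main() {
	want := hexSum([]byte("shard data"))

	data, err := readStringVerified("shard data", want)
	fmt.Println(string(data), err)

	_, err = readStringVerified("shard dat4", want) // simulated corruption
	fmt.Println(err)
}
```

Hashing through `io.TeeReader` means the data is read once, which is the first of the three performance points listed in the commit message.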