minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	b16781846e	allow server to start even with corrupted/faulty disks (#10175)	4 years ago
Harshavardhana	a880283593	Send the lower level error directly from GetDiskID() (#10095). This is to quickly detect errors such as a corrupted disk format, and to keep the disk online in such scenarios so that requests fail appropriately.	4 years ago
Harshavardhana	17747db93f	fix: support healing older content (#10076). This PR adds support for healing older content, i.e. from 2 years or 1 year ago. It also handles other situations where our config was not encrypted yet, and ensures that our listing is consistent and quorum friendly, such that we don't list partial objects.	4 years ago
Harshavardhana	187c3f62df	fix: heal replaced drives properly (#10069). Healing was not working properly when drives were replaced, due to the error check in the root disk calculation; this PR fixes that behavior. It also adds a fix for missing metadata entries from .minio.sys as part of disk healing, and adds code to ignore and print more context-sensitive errors for better debugging. This PR is a continuation of the fix in `7b14e9b660`.	4 years ago
Anis Elleuch	fa211f6a10	heal: Fix healing delete markers (#9989)	4 years ago
Anis Elleuch	c2f7cd1104	Consider errFileVersionNotFound during healing assessment (#9977). Healing an object which has multiple versions was not working because the healing code did not treat the errFileVersionNotFound error as a case that needs healing.	4 years ago
Harshavardhana	4915433bd2	Support bucket versioning (#9377). Implement a new xl.json 2.0.0 format to support this; it moves the entire marshaling logic to the POSIX layer, and the top layer always consumes a common FileInfo construct, which simplifies the metadata reads. Implement list object versions. Migrate to siphash from crchash for new deployments for object placements. Fixes #2111	4 years ago
Harshavardhana	62b1da3e2c	fix offline disk calculation (#9801). Current code was relying on globalEndpoints as the secondary source of truth to obtain the missing endpoints list when a disk is offline. This is problematic: (1) there is no way to know whether the total of endpoints returned by getDisks() is the same as the list in globalEndpoints and belongs to a particular set; (2) there is no order guarantee, as getDisks() is ordered as per format.json while globalEndpoints may not be, so we can potentially end up including incorrect endpoints. To fix this, bring in getEndpoints(), just like getDisks(), so that consistently ordered endpoints are always available and the returned values are consistent with what each erasure set would observe.	5 years ago
Anis Elleuch	c045ae15e7	fix: avoid undoing bucket creation and return the first err instead (#9578)	5 years ago
Harshavardhana	4c9de098b0	heal buckets during init and make sure to wait on quorum (#9526). Heal buckets properly during expansion, and make sure to wait for the quorum properly such that healing can be retried.	5 years ago
Klaus Post	073aac3d92	add data update tracking using bloom filter (#9208). By monitoring PUT/DELETE and heal operations it is possible to track changed paths and keep a bloom filter for this data. This can help prioritize paths to scan. The bloom filter can identify paths that have not changed, and the few collisions will only result in a marginal extra workload. This can be implemented on either a bucket+(1 prefix level) with reasonable performance. The bloom filter is set to have a false positive rate of 1% at 1M entries. A bloom table of this size is about 2500 bytes when serialized. To not force a full scan of all paths that have changed, cycle bloom filters would need to be kept, so we guarantee that dirty paths have been scanned within cycle runs. Until cycle bloom filters have been collected, all paths are considered dirty.	5 years ago
Anis Elleuch	2eeb0e6a0b	heal: Fix heal buckets result reporting (#9397). healBucket() was not properly collecting results after healing buckets. This commit adds the After drives information correctly.	5 years ago
Harshavardhana	f44cfb2863	use GlobalContext whenever possible (#9280). This change is made throughout the codebase to ensure that all codepaths honor GlobalContext.	5 years ago
Harshavardhana	30707659b5	[feature] allow for an odd number of erasure packs (#9221). Too many deployments come up with an odd number of hosts or drives; to facilitate even distribution among those setups, allow for odd and prime number based packs.	5 years ago
Harshavardhana	ba52a925f9	fix: delete dangling directories properly (#9222)	5 years ago
Anis Elleuch	db2155551a	heal: Pass scan mode to HealObjects to deep scan full quorum objects (#9159). As a healing optimization, HealObjects() avoids sending an object to the background healing subsystem when the object is present on all disks. However, HealObjects() should have checked the scan type: if it is deep, always pass the object to the healing subsystem.	5 years ago
Harshavardhana	e3b44c3829	Remove partName, partETag requirement (#9044). This is a precursor change before versioning; it removes/deprecates the requirement of remembering partName and partETag, which are not useful after a multipart transaction has finished. This PR reduces the overall size of the backend JSON for large file uploads.	5 years ago
Harshavardhana	cf37c7997e	Heal bucket only on missing drives in quorum (#8883). MakeVol shouldn't be called in heal bucket when the bucket doesn't really exist in quorum.	5 years ago
Nitish Tiwari	3df7285c3c	Add Support for Cache and S3 related metrics in Prometheus endpoint (#8591). This PR adds support for the metrics below: Cache Hit Count; Cache Miss Count; Data served from Cache (in Bytes); Bytes received from AWS S3; Bytes sent to AWS S3; Number of requests sent to AWS S3. Fixes #8549	5 years ago
Harshavardhana	fb43d64dc3	Fix healing on multiple zones (#8555). It is expected that in zone healing the underlying callers return appropriate errors.	5 years ago
Harshavardhana	347b29d059	Implement bucket expansion (#8509)	5 years ago
Harshavardhana	e9b2bf00ad	Support MinIO to be deployed on more than 32 nodes (#8492). This PR moves locking from a global entity into a more localized set level entity, allowing for locks to be held only on the resources which are writing to a collection of disks rather than at a global level. In this process this PR also removes the top-level limit of 32 nodes, allowing an unlimited number of nodes. This is a precursor change before bringing in bucket expansion.	5 years ago
Anis Elleuch	8cc5ecec23	xl: Fix locking in xl HealObject (#8455). Move locking to the correct location, before loading object data.	5 years ago
Harshavardhana	68a519a468	Use errgroups instead of sync.WaitGroup as needed (#8354)	5 years ago
Harshavardhana	ff5bf51952	admin/heal: Fix deep healing to heal objects under more conditions (#8321). Heal if the part.1 is truncated from its original size; heal if the part.1 fails while being verified in between; heal if the part.1 fails while being at a certain offset. Other cleanups include making sure to flush the HTTP responses properly from storage-rest-server, and avoiding 'defer' to improve call latency: 'defer' incurs latency, so avoid it in our hot paths such as storage-rest handlers. Fixes #8319	5 years ago
Harshavardhana	53e4887e02	Simplify and cleanup metadata r/w functions (#8146)	5 years ago
Harshavardhana	e6d8e272ce	Use const slashSeparator instead of "/" everywhere (#8028)	5 years ago
Anis Elleuch	c5ac901e8d	xl: Fix healing empty directories (#8013). After some extensive refactors, it turned out empty directories are not healed and heal status is also not reported correctly. This commit fixes it and adds the appropriate unit tests.	5 years ago
Anis Elleuch	000a60f238	xl: Heal empty parts (#7860). posix.VerifyFile() doesn't know how to check if a file is corrupted if that file is empty. We do have the part size in xl.json, so we pass it to VerifyFile to return an error so healing empty parts can work properly.	5 years ago
Krishna Srinivas	58d90ed73c	Avoid network transfer for bitrot verification during healing (#7375)	5 years ago
Krishna Srinivas	338e9a9be9	Put object client disconnect (#7824). Fail putObject and postpolicy in case the client prematurely disconnects. Use the request's context to cancel lock requests on client disconnects.	5 years ago
Anis Elleuch	27ef1262bf	xl: Use random UUID during complete multipart upload (#7527). One user has seen the following error log: API: CompleteMultipartUpload(bucket=vertica, object=perf-dss-v03/cc2/02596813aecd4e476d810148586c2a3300d00000013557ef_0.gt) Time: 15:44:07 UTC 04/11/2019 RequestID: 159475EFF4DEDFFB RemoteHost: 172.26.87.184 UserAgent: vertica-v9.1.1-5 Error: open /data/.minio.sys/tmp/100bb3ec-6c0d-4a37-8b36-65241050eb02/xl.json: file exists 1: cmd/xl-v1-metadata.go:448:cmd.writeXLMetadata() 2: cmd/xl-v1-metadata.go:501:cmd.writeUniqueXLMetadata.func1() This can happen when CompleteMultipartUpload fails with write quorum: the S3 client will retry (since a write quorum failure is a 500 HTTP response), however the second call of CompleteMultipartUpload will fail because it doesn't truly use a random UUID under the .minio.sys/tmp/ directory but picks the upload ID. This commit fixes the behavior to choose a random UUID for generating xl.json.	6 years ago
Harshavardhana	f767a2538a	Optimize listing with leaf check offloaded to posix (#7541). Other listing optimizations include: remove double sorting while filtering object entries; improve error message when upload-id is not in quorum; use jsoniter for full unmarshal json, instead of gjson; remove unused code.	6 years ago
kannappanr	5ecac91a55	Replace Minio refs in docs with MinIO and links (#7494)	6 years ago
Harshavardhana	c90999df98	Valid if bucket names are internal (#7476). This commit fixes a privilege escalation issue against the S3 and web handlers. An authenticated IAM user can read from or write to the internal '.minio.sys' bucket by simply sending a properly signed S3 GET or PUT request. Further, the user can read from or write to the internal '.minio.sys' bucket using the 'Upload'/'Download'/'DownloadZIP' API by sending a "browser" request authenticated with its JWT token.	6 years ago
Harshavardhana	4a698c731b	HealObjects should remove objects without quorum (#7407). This PR adds a way to list objects without quorum such that they can be purged by `mc admin heal --remove`.	6 years ago
Anis Elleuch	facbd653ba	Add normal/deep type of heal scanning (#7251). Healing scan used to read all object parts to check for bitrot checksum. This commit adds a quicker way of healing scan by only checking whether parts are actually present on disks.	6 years ago
Harshavardhana	082f777281	Revamp bucket metadata healing (#7208). Bucket metadata healing in the current code was executed multiple times, each time for a given set. Bucket metadata, just like objects, is hashed in accordance with its name on any given set; to allow hashing to play a role we should let the top level code decide where to navigate. Current code also had 3 bucket metadata files hardcoded, whereas we should make it generic by listing and navigating the .minio.sys to heal such objects. We also had another bug where, due to isObjectDangling changes without pre-existing bucket metadata files, we were erroneously reporting it as grey/corrupted objects. This PR fixes all of the above items.	6 years ago
Harshavardhana	30135eed86	Redo how to handle stale dangling files (#7171). foo.CORRUPTED should never be created, because when multiple sets are involved we would hash the file to a wrong location; this PR removes the code, but allows DeleteBucket() to work properly to delete dangling buckets/objects. It also adds another option to healing where a user needs to specify `--remove` such that all dangling objects will be deleted with user confirmation.	6 years ago
Krishna Srinivas	b18c0478e7	Only heal on disks where we are sure that healing is needed (#7148)	6 years ago
Anis Elleuch	2d9860e875	heal: Fix healing empty directories (#7154). This commit fixes the computation of Before/After healing state for empty directories. Issues before the commit: the Before state doesn't reflect the real status (no StatVol() called); for any MakeVol() error, healObjectDir is exited directly, which is wrong.	6 years ago
kannappanr	d3553f8dfc	Bucket Heal: Do not add empty endpoint entry (#7172). Currently during a heal of a bucket, if one disk is offline an empty endpoint entry is added. Then another entry with the missing endpoint is also added. This results in more entries than disks being added. Code that adds the empty endpoint has been removed.	6 years ago
Krishna Srinivas	51ec61ee94	Fix healing whole file bitrot (#7123). Use 0-byte file for bitrot verification of whole-file-bitrot files; also pass the right checksum information for bitrot verification. Copy xlMeta info from latest meta except []checksums and []Parts while healing.	6 years ago
Krishna Srinivas	98c950aacd	Streaming bitrot verification support (#7004)	6 years ago
Harshavardhana	bfb505aa8e	Refactor logging in more Go idiomatic style (#6816). This refactor brings a change which allows targets to be added in a cleaner way, and audit is now moved out. This PR also simplifies the logger dependency for auditing.	6 years ago
Harshavardhana	8491a29ec3	Fix healing bucket properly (#6716). Bucket should be healed properly if it partially exists on only one set, since a bucket is common to all sets. Fixes #6710	6 years ago
Harshavardhana	223967fd32	Return always a default heal item upon unexpected error (#6556). Never return an empty result item even upon error; choose all the default values and, based on the errors, make sure to send the right result reply.	6 years ago
Harshavardhana	b4772849f9	Heal recursively all entries in config/ prefix (#6545). This ensures that we heal all entries in the config/ prefix: IAM and STS related files are being introduced in PR #6168, and this change ensures that we heal all of them properly, not just `config.json`.	6 years ago
Harshavardhana	aebfceeafb	Heal backend configuration file (#6532). Fixes #6461	6 years ago
Krishna Srinivas	52f6d5aafc	Rename of structs and methods (#6230). Rename of ErasureStorage to Erasure (and rename of related variables and methods).	6 years ago
