minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	d55f4336ae	preserve context per request for local locks (#9828 ) In the Current bug we were re-using the context from previously granted lockers, this would lead to lock timeouts for existing valid read or write locks, leading to premature timeout of locks. This bug affects only local lockers in FS or standalone erasure coded mode. This issue is rather historical as well and was present in lsync for some time but we were lucky to not see it. Similar changes are done in dsync as well to keep the code more familiar Fixes #9827	5 years ago
ethan ho	535efd34a0	Fix peer server update failure (#9824 ) When updating all servers following the constructions of mc update, only the endpoint server will be updated successfully. All the other peer servers' updating failed due to the error below: -------------------------------------------------------------------------- parsing time "2006-01-02T15:04:05Z07:00" as "<release version>": cannot parse "-01-02T15:04:05Z07:00" as "0-" --------------------------------------------------------------------------	5 years ago
Harshavardhana	4915433bd2	Support bucket versioning (#9377 ) - Implement a new xl.json 2.0.0 format to support, this moves the entire marshaling logic to POSIX layer, top layer always consumes a common FileInfo construct which simplifies the metadata reads. - Implement list object versions - Migrate to siphash from crchash for new deployments for object placements. Fixes #2111	5 years ago
Klaus Post	43d6e3ae06	merge object lifecycle checks into usage crawler (#9579 )	5 years ago
kannappanr	225b812b5e	Update minio-go library to latest (#9813 )	5 years ago
Harshavardhana	96ed0991b5	fix: optimize IAM users load, add fallback (#9809 ) Bonus fix, load service accounts properly when service accounts were generated with LDAP	5 years ago
Harshavardhana	a42df3d364	Allow idiomatic usage of middlewares in gorilla/mux (#9802 ) Historically due to lack of support for middlewares we ended up writing wrapped handlers for all middlewares on top of the gorilla/mux, this causes multiple issues when we want to let's say - Overload r.Body with some custom implementation to track the incoming Reads() - Add other sort of top level checks to avoid DDOSing the server with large incoming HTTP bodies. Since 1.7.x release gorilla/mux provides proper use of middlewares, which are honored by the muxer directly. This makes sure that Go can honor its own internal ServeHTTP(w, r) implementation where Go net/http can wrap into its own customer readers. This PR as a side-affect fixes rare issues of client hangs which were reported in the wild but never really understood or fixed in our codebase. Fixes #9759 Fixes #7266 Fixes #6540 Fixes #5455 Fixes #5150 Refer https://github.com/boto/botocore/pull/1328 for one variation of the same issue in #9759	5 years ago
Harshavardhana	ff94b1b0a9	isEndpointConnected should take local disk inputs (#9803 ) PR #9801 while it is correct, the loop isEndpointConnected() was changed to rely on endpoint.String() which has the host information as well, which is not correct value as input to detect if the disk is down or up, if endpoint is local use its local path value instead.	5 years ago
Andreas Auernhammer	b1845c6c83	kes: try to auto. create master key if not present (#9790 ) This commit changes the data key generation such that if a MinIO server/nodes tries to generate a new DEK but the particular master key does not exist - then MinIO asks KES to create a new master key and then requests the DEK again. From now on, a SSE-S3 master key must not be created explicitly via: `kes key create <key-name>`. Instead, it is sufficient to just set the env. var. ``` export MINIO_KMS_KES_KEY_NAME=<key-name> ``` However, the MinIO identity (mTLS client certificate) must have the permission to access the `/v1/key/create/` API. Therefore, KES policy for MinIO must look similar to: ``` [ /v1/key/create/<key-name-pattern> /v1/key/generate/<key-name-pattern> /v1/key/decrypt/<key-name-pattern> ] ``` However, in our guides we already suggest that. See e.g.: https://github.com/minio/kes/wiki/MinIO-Object-Storage#kes-server-setup *** The ability to create master keys on request may also be necessary / useful in case of SSE-KMS.	5 years ago
Harshavardhana	62b1da3e2c	fix offline disk calculation (#9801 ) Current code was relying on globalEndpoints as the source of secondary truth to obtain the missing endpoints list when the disk is offline, this is problematic - there is no way to know if the getDisks() returned endpoints total is same as the ones list of globalEndpoints and it belongs to a particular set. - there is no order guarantee as getDisks() is ordered as per format.json, globalEndpoints may not be, so potentially end up including incorrect endpoints. To fix this bring getEndpoints() just like getDisks() to ensure that consistently ordered endpoints are always available for us to ensure that returned values are consistent with what each erasure set would observe.	5 years ago
poornas	d26b24f670	avoid storing X-Amz-Tagging-Directive in metadata (#9800 )	5 years ago
kannappanr	2c372a9894	Send Partscount only when partnumber is specified (#9793 ) Fixes #9789	5 years ago
poornas	3d3b75fb8d	Avoid overwriting object tags when changing lock (#9794 )	5 years ago
Klaus Post	142b057be8	Check object names on windows (#9798 ) Uploading files with names that could not be written to disk would result in "reduce your request" errors returned. Instead check explicitly for disallowed characters and reject files with `Object name contains unsupported characters.`	5 years ago
Harshavardhana	4790868878	allow background IAM load to speed up startup (#9796 ) Also fix healthcheck handler to run success only if object layer has initialized fully for S3 API access call.	5 years ago
Harshavardhana	342ade03f6	deprecate listDir usage for healing (#9792 ) listDir was incorrectly used for healing which is slower, instead use Walk() to heal the entire set.	5 years ago
P R	9407dbf387	display proper used space based on disk usage (#9551 ) Fixes #9346	5 years ago
Harshavardhana	423aeb0d81	allow large buffer to list more entries per directory (#9785 )	5 years ago
Anis Elleuch	790323ac37	lifecycle: Fix object expiration date (#9791 ) re-use PredictExpiryTime() in ComputeAction()	5 years ago
Harshavardhana	febe9cc26a	fix: avoid timer leaks in dsync/lsync (#9781 ) At a customer setup with lots of concurrent calls it can be observed that in newRetryTimer there were lots of tiny alloations which are not relinquished upon retries, in this codepath we were only interested in re-using the timer and use it wisely for each locker. ``` (pprof) top Showing nodes accounting for 8.68TB, 97.02% of 8.95TB total Dropped 1198 nodes (cum <= 0.04TB) Showing top 10 nodes out of 79 flat flat% sum% cum cum% 5.95TB 66.50% 66.50% 5.95TB 66.50% time.NewTimer 1.16TB 13.02% 79.51% 1.16TB 13.02% github.com/ncw/directio.AlignedBlock 0.67TB 7.53% 87.04% 0.70TB 7.78% github.com/minio/minio/cmd.xlObjects.putObject 0.21TB 2.36% 89.40% 0.21TB 2.36% github.com/minio/minio/cmd.(posix).Walk 0.19TB 2.08% 91.49% 0.27TB 2.99% os.statNolog 0.14TB 1.59% 93.08% 0.14TB 1.60% os.(File).readdirnames 0.10TB 1.09% 94.17% 0.11TB 1.25% github.com/minio/minio/cmd.readDirN 0.10TB 1.07% 95.23% 0.10TB 1.07% syscall.ByteSliceFromString 0.09TB 1.03% 96.27% 0.09TB 1.03% strings.(Builder).grow 0.07TB 0.75% 97.02% 0.07TB 0.75% path.(lazybuf).append ```	5 years ago
Praveen raj Mani	2ce2e88adf	Support mTLS Authentication in Webhooks (#9777 )	5 years ago
Harshavardhana	c7599d323b	fix: throw error if symmetry cannot be obtained (#9780 ) For example `{1...17}/{1...52}` symmetrical distribution of drives cannot be obtained - Because 17 is a prime number - Is not divisible by any pre-defined setCounts i.e from 1 to 16	5 years ago
Harshavardhana	d93bdea433	fix remove LDAPPassword from audit logs (#9773 ) the previous fix for #9707 was not correct, fix this properly passing the right filter keys to be filtered from the audit log output. Fixes #9767	5 years ago
Harshavardhana	5e529a1c96	simplify context timeout for readiness (#9772 ) additionally also add CORS support to restrict for specific origin, adds a new config and updated the documentation as well	5 years ago
Harshavardhana	5686a7e273	fix NAS gateway support for policy/notification (#9765 ) Fixes #9764	5 years ago
Harshavardhana	566e0e2048	allow deleting of dropped multiparts (#9753 ) bonus change trigger MRF heal when single offline disk is found, break out early.	5 years ago
Anis Elleuch	3aad09be28	heal: Fix passing healing opts (#9756 ) Manual healing (as background healing) creates a heal task with a possiblity to override healing options, such as deep or normal mode. Use a pointer type in heal opts so nil would mean use the default healing options.	5 years ago
Harshavardhana	f0358acb32	concurrently load bucket metadata (#9749 )	5 years ago
Anis Elleuch	fd0de4ab32	azure: Show better message when credentials are wrong (#9748 )	5 years ago
Anis Elleuch	73a308502f	Relax content-md5 requirement in set encryption handler (#9750 ) aws cli fails to set a bucket encryption configuration to MinIO server. The reason is that aws cli does not send MD5-Content header. It seems that MD5-Content is not required anymore. This commit also returns Not Implemented header early to help mint tests to ignore testing this API in gateway modes.	5 years ago
Anis Elleuch	bd59f150b8	azure: Implement CopyPart API (#9747 )	5 years ago
Harshavardhana	f90422a890	fix prometheus calculation of offline disks per instance (#9744 ) This was a regression introduced in `9baeda7` for prometheus calculation of offline disks which should be local to an instance. fixes #9742	5 years ago
Harshavardhana	8befedef14	simplify FS multipart cleanup (#9740 ) fixes #9671	5 years ago
Nathan Brown	2af3004409	Use registry to check Atime support on Windows (#9741 )	5 years ago
Harshavardhana	38ee40d59c	move to upstream code colinmarc/hdfs (#9738 ) - supports SASL based authentication now - upgrades to new changes in gokrb library - implement force delete feature Fixes #8206	5 years ago
kannappanr	d583f1ac0e	check if container is empty before invoking DeleteContainer (#9733 )	5 years ago
Harshavardhana	2bcb02f628	Avoid '\n' from constant strings (#9737 ) Fixes #9736	5 years ago
Klaus Post	167ddf9c9c	Workaround for Windows Docker Engine 19.03.8 (#9735 ) Add workaround for issue preventing servers from starting on Windows Docker Engine 19.03.8 Fixes #9726	5 years ago
Anton Huck	f833e41e69	IAM: Fix nil panic due to uninit. iamGroupPolicyMap. Fixes #9730 (#9734 )	5 years ago
Harshavardhana	41688a936b	fix: CopyObject behavior on expanded zones (#9729 ) CopyObject was not correctly figuring out the correct destination object location and would end up creating duplicate objects on two different zones, reproduced by doing encryption based key rotation.	5 years ago
Harshavardhana	b2db8123ec	Preserve errors returned by diskInfo to detect disk errors (#9727 ) This PR basically reverts #9720 and re-implements it differently	5 years ago
Harshavardhana	b330c2c57e	Introduce simpler GetMultipartInfo call for performance (#9722 ) Advantages avoids 100's of stats which are needed for each upload operation in FS/NAS gateway mode when uploading a large multipart object, dramatically increases performance for multipart uploads by avoiding recursive calls. For other gateway's simplifies the approach since azure, gcs, hdfs gateway's don't capture any specific metadata during upload which needs handler validation for encryption/compression. Erasure coding was already optimized, additionally just avoids small allocations of large data structure. Fixes #7206	5 years ago
kannappanr	7214a0160a	allow bucket policy to set/removed in NAS gateway (#9706 )	5 years ago
Anis Elleuch	375b79f11b	storage: Implement GetDiskID request in REST server side (#9720 ) GetDiskID() in storage rest client does not really issue a REST request to the remote disk, but returns an in-memory value instead. However, GetDiskID() should return an error when format.json is not found or for other similar issues (unmounted disks, etc..) GetDiskID() is only called when formatting disks and getting storage informatio, hence this commit should not have a performance degradation.	5 years ago
Harshavardhana	3da1869d5e	Avoid double reads on metadata during GetObject() (#9719 ) Overall TTFB can see a dramatic improvement with this change - did not do any benchmark as such but the change itself is self-explanatory	5 years ago
Harshavardhana	7cedc5369d	fix: send valid claims in AuditLogs for browser requests (#9713 ) Additionally also fix STS logs to filter out LDAP password to be sent out in audit logs. Bonus fix handle the reload of users properly by making sure to preserve the newer users during the reload to be not invalidated. Fixes #9707 Fixes #9644 Fixes #9651	5 years ago
Harshavardhana	53aaa5d2a5	Export bucket usage counts as part of bucket metrics (#9710 ) Bonus fixes in quota enforcement to use the new datastructure and use timedValue to cache a value/reload automatically avoids one less global variable.	5 years ago
P R	9d39fb3604	add copyobject tagging replace directive for gateway (#9711 )	5 years ago
Klaus Post	4a007e3767	Prefer local disks when fetching data blocks (#9563 ) If the requested server is part of the set this will always read from the local disk, even if the disk contains a parity shard. In default setup there is a 50% chance that at least one shard that otherwise would have been fetched remotely will be read locally instead. It basically trades RPC call overhead for reed-solomon. On distributed localhost this seems to be fairly break-even, with a very small gain in throughput and latency. However on networked servers this should be a bigger 1MB objects, before: ``` Operation: GET. Concurrency: 32. Hosts: 4. Requests considered: 76257: * Avg: 25ms 50%: 24ms 90%: 32ms 99%: 42ms Fastest: 7ms Slowest: 67ms * First Byte: Average: 23ms, Median: 22ms, Best: 5ms, Worst: 65ms Throughput: * Average: 1213.68 MiB/s, 1272.63 obj/s (59.948s, starting 14:45:44 CEST) ``` After: ``` Operation: GET. Concurrency: 32. Hosts: 4. Requests considered: 78845: * Avg: 24ms 50%: 24ms 90%: 31ms 99%: 39ms Fastest: 8ms Slowest: 62ms * First Byte: Average: 22ms, Median: 21ms, Best: 6ms, Worst: 57ms Throughput: * Average: 1255.11 MiB/s, 1316.08 obj/s (59.938s, starting 14:43:58 CEST) ``` Bonus fix: Only ask for heal once on an object.	5 years ago
Klaus Post	95814359bd	cache disk info to avoid repeated calls (#9682 ) This value is requested on every upload when there are multiple zones. Since this will result in an RPC call to every remote disk this scales quite badly in a distributed setup. Load every 1second interval. 2 servers, localhost only. In large distributed setups much bigger gains can be expected. ``` Operations: 21743 -> 22454 * Average: +3.28% (+0.0 MiB/s) throughput, +3.28% (+11.9) obj/s * Fastest: +3.37% (+0.0 MiB/s) throughput, +3.37% (+13.0) obj/s * 50% Median: +3.03% (+0.0 MiB/s) throughput, +3.03% (+11.2) obj/s * Slowest: +8.03% (+0.0 MiB/s) throughput, +8.03% (+22.8) obj/s ``` For easy management of this a generic helper has been added.	5 years ago

1 2 3 4 5 ...

2633 Commits (d55f4336aeb35dcba49920caa2947e83c27b1292)