minio

Commit Graph

Author	SHA1	Message	Date
Harshavardhana	0104af6bcc	delayed locks until we have started reading the body (#10474 ) This is to ensure that Go contexts work properly, after some interesting experiments I found that Go net/http doesn't cancel the context when Body is non-zero and hasn't been read till EOF. The following gist explains this, this can lead to pile up of go-routines on the server which will never be canceled and will die at a really later point in time, which can simply overwhelm the server. https://gist.github.com/harshavardhana/c51dcfd055780eaeb71db54f9c589150 To avoid this refactor the locking such that we take locks after we have started reading from the body and only take locks when needed. Also, remove contextReader as it's not useful, doesn't work as expected context is not canceled until the body reaches EOF so there is no point in wrapping it with context and putting a `select {` on it which can unnecessarily increase the CPU overhead. We will still use the context to cancel the lockers etc. Additional simplification in the locker code to avoid timers as re-using them is a complicated ordeal avoid them in the hot path, since locking is very common this may avoid lots of allocations.	4 years ago
Harshavardhana	34ea1d2167	fix: return correct error code for MetadataTooLarge (#10470 ) fixes #10469	4 years ago
Harshavardhana	9d95937018	update KMS docs indicating deprecation of AUTO_ENCRYPTION env	4 years ago
Klaus Post	fa01e640f5	Continous healing: add optional bitrot check (#10417 )	4 years ago
Harshavardhana	f355374962	add support for configurable remote transport deadline (#10447 ) configurable remote transport timeouts for some special cases where this value needs to be bumped to a higher value when transferring large data between federated instances.	4 years ago
Harshavardhana	bda0fe3150	fix: allow LDAP identity to support form body POST (#10468 ) similar to other STS APIs	4 years ago
Harshavardhana	b70995dd60	Revert "ilm: Remove null version if not latest with proper config (#10467 )" This reverts commit `4b6264da7d`.	4 years ago
Anis Elleuch	4b6264da7d	ilm: Remove null version if not latest with proper config (#10467 )	4 years ago
Harshavardhana	48919de301	fix: for defer'ed deleteObject use internal context (#10463 )	4 years ago
Harshavardhana	eb2934f0c1	simplify webhook DNS further generalize for gateway (#10448 ) continuation of the changes from `eaaf05a7cc` this further simplifies, enables this for gateway deployments as well	4 years ago
Klaus Post	b7438fe4e6	Copy metadata before spawning goroutine + prealloc maps (#10458 ) In `(*cacheObjects).GetObjectNInfo` copy the metadata before spawning a goroutine. Clean up a few map[string]string copies as well, reducing allocs and simplifying the code. Fixes #10426	4 years ago
Anis Elleuch	ce6cef6855	erasure: Call Walk() from all disks (#10445 ) It does not make sense to call Walk() in only N/2 disks and then requires N/2 quorum, just keep it N/2+1 The commit fixes this behavior.	4 years ago
Klaus Post	493c714663	Remove erasureSets and erasureObjects from ObjectLayer (#10442 )	4 years ago
Harshavardhana	e959c5d71c	fix: server panic in FS mode (#10455 ) fixes #10454	4 years ago
Harshavardhana	4a2928eb49	generate missing object delete bucket notifications (#10449 ) fixes #10381	4 years ago
Anis Elleuch	af88772a78	lifecycle: NoncurrentVersionExpiration considers noncurrent version age (#10444 ) From https://docs.aws.amazon.com/AmazonS3/latest/dev/intro-lifecycle-rules.html#intro-lifecycle-rules-actions ``` When specifying the number of days in the NoncurrentVersionTransition and NoncurrentVersionExpiration actions in a Lifecycle configuration, note the following: It is the number of days from when the version of the object becomes noncurrent (that is, when the object is overwritten or deleted), that Amazon S3 will perform the action on the specified object or objects. Amazon S3 calculates the time by adding the number of days specified in the rule to the time when the new successor version of the object is created and rounding the resulting time to the next day midnight UTC. For example, in your bucket, suppose that you have a current version of an object that was created at 1/1/2014 10:30 AM UTC. If the new version of the object that replaces the current version is created at 1/15/2014 10:30 AM UTC, and you specify 3 days in a transition rule, the transition date of the object is calculated as 1/19/2014 00:00 UTC. ```	4 years ago
Harshavardhana	9109148474	add support for new UA values for update an check (#10451 )	4 years ago
Nitish Tiwari	eaaf05a7cc	Add Kubernetes operator webook server as DNS target (#10404 ) This PR adds a DNS target that ensures to update an entry into Kubernetes operator when a bucket is created or deleted. See minio/operator#264 for details. Co-authored-by: Harshavardhana <harsha@minio.io>	4 years ago
Harshavardhana	958661cbb5	skip subdomain from bucket DNS which start with `minio.domain` (#10390 ) extend host matcher to reject the host match	4 years ago
Harshavardhana	6a0372be6c	cleanup tmpDir any older entries automatically just like multipart (#10439 ) also consider multipart uploads, temporary files in `.minio.sys/tmp` as stale beyond 24hrs and clean them up automatically	4 years ago
Harshavardhana	c13afd56e8	Remove MaxConnsPerHost settings to avoid potential hangs (#10438 ) MaxConnsPerHost can potentially hang a call without any way to timeout, we do not need this setting for our proxy and gateway implementations instead IdleConn settings are good enough. Also ensure to use NewRequestWithContext and make sure to take the disks offline only for network errors. Fixes #10304	4 years ago
Harshavardhana	96997d2b21	allow ctrl+c to be consistent at early startup (#10435 ) fixes #10431	4 years ago
Klaus Post	86a3319d41	Ignore config values from unknown subsystems (#10432 )	4 years ago
Harshavardhana	9f60e84ce1	always copy UserDefined metadata map (#10427 ) fixes #10426	4 years ago
Harshavardhana	572b1721b2	set max API requests automatically based on RAM (#10421 )	4 years ago
Harshavardhana	b0e1d4ce78	re-attach offline drive after new drive replacement (#10416 ) inconsistent drive healing when one of the drive is offline while a new drive was replaced, this change is to ensure that we can add the offline drive back into the mix by healing it again.	4 years ago
Harshavardhana	eb19c8af40	Bump response header timeout for proxying list request (#10420 )	4 years ago
Klaus Post	2d58a8d861	Add storage layer contexts (#10321 ) Add context to all (non-trivial) calls to the storage layer. Contexts are propagated through the REST client. - `context.TODO()` is left in place for the places where it needs to be added to the caller. - `endWalkCh` could probably be removed from the walkers, but no changes so far. The "dangerous" part is that now a caller disconnecting will propagate down, so a "delete" operation will now be interrupted. In some cases we might want to disconnect this functionality so the operation completes if it has started, leaving the system in a cleaner state.	4 years ago
poornas	0037951b6e	improve error message when remote target missing (#10412 )	4 years ago
Andreas Auernhammer	fbd1c5f51a	certs: refactor cert manager to support multiple certificates (#10207 ) This commit refactors the certificate management implementation in the `certs` package such that multiple certificates can be specified at the same time. Therefore, the following layout of the `certs/` directory is expected: ``` certs/ │ ├─ public.crt ├─ private.key ├─ CAs/ // CAs directory is ignored │ │ │ ... │ ├─ example.com/ │ │ │ ├─ public.crt │ └─ private.key └─ foobar.org/ │ ├─ public.crt └─ private.key ... ``` However, directory names like `example.com` are just for human readability/organization and don't have any meaning w.r.t whether a particular certificate is served or not. This decision is made based on the SNI sent by the client and the SAN of the certificate. *** The `Manager` will pick a certificate based on the client trying to establish a TLS connection. In particular, it looks at the client hello (i.e. SNI) to determine which host the client tries to access. If the manager can find a certificate that matches the SNI it returns this certificate to the client. However, the client may choose to not send an SNI or tries to access a server directly via IP (`https://<ip>:<port>`). In this case, we cannot use the SNI to determine which certificate to serve. However, we also should not pick "the first" certificate that would be accepted by the client (based on crypto. parameters - like a signature algorithm) because it may be an internal certificate that contains internal hostnames. We would disclose internal infrastructure details doing so. Therefore, the `Manager` returns the "default" certificate when the client does not specify an SNI. The default certificate the top-level `public.crt` - i.e. `certs/public.crt`. This approach has some consequences: - It's the operator's responsibility to ensure that the top-level `public.crt` does not disclose any information (i.e. hostnames) that are not publicly visible. However, this was the case in the past already. - Any other `public.crt` - except for the top-level one - must not contain any IP SAN. The reason for this restriction is that the Manager cannot match a SNI to an IP b/c the SNI is the server host name. The entire purpose of SNI is to indicate which host the client tries to connect to when multiple hosts run on the same IP. So, a client will not set the SNI to an IP. If we would allow IP SANs in a lower-level `public.crt` a user would expect that it is possible to connect to MinIO directly via IP address and that the MinIO server would pick "the right" certificate. However, the MinIO server cannot determine which certificate to serve, and therefore always picks the "default" one. This may lead to all sorts of confusing errors like: "It works if I use `https:instance.minio.local` but not when I use `https://10.0.2.1`. These consequences/limitations should be pointed out / explained in our docs in an appropriate way. However, the support for multiple certificates should not have any impact on how deployment with a single certificate function today. Co-authored-by: Harshavardhana <harsha@minio.io>	4 years ago
Harshavardhana	1c6781757c	add missing ListBucketVersions from policy actions (#10414 )	4 years ago
Harshavardhana	b4e3956e69	update KES docs to talk about 'mc encrypt' command (#10400 ) add a deprecation notice for KMS_AUTO_ENCRYPTION	4 years ago
Harshavardhana	8a291e1dc0	Cluster healthcheck improvements (#10408 ) - do not fail the healthcheck if heal status was not obtained from one of the nodes, if many nodes fail then report this as a catastrophic error. - add "x-minio-write-quorum" value to match the write tolerance supported by server. - admin info now states if a drive is healing where madmin.Disk.Healing is set to true and madmin.Disk.State is "ok"	4 years ago
Klaus Post	650dccfa9e	cache: Only start at high watermark (#10403 ) Currently, cache purges are triggered as soon as the low watermark is exceeded. To reduce IO this should only be done when reaching the high watermark. This simplifies checks and reduces all calls for a GC to go through `dcache.diskSpaceAvailable(size)`. While a comment claims that `dcache.triggerGC <- struct{}{}` was non-blocking I don't see how that was possible. Instead, we add a 1 size to the queue channel and use channel semantics to avoid blocking when a GC has already been requested. `bytesToClear` now takes the high watermark into account to it will not request any bytes to be cleared until that is reached.	4 years ago
Andreas Auernhammer	9a703befe6	crypto: reduce retry delay when retrying KES requests (#10394 ) This commit reduces the retry delay when retrying a request to a KES server by: - reducing the max. jitter delay from 3s to 1.5s - skipping the random delay when there are more KES endpoints available. If there are more KES endpoints we can directly retry to the request by sending it to the next endpoint - as pointed out by @krishnasrinivas	4 years ago
Klaus Post	9a1615768d	Fix flaky TestXLStorageVerifyFile (#10398 ) `TestXLStorageVerifyFile` would fail 1 in 256 if the first random character was 'a'. Instead write 256 bytes which has 1 in 256^256 probability.	4 years ago
Harshavardhana	37da0c647e	fix: delete marker compatibility behavior for suspended bucket (#10395 ) - delete-marker should be created on a suspended bucket as `null` - delete-marker should delete any pre-existing `null` versioned object and create an entry `null`	4 years ago
Harshavardhana	2acb530ccd	update rulesguard with new rules (#10392 ) Co-authored-by: Nitish Tiwari <nitish@minio.io> Co-authored-by: Praveen raj Mani <praveen@minio.io>	4 years ago
Klaus Post	3e1fb17b70	heal: Check for truncated files (#10399 ) When checking parts we already do a stat for each part. Since we have the on disk size check if it is at least what we expect. When checking metadata check if metadata is 0 bytes.	4 years ago
Klaus Post	a89d6b8e3d	Fix common Windows failure (#10397 ) The `getNonLoopBackIP` may grab an IP from an interface that doesn't allow binding (on Windows), so this test consistently fails. We exclude that specific error.	4 years ago
Klaus Post	1c085f7d1a	Fix crash on Windows when crawling (#10385 ) * readDirN: Check if file is directory `syscall.FindNextFile` crashes if the handle is a file. `errFileNotFound` matches 'unix' functionality: `d19b434ffc/cmd/os-readdir_unix.go (L106)` Fixes #10384	4 years ago
Harshavardhana	4b6585d249	support 'ldap:user' variable replacement properly (#10391 ) also update `ldap.go` examples with latest minio-go changes Fixes #10367	4 years ago
Harshavardhana	9ffad7fceb	discard empty endpoint in crypto kes introduced in `18725679c4`	4 years ago
Andreas Auernhammer	18725679c4	crypto: allow multiple KES endpoints (#10383 ) This commit addresses a maintenance / automation problem when MinIO-KES is deployed on bare-metal. In orchestrated env. the orchestrator (K8S) will make sure that `n` KES servers (IPs) are available via the same DNS name. There it is sufficient to provide just one endpoint.	4 years ago
Anis Elleuch	ba8a8ad818	ListObjectsV1 requests unnecessarily fail with offline nodes (#10386 ) ListObjectsV1 requests are actually redirected to a specific node, depending on the bucket name. The purpose of this behavior was to optimize listing. However, the current code sends a Bad Gateway error if the target node is offline, which is a bad behavior because it means that the list request will fail, although this is unnecessary since we can still use the current node to list as well (the default behavior without using proxying optimization) Currently, you can see mint fails when there is one offline node, after this PR, mint will always succeed.	4 years ago
Harshavardhana	102ad60dee	simplify removing temporary files (#10389 )	4 years ago
Gaige B Paulsen	859ef52886	update for smartos build (solaris too) (#10378 )	4 years ago
Harshavardhana	e730da1438	fix: referesh JWKS public keys upon failure (#10368 ) fixes #10359	4 years ago
Anis Elleuch	46ee8659b4	fix write quorum calculation for bucket operations (#10364 ) When the number of disks is odd, the calculation of quorum for bucket operations were not correct, fix it.	4 years ago
Harshavardhana	a359e36e35	tolerate listing with only readQuorum disks (#10357 ) We can reduce this further in the future, but this is a good value to keep around. With the advent of continuous healing, we can be assured that namespace will eventually be consistent so we are okay to avoid the necessity to a list across all drives on all sets. Bonus Pop()'s in parallel seem to have the potential to wait too on large drive setups and cause more slowness instead of gaining any performance remove it for now. Also, implement load balanced reply for local disks, ensuring that local disks have an affinity for - cleanupStaleMultipartUploads()	4 years ago

1 2 3 4 5 ...

2877 Commits (0104af6bccce0f8fce537a713c424f4aed2041f0)