Number of found documents: 540
Published from to

Experiment: Cooperative Decision Making via Reinforcement Learning
Berka, Milan
2018 - English
This report inspects cooperative decision making task using reinforcement learning. It serves for comparison with methodology based on fully probabilistic design of decision strategies. Keywords: decision making; reinforcement learning; cooperation Fulltext is available at external website.
Experiment: Cooperative Decision Making via Reinforcement Learning

This report inspects cooperative decision making task using reinforcement learning. It serves for comparison with methodology based on fully probabilistic design of decision strategies.

Berka, Milan
Ústav teorie informace a automatizace, 2018

Balancing Exploitation and Exploration via Fully Probabilistic Design of Decision Policies
Kárný, Miroslav; Hůla, František
2018 - English
Adaptive decision making learns an environment model serving a design of a decision policy. The policy-generated actions influence both the acquired reward and the future knowledge. The optimal policy properly balances exploitation with exploration. The inherent dimensionality\ncurse of decision making under incomplete knowledge prevents the realisation of the optimal design. Keywords: Exploitation; Exploration; Bayesian estimation; Adaptive systems; Fully probabilistic design; Kullback-Leibler divergence; Decision policy; Markov decision process Fulltext is available at external website.
Balancing Exploitation and Exploration via Fully Probabilistic Design of Decision Policies

Adaptive decision making learns an environment model serving a design of a decision policy. The policy-generated actions influence both the acquired reward and the future knowledge. The optimal policy ...

Kárný, Miroslav; Hůla, František
Ústav teorie informace a automatizace, 2018

DCTOOL-A4
Bakule, Lubomír; Papík, Martin; Rehák, Branislav
2018 - English
DCTOOL-A4 report presents draft of a manuscript, which is intended to be submitted for publication. The report provides a novel systematic approach to the analysis of asymptotic stability for output event-triggered uncertain centralized control systems. A class of nonlinear but nominally linear systems possessing unknown time-varying bounded uncertainties with known bounds is considered. Uncertainties are allowed in all system matrices. Original LMI-based suffi cient conditions are derived to guarantee asymptotic stability of closed-loop systems with both static output and observer-based feedback loop under even-triggered control. Both these output feedback strategies are extended to model-based uncertain control systems with\nquantized measurements. A logarithmic quantizer is considered. The Lyapunov-based approach and convex optimization serve as the main methods to derive the asymptotic LMI-based stability conditions. Bounds on the inter-event times to avoid the Zeno-effect are proved for all the cases considered. Finally, feasibility and effi ciency of the proposed strategies is demonstrated by providing numerical examples. Keywords: event-triggered control; networked control systems; large scale complex systems Available at various institutes of the ASCR
DCTOOL-A4

DCTOOL-A4 report presents draft of a manuscript, which is intended to be submitted for publication. The report provides a novel systematic approach to the analysis of asymptotic stability for output ...

Bakule, Lubomír; Papík, Martin; Rehák, Branislav
Ústav teorie informace a automatizace, 2018

DCTOOL-A5
Bakule, Lubomír; Papík, Martin; Rehák, Branislav
2018 - English
DCTOOL-A5 presents draft of a manuscript, which is intended to be submitted for publication. This report presents a new method for the decentralized event-triggered control design for large-scale uncertain systems. The results are formulated and proved in terms of linear matrix inequalities. Two design problems are solved: For interconnected systems without any quantization and for interconnected systems with local logarithmic quantizers. Results are illustrated by an example. Keywords: decentralized event-triggered control; networked control systems; large scale complex systems Available at various institutes of the ASCR
DCTOOL-A5

DCTOOL-A5 presents draft of a manuscript, which is intended to be submitted for publication. This report presents a new method for the decentralized event-triggered control design for large-scale ...

Bakule, Lubomír; Papík, Martin; Rehák, Branislav
Ústav teorie informace a automatizace, 2018

Appearance Acquisition and Analysis of Effect Coatings
Filip, Jiří; Maile, F. J.
2017 - English
Keywords: effect coatings; appearance capturing; polychromatic; particle orientation Fulltext is available at external website.
Appearance Acquisition and Analysis of Effect Coatings

Filip, Jiří; Maile, F. J.
Ústav teorie informace a automatizace, 2017

Multi-period Factor Model of a Loan Portfolio
Šmíd, Martin; Dufek, J.
2017 - English
We construct a general dynamic model of losses of a large loan portfolio, secured by collaterals. In the model, the wealth of a debtor and the price of the corresponding collateral depend each on two factors: a common one, having a general distribution, and an individual one, following an AR(1) process. The default of a loan happens if the wealth stops to be su cient for repaying the loan. We show that the mapping transforming the common factors into the probability of default (PD) and the loss given default (LGD) is one-to-one twice continuously differentiable. As the transformation is not analytically tractable, we propose a numerical technique for its computation and demonstrate its accuracy by a numerical study.\nWe show that the results given by our multi-period model may differ signi cantly from\nthose resulting from single-period models, and demonstrate that our model naturally replicates\nthe empirically observed decrease of PDs within a portfolio in time. In addition, we give a formula for the overall loss of the portfolio and, as an example of its application, we formulate a simple optimal scoring decision problem and discuss its solution. Keywords: Credit Risk; Structural Factor Models; Loan Portfolio Management Fulltext is available at external website.
Multi-period Factor Model of a Loan Portfolio

We construct a general dynamic model of losses of a large loan portfolio, secured by collaterals. In the model, the wealth of a debtor and the price of the corresponding collateral depend each on two ...

Šmíd, Martin; Dufek, J.
Ústav teorie informace a automatizace, 2017

Alternative Formulation of Pay-as-clear Auction in Electricity Markets
Aussel, D.; Červinka, Michal; Henrion, R.; Pištěk, Miroslav
2017 - English
In widely used formulation of pay-as-clear electricity market the clearing price is given by the Lagrange multiplier of the demand sat- isfaction constraint in the problem of the Independent System Operator (ISO). Following this idea, one may usually calculate the market clearing\nprice analytically even for problems of higher dimensions. However, the economic interpretation of such a market setting is in question, since the minimized criterion does not correspond neither to the cost of production nor to the overall payment of consumers. This observation motivated us\nto propose an alternative clearing mechanism where the total payment of consumers is explicitly minimized. We show existence and uniqueness of the clearing price in such a setting. Keywords: electricity market; Lagrange multiplier; Independent System Operator Available at various institutes of the ASCR
Alternative Formulation of Pay-as-clear Auction in Electricity Markets

In widely used formulation of pay-as-clear electricity market the clearing price is given by the Lagrange multiplier of the demand sat- isfaction constraint in the problem of the Independent System ...

Aussel, D.; Červinka, Michal; Henrion, R.; Pištěk, Miroslav
Ústav teorie informace a automatizace, 2017

Diffusion MCMC for Mixture Estimation
Reichl, Jan; Dedecius, Kamil
2016 - English
Distributed inference of parameters of mixture models by a network of cooperating nodes (sensors) with computational and communication capabilities still represents a challenging task. In the last decade, several methods were proposed to solve this issue, predominantly formulated within the expectation-maximization framework and with the assumption of mixture components normality. The present paper adopts the Bayesian approach to inference of general (non-normal) mixtures via the Markov chain Monte Carlo simulation from the parameter posterior distribution. By collaborative tuning of node chains, the method allows reliable estimation even at nodes with significantly worse observational conditions, where the components may tend to merge due to high variances. The method runs in the diffusion networks, where the nodes communicate only with their adjacent neighbors within 1 hop distance. Keywords: Mixture; mixture estimation; MCMC Fulltext is available at external website.
Diffusion MCMC for Mixture Estimation

Distributed inference of parameters of mixture models by a network of cooperating nodes (sensors) with computational and communication capabilities still represents a challenging task. In the last ...

Reichl, Jan; Dedecius, Kamil
Ústav teorie informace a automatizace, 2016

Adaptive Blind Separation of Instantaneous Linear Mixtures of Independent Sources
Šembera, Ondřej; Tichavský, Petr; Koldovský, Zbyněk
2016 - English
In many applications, there is a need to blindly separate independent sources from their linear instantaneous mixtures while the mixing matrix or source properties are slowly or abruptly changing in time. The easiest way to separate the data is to consider off-line estimation of the model parameters repeatedly in time shifting window. Another popular method is the stochastic natural gradient algorithm, which relies on non-Gaussianity of the separated signals and is adaptive by its nature. In this paper, we propose an adaptive version of two blind source separation algorithms which exploit non-stationarity of the original signals. The results indicate that the proposed algorithms slightly outperform the natural gradient in the trade-off between the algorithm’s ability to quickly adapt to changes in the mixing matrix and the variance of the estimate when the mixing is stationary. Keywords: blind separation; algorithms; block gaussian separation Fulltext is available at external website.
Adaptive Blind Separation of Instantaneous Linear Mixtures of Independent Sources

In many applications, there is a need to blindly separate independent sources from their linear instantaneous mixtures while the mixing matrix or source properties are slowly or abruptly changing in ...

Šembera, Ondřej; Tichavský, Petr; Koldovský, Zbyněk
Ústav teorie informace a automatizace, 2016

Basic facts concerning extreme supermodular functions
Studený, Milan
2016 - English
Elementary facts and observations on the cone of supermodular set functions are recalled. The manuscript deals with such operations with set functions which preserve supermodularity\nand the emphasis is put on those such operations which even preserve extremality (of a supermodular function). These involve a few self-transformations of the cone of supermodular set functions. Moreover, projections to the (less-dimensional) linear space of set functions for a subset of the variable set are discussed. Finally, several extensions to the (more-dimensional) linear space of set functions for a superset of the variable set are shown to be both preserving supermodularity and extremality. Keywords: supermodular function; standardizations; extreme supermodular function Fulltext is available at external website.
Basic facts concerning extreme supermodular functions

Elementary facts and observations on the cone of supermodular set functions are recalled. The manuscript deals with such operations with set functions which preserve supermodularity\nand the emphasis ...

Studený, Milan
Ústav teorie informace a automatizace, 2016

About project

NRGL provides central access to information on grey literature produced in the Czech Republic in the fields of science, research and education. You can find more information about grey literature and NRGL at service web

Send your suggestions and comments to nusl@techlib.cz

Provider

http://www.techlib.cz

Facebook

Other bases