Limited-Memory Broyden, Fletcher, Goldfarb, Shannon algorithm. More...
#include <shark/Algorithms/GradientDescent/LBFGS.h>
Public Member Functions | |
LBFGS () | |
std::string | name () const |
From INameable: return the class name. | |
void | setHistCount (unsigned int numhist) |
Specify the amount of steps to be memorized and used to find the L-BFGS direction. | |
void | read (InArchive &archive) |
Read the component from the supplied archive. | |
void | write (OutArchive &archive) const |
Write the component to the supplied archive. | |
Public Member Functions inherited from shark::AbstractLineSearchOptimizer< SearchPointType > | |
AbstractLineSearchOptimizer () | |
void | init (ObjectiveFunctionType const &objectiveFunction, SearchPointType const &startingPoint) |
void | step (ObjectiveFunctionType const &objectiveFunction) |
LineSearch< SearchPointType > const & | lineSearch () const |
LineSearch< SearchPointType > & | lineSearch () |
SearchPointType const & | derivative () const |
Returns the derivative at the current point. Can be used for stopping criteria. | |
Public Member Functions inherited from shark::AbstractSingleObjectiveOptimizer< SearchPointType > | |
std::size_t | numInitPoints () const |
By default most single objective optimizers only require a single point. | |
virtual void | init (ObjectiveFunctionType const &function, std::vector< SearchPointType > const &initPoints) |
Initialize the optimizer for the supplied objective function using a set of initialisation points. | |
virtual void | init (ObjectiveFunctionType const &function, SearchPointType const &startingPoint)=0 |
initializes the optimizer using a predefined starting point | |
virtual const SolutionType & | solution () const |
returns the current solution of the optimizer | |
Public Member Functions inherited from shark::AbstractOptimizer< PointType, ResultT, SolutionTypeT > | |
const Features & | features () const |
virtual void | updateFeatures () |
bool | requiresValue () const |
bool | requiresFirstDerivative () const |
bool | requiresSecondDerivative () const |
bool | canSolveConstrained () const |
bool | requiresClosestFeasible () const |
virtual | ~AbstractOptimizer () |
virtual void | init (ObjectiveFunctionType const &function) |
Initialize the optimizer for the supplied objective function. | |
virtual void | init (ObjectiveFunctionType const &function, std::vector< SearchPointType > const &initPoints)=0 |
Initialize the optimizer for the supplied objective function using a set of initialisation points. | |
virtual void | step (ObjectiveFunctionType const &function)=0 |
Carry out one step of the optimizer for the supplied objective function. | |
Public Member Functions inherited from shark::INameable | |
virtual | ~INameable () |
Public Member Functions inherited from shark::ISerializable | |
virtual | ~ISerializable () |
Virtual d'tor. | |
void | load (InArchive &archive, unsigned int version) |
Versioned loading of components, calls read(...). | |
void | save (OutArchive &archive, unsigned int version) const |
Versioned storing of components, calls write(...). | |
BOOST_SERIALIZATION_SPLIT_MEMBER () | |
Protected Member Functions | |
void | initModel () |
Initializes the internal model. | |
void | computeSearchDirection (ObjectiveFunctionType const &) |
Updates the Model and computes the next search direction. | |
Protected Member Functions inherited from shark::AbstractOptimizer< PointType, ResultT, SolutionTypeT > | |
void | checkFeatures (ObjectiveFunctionType const &objectiveFunction) |
Convenience function that checks whether the features of the supplied objective function match with the required features of the optimizer. | |
Additional Inherited Members | |
Protected Attributes inherited from shark::AbstractLineSearchOptimizer< SearchPointType > | |
LineSearch< SearchPointType > | m_linesearch |
used line search method. | |
std::size_t | m_dimension |
number of parameters | |
double | m_initialStepLength |
Initial step length to begin with the line search. | |
SearchPointType | m_derivative |
gradient of m_best.point | |
SearchPointType | m_searchDirection |
search direction of next step | |
SearchPointType | m_lastPoint |
previous point | |
SearchPointType | m_lastDerivative |
gradient of the previous point | |
double | m_lastValue |
value of the previous point | |
Protected Attributes inherited from shark::AbstractSingleObjectiveOptimizer< SearchPointType > | |
SolutionType | m_best |
Current solution of the optimizer. | |
Protected Attributes inherited from shark::AbstractOptimizer< PointType, ResultT, SolutionTypeT > | |
Features | m_features |
Limited-Memory Broyden, Fletcher, Goldfarb, Shannon algorithm.
BFGS is one of the best performing quasi-newton methods. However for large scale optimization, storing the full hessian approximation is infeasible due to O(n^2) memory requirement. The L-BFGS algorithm does not store the full hessian approximation but only stores the data used for updating in the last steps. The matrix itself is then regenerated on-the-fly in an implicit matrix scheme. This brings runtime and memory rquirements of a single step down to O(n*hist_size).
The number of steps stored can be set with setHistCount. This is 100 as default.
The algorithm is implemented for unconstrained and constrained optimization in the case of box constraints. When box constraints are present and the algorithm encounters a constraint, a dog-leg style algorithm is used:
first, all variables with active constraints (e.g. x_i = l_i and g_i > 0) are fixed, i.e. p_i = 0. For the remaining variables, the unconstrained optimization problem is solved. If the solution does not statisfy the box constraints, in the next step the cauchy point is computed. If the cauchy point is feasible, we search the point along the line between unconstrained optimum and cauchy point that lies exactly on the constraint. This is the point with smallest value along the path. This does not find the true optimal step in the unconstrained problem, however a cheap and reasonably good optimum which often improves over naive coordinate descent.
typedef AbstractLineSearchOptimizer<SearchPointType>::ObjectiveFunctionType shark::LBFGS< SearchPointType >::ObjectiveFunctionType |
|
inline |
Definition at line 77 of file LBFGS.h.
References shark::AbstractOptimizer< PointType, ResultT, SolutionTypeT >::CAN_SOLVE_CONSTRAINED, and shark::AbstractOptimizer< PointType, ResultT, SolutionTypeT >::m_features.
|
protectedvirtual |
Updates the Model and computes the next search direction.
After a step was performed, this method is called to compute the next search direction. This usually involves updating the internal model using the new and old step information. Afterwards m_searchDirection should contain the next search direction.
Implements shark::AbstractLineSearchOptimizer< SearchPointType >.
|
protectedvirtual |
Initializes the internal model.
Line Search Methods use a Model to search for the next search direction. The model is initialized during init()
Implements shark::AbstractLineSearchOptimizer< SearchPointType >.
|
inlinevirtual |
From INameable: return the class name.
Reimplemented from shark::INameable.
|
virtual |
Read the component from the supplied archive.
[in,out] | archive | The archive to read from. |
Reimplemented from shark::AbstractLineSearchOptimizer< SearchPointType >.
|
inline |
Specify the amount of steps to be memorized and used to find the L-BFGS direction.
numhist | The amount of steps to use. |
Definition at line 88 of file LBFGS.h.
References SHARK_RUNTIME_CHECK.
|
virtual |
Write the component to the supplied archive.
[in,out] | archive | The archive to write to. |
Reimplemented from shark::AbstractLineSearchOptimizer< SearchPointType >.