深度學習21天實戰caffe學習筆記《6 : Caffe代碼梳理》

Caffe代碼梳理

1、caffe目錄結構

2、caffe閱讀路線：

src/caffe/proto/caffe.proto 了解基本數據結構內存對象和磁盤文件的一一映射，主要由ProtoBuffer工具完成；
include頭文件理解整個框架，從基類向派生類順藤摸瓜；
.cpp和.cu文件 caffe框架不需要大改，只需要按需求派生出新的類實現即可；
編寫各類工具集成到caffe內部如tools下的工具；

注：快速追蹤關鍵字

（1）多個終端，用vi的查找命令追蹤；

（2）使用linux grep，在caffe根目錄下運行：

$grep -n -H -R "XXXXX"

[ -n : 顯示行號； -H ：顯示文件名； -R ：遞歸查找每個子目錄；]

3、caffe支持的速度學習特性

卷積層和全連接層統稱為權值層，具有可學習參數（權值）：

3.1、卷積層：

卷積層計算步驟由二維增加至三維、四維卷積，多了“通道（channel）”，每個通道進行二維卷積，沒有“翻轉”，而是與輸入圖片做滑動窗口“相關”計算；

多個通道與多個卷積核分別進行二維卷積，得到多通道輸出，“合并”為一個通道；

注：其中L、I、J參數可以在.prototxt中找到，圖像大小M*N,輸出通道K從日志文件可以找到。

3.2、全連接層

全連接層每個節點與相鄰層的所有節點都連接，計算類型為矩陣-向量乘；

y=Wx （輸入節點向量x;維度D；輸出節點向量y，維度V；W為V.D維權值矩陣）

則：

單樣本前向傳播計算量：CaculationsMAC=V.D

參數統計：Params =V.D

CPR值： CPR=Caculations/Params=1(始終為1，與輸入輸出維度無關，權值重復利用率很低)

卷積層vs全連接層：參數量全連接層是卷積層的16倍；計算量只有25%；（得益于權值共享&局部連接）

3.3、**函數

常用的**函數：

在caffe中位于：include/caffe/neural_layers.hpp

src/caffe/proto/caffe.proto

message ReLUParameter {

  optional float negative_slope = 1 [default = 0];
  enum Engine {
    DEFAULT = 0;
    CAFFE = 1;
    CUDNN = 2;
  }
  optional Engine engine = 2 [default = DEFAULT];
}

Relu.layer

#ifndef CAFFE_RELU_LAYER_HPP_
#define CAFFE_RELU_LAYER_HPP_

#include <vector>

#include "caffe/blob.hpp"
#include "caffe/layer.hpp"
#include "caffe/proto/caffe.pb.h"

#include "caffe/layers/neuron_layer.hpp"

namespace caffe {

/**
 * @brief Rectified Linear Unit non-linearity @f$ y = \max(0, x) @f$.
 *        The simple max is fast to compute, and the function does not saturate.
 */
template <typename Dtype>
class ReLULayer : public NeuronLayer<Dtype> {      //派生于NeuronLayer，實現Relu**函數的計算
 public:
  /**
   * @param param provides ReLUParameter relu_param,
   *     with ReLULayer options:
   *   - negative_slope (\b optional, default 0).
   *     the value @f$ \nu @f$ by which negative values are multiplied.
   */
  //顯示構造函數
  explicit ReLULayer(const LayerParameter& param)
      : NeuronLayer<Dtype>(param) {}
//返回類名字符串
  virtual inline const char* type() const { return "ReLU"; }

 protected:
  //前向傳播函數
  virtual void Forward_cpu(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top);
  virtual void Forward_gpu(const vector<Blob<Dtype>*>& bottom,
      const vector<Blob<Dtype>*>& top);

  //反向傳播函數
  virtual void Backward_cpu(const vector<Blob<Dtype>*>& top,
      const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom);
  virtual void Backward_gpu(const vector<Blob<Dtype>*>& top,
      const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom);
};

}  // namespace caffe

#endif  // CAFFE_RELU_LAYER_HPP_

src/caffe/layers/relu_layer.cpp

#include <algorithm>
#include <vector>

#include "caffe/layers/relu_layer.hpp"

namespace caffe {

template <typename Dtype>
void ReLULayer<Dtype>::Forward_cpu(const vector<Blob<Dtype>*>& bottom,
    const vector<Blob<Dtype>*>& top) {
  const Dtype* bottom_data = bottom[0]->cpu_data();
  Dtype* top_data = top[0]->mutable_cpu_data();
  const int count = bottom[0]->count();
  Dtype negative_slope = this->layer_param_.relu_param().negative_slope();
  for (int i = 0; i < count; ++i) {
    top_data[i] = std::max(bottom_data[i], Dtype(0))
        + negative_slope * std::min(bottom_data[i], Dtype(0));
  }
}

template <typename Dtype>
void ReLULayer<Dtype>::Backward_cpu(const vector<Blob<Dtype>*>& top,
    const vector<bool>& propagate_down,
    const vector<Blob<Dtype>*>& bottom) {
  if (propagate_down[0]) {
    const Dtype* bottom_data = bottom[0]->cpu_data();
    const Dtype* top_diff = top[0]->cpu_diff();
    Dtype* bottom_diff = bottom[0]->mutable_cpu_diff();
    const int count = bottom[0]->count();
    Dtype negative_slope = this->layer_param_.relu_param().negative_slope();
    for (int i = 0; i < count; ++i) {
      bottom_diff[i] = top_diff[i] * ((bottom_data[i] > 0)
          + negative_slope * (bottom_data[i] <= 0));
    }
  }
}


#ifdef CPU_ONLY
STUB_GPU(ReLULayer);
#endif

INSTANTIATE_CLASS(ReLULayer);

}  // namespace caffe

本文鏈接：https://blog.csdn.net/Julialove102123/article/details/79088973

智能推薦

深度學習Caffe實戰筆記（1）環境搭建

（1）環境搭建從知道深度學習開始，就一直想學習使用caffe，礙于各種事情一直沒有如愿，這幾天終于找了個時間搞了一下，打算把學習的過程整理成筆記，包括環境搭建、跑車牌識別數據，跑mnist數據，用Alexnet跑自己的數據，用Siamese網絡跑mnist數據，用Siamese網絡跑自己的數據以及如何調整網絡結構等等。。。。后續我會慢慢更新，筆記的主要內容是如何使用caffe，主要側重于實戰，基...

《21天實戰caffe》讀書筆記（六）

第六天運行手寫體數字識別例程 1、MNIST數據集 MNIST數據集包括60000個訓練集和10000個測試集，每張圖都已經進行尺寸歸一化、數字居中處理，固定尺寸為28像素*28像素。 MNIST數據集可以在Caffe源碼框架的data/mnist下用get_mnist.sh腳本下載。 MNIST原始數據為4個文件，數據格式描述見下圖。下載到的原始數據集為二進制文件，需要轉換為LEVELDB或...

《深度學習——實戰caffe》——caffe數據結構

caffe中一個CNN模型由Net表示，Net由多個Layer堆疊而成。 caffe的萬丈高樓(Net)是由圖紙（prototxt），用blob這些磚塊筑成一層層（Layer），最后通過SGD方法（Solver）進行簡裝修（train）、精裝修（finetune）實現的。 Blob數據結構介紹 Blob提供了統一的存儲器接口，TensorFlow中的Tensor也有對應的Blob數據結構。 Caf...

深度學習caffe實戰筆記（4）Windows caffe平臺下跑cifar10

上一篇博客介紹了如何用alexnet跑自己的數據，能跑自己的數據按理說再跑cifar10應該沒問題了啊，但是想想還是要把cifar10的記錄下來，因為cifar10數據格式是屬于特殊的數據格式，需要用caffe環境把數據轉換文件編譯出來，這也是后面Siamese網絡所必須的一個步驟，說到Siamese網絡，，，，我要再哭5分鐘。好，五分鐘時間到，我們開始train。另外，如果是Ubuntu系統，跑...

《深度學習——實戰caffe》——繪制網絡模型

python/draw_net.py, 這個文件，就是用來繪制網絡模型的。也就是將網絡模型由prototxt變成一張圖片。在繪制之前，需要先安裝兩個庫１、安裝ＧraphViz 注意，這里用的是apt-get來安裝，而不是pip. 2 、安裝pydot 用的是pip來安裝，而不是apt-get 安裝好了，就可以調用腳本來繪制圖片了 draw_net.py執行的時候帶三個參數第一個參數：網絡模型...