embedding_finetuned / README.md
SamagraDataGov's picture
pytorch_model.bin upload/update
a9706a5 verified
---
base_model: BAAI/bge-small-en-v1.5
datasets: []
language: []
library_name: sentence-transformers
metrics:
- cosine_accuracy@1
- cosine_accuracy@5
- cosine_accuracy@10
- cosine_precision@1
- cosine_precision@5
- cosine_precision@10
- cosine_recall@1
- cosine_recall@5
- cosine_recall@10
- cosine_ndcg@5
- cosine_ndcg@10
- cosine_ndcg@100
- cosine_mrr@5
- cosine_mrr@10
- cosine_mrr@100
- cosine_map@100
- dot_accuracy@1
- dot_accuracy@5
- dot_accuracy@10
- dot_precision@1
- dot_precision@5
- dot_precision@10
- dot_recall@1
- dot_recall@5
- dot_recall@10
- dot_ndcg@5
- dot_ndcg@10
- dot_ndcg@100
- dot_mrr@5
- dot_mrr@10
- dot_mrr@100
- dot_map@100
pipeline_tag: sentence-similarity
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:900
- loss:GISTEmbedLoss
widget:
- source_sentence: How many more Kiosks was SBI planning to establish in the next
one year?
sentences:
- '''First installment due on (date) : ii). Last Installment due on (date)
: 6. b). Cash Credit : Limit: Drawing Power: Outstanding: Comments
on Irregularity ( if any): Any adverse comments on the unit by inspecting
official in last inspection report: 7. A. Cost of Project (as accepted by
sanctioning authority)(In Rs. Lakh) B. Means of Finance (as accepted by sanctioning
authority)(In Rs. Lakh) Give component wise details a. Term loan of Bank: b.
Promoter Equity c. Unsecured loan : d. Others if any Total Total 8. A.
Forward Linkages: B. Backward Linkages with Small/Marginal farmers: 1 No.
of members: 2 Details of Primary and Collateral Securities taken by the
bank (if any) 3 a. Primary Securities b. Collateral Securities 4 5 6 (Please
enclose details separately) 9 NameoftheConsortium(ifany)associatedwithCreditFacilitywithcompleteaddress,contac
t details and email: 9 a) Address (*with pin-code) : 9 b) Contact Details
: 9 c) Email Address : Request of Branch head for Credit Guarantee:- In
view of the above information, we request Credit Guarantee Cover against Credit
Facility of Rs.....................(in Rupees ) to FPO(copy of sanction letter
along with appraisal/process note of competent authority is enclosed for your
perusal and record ). Further we confirm that : 1. The KYC norms in respect of
the Promoters have been complied by us. 2. Techno-feasibility and economic viability
aspect of the project has been taken care of by the sanctioning authority and
the branch. 3. On quarterly basis, bank will apprise the ........................(Name
of Implementing Agency)about progress of unit, recovery of bank''s dues and present
status of account to........................(Name of Implementing Agency) 4.
We undertake to abide by the Terms & Conditions of the Scheme.'''
- '''Date: To, (i) The Managing Director Small Farmers'' Agri-Business Consortium
(SFAC), NCUI Auditorium, August Kranti Marg, Hauz Khas, New Delhi 110016. (ii)The
Managing Director National Co-operative Development Corporation (NCDC), 4, Siri
Institutional Area, Hauz Khas, New Delhi 110016. (iii) The Chief General Manager
National Bank for Agriculture and Rural Development (NABARD), Regional Office
--------------------------------------------------------------- (iv) To any other
additional Implementing Agency allowed/designated, as the case may be. Sub: Application
for Equity Grant under scheme of Formation and Promotion of 10,000 Farmer Producer
Organizations (FPOs) Dear Sir/Madam, We herewith apply for Equity Grant as per
the provisions under the captioned scheme. 1. The details of the FPO are as under- S.
No. Particulars to be furnished Details 1. Name of the FPO 2. Correspondence
address of FPO 3. Contact details of FPO 4. Registration Number 5. Date
of registration/incorporation of FPO 6. Brief account of business of FPO 7. Number
of Shareholder Members 8. Number of Small, Marginal and Landless Shareholder
Members'''
- '''1.6 Due to greater acceptability of the federations in the villages, State
Bank of India (SBI) approved opening of Kiosks under BC model through federations
to achieve financial inclusion. As on September 2014, 16 kiosks were working through
three farmer club federations. SBI was in the process of establishing 14 more
Kiosks at other village centres in the next one year in the district. The Kiosks
were attached to the nearest branch and worked under the guidance of the concerned
Branch Manager. The Branch Manager supervises and monitors the work of the Kiosks
(BC). 1.7 At present, the Kiosks are mainly involved in providing banking services
like, opening of savings bank accounts, recurring deposit accounts, acceptance
of deposits and payment towards withdrawal. The kiosks are also dispensing old
age pensions, student scholarships, MNREGA payments and other social sector payments,
routed by the Government. The present monthly income (Rs. 8000 to Rs. 14,000)
of the Kiosk is mainly from banking services. The expenditure involved was salary
to the operator, rent of the premises, interest on the initial investment etc.,
which is about Rs. 8000 to Rs. 10,000 (Salary of the operator-Rs.4000 to Rs. 5000,
Premises rent-about Rs. 2000 to Rs. 3000).'''
- source_sentence: How are the kiosks attached to the nearest branch?
sentences:
- '''In addition, past yield data for requisite number of years will have to be
made available separately for both 7.2.6 While notifying the crop(s) where a
specific conversion factor is being used for reporting of yield such as in the
case of rice/paddy etc, due care should be taken by the State Nodal Department
to use the relevant specific nomenclature for disclosure of Average Yield, Threshold
Yield and Actual Yield while releasing the Tender Document and submission of Yield
data and CCE data for calculation of admissible claims. Insurance Companies will
also be responsible for prior scrutiny of Tender document. Information/data provided
in Tender document will be treated as final and in case of any error/ misreporting/disparity,
State Govt. and Insurance Company will be equally liable for payment of additional
claims arising on account of it, if any. 7.2.7 For the current season or subsequent
seasons (in a multi-year contract), the States, if required, can notify additional
IUs or de-notify certain IUs subject to maximum deviation of 10% of already
notified IUs for the crop within a district at the same premium rate, before
the cut-off date for debit of premium. If the deviation is >10% or in case of
addition of new crop, actuarial premium rate may be worked out either by calculation
of weighted average premium rate as prevalent in contiguous districts or by
applying appropriate loading on the existing premium rate. The rates for such
crops will be determined /verified by TSU and its decision will be binding on
both States and ICs. 7.2.8 States implementing PMFBY at Village/ Village Panchayat
level for major crops shall be entitled for 50% reimbursement of incremental
expenses of CCEs and cost of smart phones/ improved technology **from GOI.** Only
eligible items will be considered for reimbursement.'''
- '''i. The credit guarantee cover per FPO will be limited to the project loan of
Rs. 2 crore. In case of project loan up to Rs. 1 crore, credit guarantee cover
will be 85% of bankable project loan with ceiling of Rs. 85 lakh; while in case
of project loan above Rs.1 crore and up to Rs. 2 crore, credit guarantee cover
will be 75% of bankable project loan with a maximum ceiling of Rs. 150 lakh. However,
for project loan over Rs. 2 crore of bankable projet loan, credit guarantee cover
will be limited maximum upto Rs.2.0 crore only. ii. ELI shall be eligible to
seek Credit Guarantee Cover for a credit facility sanctioned in respect of a
single FPO borrower for a maximum of 2 times over a period of 5 years. iii.
In case of default, claims shall be settled up to 85% or 75 % of the amount in default
subject to maximum cover as specified above. iv. Other charges such as penal
interest, commitment charge, service charge, or any other levies/ expenses, or
any costs whatsoever debited to the account of FPO by the ELI other than the
contracted interest shall not qualify for Credit Guarantee Cover. v. The Cover
shall only be granted after the ELI enters into an agreement with NABARD or NCDC,
as the case may be, and shall be granted or delivered in accordance with the Terms
and Conditions decided upon by NABARD or NCDC, as the case may be, from time to
time.'''
- '''1.6 Due to greater acceptability of the federations in the villages, State
Bank of India (SBI) approved opening of Kiosks under BC model through federations
to achieve financial inclusion. As on September 2014, 16 kiosks were working through
three farmer club federations. SBI was in the process of establishing 14 more
Kiosks at other village centres in the next one year in the district. The Kiosks
were attached to the nearest branch and worked under the guidance of the concerned
Branch Manager. The Branch Manager supervises and monitors the work of the Kiosks
(BC). 1.7 At present, the Kiosks are mainly involved in providing banking services
like, opening of savings bank accounts, recurring deposit accounts, acceptance
of deposits and payment towards withdrawal. The kiosks are also dispensing old
age pensions, student scholarships, MNREGA payments and other social sector payments,
routed by the Government. The present monthly income (Rs. 8000 to Rs. 14,000)
of the Kiosk is mainly from banking services. The expenditure involved was salary
to the operator, rent of the premises, interest on the initial investment etc.,
which is about Rs. 8000 to Rs. 10,000 (Salary of the operator-Rs.4000 to Rs. 5000,
Premises rent-about Rs. 2000 to Rs. 3000).'''
- source_sentence: What is the principle on which the Scheme operates?
sentences:
- '''| Sl. No Section | Page No. |\n|-------------------------------------------------------------|---------------------------------------------------------------------------------|\n|
Abbreviations | I-II |\n|
1 | Objective of the
Scheme |\n| 2 |
Adoption of Technology for Scheme Administration |\n|
3 | Coverage of Farmers |\n|
4 | Coverage of Crops |\n|
5 | Coverage of Risks
& Exclusions |\n| 6 |
Preconditions for implementation of the Scheme |\n|
7 | Notification |\n|
8 | |\n|
Engagement of Common Service Centres and Intermediaries for | |\n|
coverage of non loanee Farmers | |\n|
11 | |\n|
9 | Electronic Remittance
of Funds |\n| 10 |
Census code Mapping of Entities |\n|
11 | Digitization of
Land Records |\n| 12 |
Sum Insured/Coverage Limit |\n|
13 | Premium Rates and
Premium Subsidy |\n| 14 |
Budget for Administrative Expenses |\n|
15 | Technical Support
Unit(TSU)/Central Programme Management Unit(CPMU) |\n| 16 |
Seasonality Discipline |\n|
17 | Collection of Proposals
and Premium from Farmers |\n| 18 |
Assessment of Loss/Short Fall in Yield |\n|
19 | Dispute Resolution
regarding Yield Data/Crop Loss |\n| 20 |
Use of Innovative Technologies |\n|
21 | Assessment of Claims |\n|
22 | Participation of
Loss Assessors/Evaluators for Loss Assessment under the Scheme |\n| 23 |
Procedure for Settlement of Claims |\n|
24 | Important Conditions/Clauses
Applicable for Coverage of Risks |\n| 25 |
Acreage Discrepancy |\n|
26 | Publicity and Awareness |\n|
27 | Service Charges |\n|
28 | Goods & Service
Tax(GST) |\n| 29 |
Monitoring and Review of the Scheme |\n|
30 | Grievance Redressal
Mechanism |\n| 31 |
Empanelment and Selection of Insurance Companies |\n|
32 | Clustering/Clubbing
of districts for bidding by the State |\n| 33 |
Assessment of Performance and De-empanelment of Insurance Companies |\n|
34 | Evaluation of Efficiency
of Nodal Department of the State |\n| 35 |
Role & Responsibilities of Various Agencies |\n|
36 | National Crop Insurance
Portal for administration of Crop Insurance Program |\n| Annexure - 1 |
78-85 |\n|
Annexure - 2 | 86-89 |\n|
Annexure - 3 | 90-93 |'''
- '''16.1 The cut-off date is uniform for both loanee and non-loanee cultivators.
The State-wise cut-off dates for different crops shall be based on Crop Calendar
of major crops published from time to time by the Directorate of Economics and
Statistics, DAC&FW,GOI. The latest copy of the Crop Calendar (District Wise,
Crop Wise) is available on www.pmfby.gov.in. The SLCCCI, shall besides considering
the prevailing agro-climatic conditions, rainfall distribution/ availability
of water for irrigation, sowing pattern etc. in consultation with the Insurance
Company fix seasonality discipline of the coverage and other activities in such
a way that it does not encourage adverse selection or moral hazards. If this
is violated by SLCCCI, GOI may decide not to provide premium subsidy. 16.2 The
**broad indicative seasonality discipline** is given in the Table 2 below:'''
- '''7.2.1 The Scheme shall operate on the principle of \''Area Approach\'' in
the selected defined areas called Insurance Unit (IU). State Govt. /UT will notify
crops and defined areas covered during the season in accordance with decision
taken in the meeting of SLCCCI. State/UT Govt. should notify Village/Village Panchayat
or any other equivalent unit as an insurance unit for major crops defined at District
/ Taluka or equivalent level. For **other crops** it may be a unit of size above
the level of Village/village Panchayat. For defining a crop as a major crop for
deciding the Insurance Unit level, the sown area of'''
- source_sentence: How can the government prioritize FPOs?
sentences:
- ''' 2.7 Secured credential/login, preferably linked with Aadhaar Number and
mobile OTP based, for all Stakeholders viz, Central Government, State Governments,
Banks, empanelled Insurance Companies and their designated field functionaries
will be provided on the Portal to enable them to enter/upload/download the
requisite information. 2.8 Insurance Companies shall not distribute/collect/allow
any other proforma/utility/web Portal etc for collecting details of insured
farmers separately. However they may provide all requisite support to facilitate
Bank Branches/PACS for uploading the farmer''s details on the Portal well within
the prescribed cut-off dates. 2.9 Only farmers whose data is uploaded on
the National Crop Insurance Portal shall be eligible for Insurance coverage
and the premium subsidy from State and Central Govt. will be released accordingly. 2.10 All
data pertaining to crop-wise, area-wise historical yield data, weather data, sown
area, coverage and claims data, calamity years and actual yield shall be made
available on the National Crop Insurance Portal for the purpose of premium
rating, claim calculation etc. 2.11 Banks/Financial Institutions/other intermediaries
need to compulsorily transfer the individual farmer''s data electronically
to the National Crop Insurance Portal. Accordingly Banks/FIs may endeavour to undertake
CBS integration in a time bound manner for real time transfer of information/data. 2.12 It
is also proposed to develop an integrated platform/portal for both PMFBY and Interest
Subvention Scheme. The data/information of both the Schemes shall be auto synchronized
to enable real time sharing of information and better program monitoring. 2.13 Insurance
Companies shall compulsorily use technology/mobile applications for monitoring
of crop health/Crop Cutting Experiments (CCEs) in coordination with concerned
States. States shall also facilitate Insurance Companies with Satellite Imagery/Usage
of Drones by way of prior approval of agency from which such data can be sourced.
This is required for better monitoring and ground- truthing.'''
- ''' (vi) States/Union Territories may actively consider to make available appropriate
size of land to FPOs for setting up of CFCs and CHCs at cheaper rate on rent/lease or
otherwise; or may make available free of cost. (vii) Government may prioritize
FPOs to undertake procurement operation on Minimum Support Price (MSP). (viii)
States must actively consider encouraging FPOs for selling their produce through e-National
Agriculture Market (e-NAM) including FPO module of e-NAM or through other electronic
platform from their premises itself without physically bringing the produce to
the APMC market yards. (ix) Department of Agriculture, Cooperation & Farmers
Welfare is authorized to finalize Operational Guidelines of the scheme (and model
Bye Laws if any) including mid-term changes thereto, and issue the same with the
approval of Hon''ble Minister for Agriculture & Farmers'' Welfare. .'''
- '''1.1 Chhattisgarh is among the few states in India that have recorded impressive
growth in agriculture in recent years. Development of farmers own institutions
catering to their various needs, has kept pace with the agricultural growth. As
on 30 September 2014, the state had 3,679 farmers clubs (FCs). There were eight
federations of farmer clubs in the state, five in Mahasamund, two in Bilaspur
and one in Mungeli district. In Bilaspur and Mungeli districts (the study area),
300 FCs were formed, of which 201 were active. Majority of the farmer clubs (129
clubs) were formed by the Regional Rural Bank (Gramin Bank). Other promoting institutions
include Chhattisgarh Agricon Samiti (30), CARMDAKSH (12), SBI (12), ARDB (8) and
IFFDC (5). While all the clubs were active in the initial three years, many slipped
into dormancy through inaction and non-availability of hand-holding support. These
clubs did not have any vision or roadmap for the future. 1.2 The Chhattisgarh
RO and DDM Bilaspur were keen to make the farmer clubs a sustainable entity and
felt the need to federate the clubs to a higher tier so as to make the entire
farmer clubs programme sustainable and the organization a viable model. With this
in view, the farmer clubs were federated into four farmer club federations and
were registered under ''Chhattisgarh Society Registrikaran Adhiniyam, 1973'' in
the year 2012.'''
- source_sentence: What is the credit guarantee cover for a project loan up to Rs.
1 crore?
sentences:
- ''' (ii) Ongoing schemes of Government will be used in convergence to enhance
the cost effectiveness of FPOs in production and raising productivity and also
to meet the cost of infrastructure requirement of the FPOs. Implementing Agency
may converge the fund available with various on-going Government of India schemes
such as Rashtriya Krishi Vikas Yojna (RKVY), Mission for Integrated Development
for Horticulture (MIDH),National Food Security Mission (NFSM), Pradhan Mantri
Kisan Sampada Yojna (PM-SAMPADA), Deendayal Antyodaya Yojna-National Rural Livelihood
Mission (DAY-NRLM), PM- FME Scheme of MoFPI, TRIFED etc. in programs, activities
and creation of infrastructure like Custom Hiring Centre/Common Facilitation Centre
with machinery/equipment relating to production and post-production, value addition
and farm level processing, storage and other activities to make FPOs sustainable
and economically viable. (iii) Further, Agricultural Marketing Infrastructure
(AMI) Sub-Scheme of Integrated Scheme for Agriculture Marketing (ISAM) will also
be converged and an FPO willing to develop post-harvest management and marketing
infrastructure can avail assistance thereunder. (iv) States/ Union Territories
can avail assistance for development of marketing and farm level value addition
infrastructure/facilities for FPOs including setting up of Custom Hiring Centre
(CHC)/Common Facilitation Center (CFC) for marketing and supply chain etc. under
Agri- Market Infrastructure Fund (AMIF) approved for creation in NABARD for development
of marketing and farm level value addition infrastructure/facilities in Gramin
Agriculture Markets (GrAMs). In this case, operational guidelines of AMIF and
NABARD''s procedure and terms and conditions of sanction and repayment of loan
for AMIF shall be applicable. (v) States/Union Territories can top up and additionally
supplement the activities of FPOs from their own fund for activities and infrastructure
not covered under Government of India Scheme.'''
- '''7.2.1 The Scheme shall operate on the principle of \''Area Approach\'' in
the selected defined areas called Insurance Unit (IU). State Govt. /UT will notify
crops and defined areas covered during the season in accordance with decision
taken in the meeting of SLCCCI. State/UT Govt. should notify Village/Village Panchayat
or any other equivalent unit as an insurance unit for major crops defined at District
/ Taluka or equivalent level. For **other crops** it may be a unit of size above
the level of Village/village Panchayat. For defining a crop as a major crop for
deciding the Insurance Unit level, the sown area of'''
- '''i. The credit guarantee cover per FPO will be limited to the project loan of
Rs. 2 crore. In case of project loan up to Rs. 1 crore, credit guarantee cover
will be 85% of bankable project loan with ceiling of Rs. 85 lakh; while in case
of project loan above Rs.1 crore and up to Rs. 2 crore, credit guarantee cover
will be 75% of bankable project loan with a maximum ceiling of Rs. 150 lakh. However,
for project loan over Rs. 2 crore of bankable projet loan, credit guarantee cover
will be limited maximum upto Rs.2.0 crore only. ii. ELI shall be eligible to
seek Credit Guarantee Cover for a credit facility sanctioned in respect of a
single FPO borrower for a maximum of 2 times over a period of 5 years. iii.
In case of default, claims shall be settled up to 85% or 75 % of the amount in default
subject to maximum cover as specified above. iv. Other charges such as penal
interest, commitment charge, service charge, or any other levies/ expenses, or
any costs whatsoever debited to the account of FPO by the ELI other than the
contracted interest shall not qualify for Credit Guarantee Cover. v. The Cover
shall only be granted after the ELI enters into an agreement with NABARD or NCDC,
as the case may be, and shall be granted or delivered in accordance with the Terms
and Conditions decided upon by NABARD or NCDC, as the case may be, from time to
time.'''
model-index:
- name: SentenceTransformer based on BAAI/bge-small-en-v1.5
results:
- task:
type: information-retrieval
name: Information Retrieval
dataset:
name: val evaluator
type: val_evaluator
metrics:
- type: cosine_accuracy@1
value: 0.54
name: Cosine Accuracy@1
- type: cosine_accuracy@5
value: 0.89
name: Cosine Accuracy@5
- type: cosine_accuracy@10
value: 0.92
name: Cosine Accuracy@10
- type: cosine_precision@1
value: 0.54
name: Cosine Precision@1
- type: cosine_precision@5
value: 0.17799999999999994
name: Cosine Precision@5
- type: cosine_precision@10
value: 0.09199999999999997
name: Cosine Precision@10
- type: cosine_recall@1
value: 0.54
name: Cosine Recall@1
- type: cosine_recall@5
value: 0.89
name: Cosine Recall@5
- type: cosine_recall@10
value: 0.92
name: Cosine Recall@10
- type: cosine_ndcg@5
value: 0.7328328247017718
name: Cosine Ndcg@5
- type: cosine_ndcg@10
value: 0.7420327705006653
name: Cosine Ndcg@10
- type: cosine_ndcg@100
value: 0.7588813763663693
name: Cosine Ndcg@100
- type: cosine_mrr@5
value: 0.6800000000000002
name: Cosine Mrr@5
- type: cosine_mrr@10
value: 0.6835000000000001
name: Cosine Mrr@10
- type: cosine_mrr@100
value: 0.6868224050433418
name: Cosine Mrr@100
- type: cosine_map@100
value: 0.6868224050433418
name: Cosine Map@100
- type: dot_accuracy@1
value: 0.55
name: Dot Accuracy@1
- type: dot_accuracy@5
value: 0.89
name: Dot Accuracy@5
- type: dot_accuracy@10
value: 0.92
name: Dot Accuracy@10
- type: dot_precision@1
value: 0.55
name: Dot Precision@1
- type: dot_precision@5
value: 0.17799999999999994
name: Dot Precision@5
- type: dot_precision@10
value: 0.09199999999999997
name: Dot Precision@10
- type: dot_recall@1
value: 0.55
name: Dot Recall@1
- type: dot_recall@5
value: 0.89
name: Dot Recall@5
- type: dot_recall@10
value: 0.92
name: Dot Recall@10
- type: dot_ndcg@5
value: 0.7365235271660572
name: Dot Ndcg@5
- type: dot_ndcg@10
value: 0.7457234729649508
name: Dot Ndcg@10
- type: dot_ndcg@100
value: 0.7625720788306548
name: Dot Ndcg@100
- type: dot_mrr@5
value: 0.6850000000000002
name: Dot Mrr@5
- type: dot_mrr@10
value: 0.6885000000000001
name: Dot Mrr@10
- type: dot_mrr@100
value: 0.6918224050433417
name: Dot Mrr@100
- type: dot_map@100
value: 0.6918224050433417
name: Dot Map@100
---
# SentenceTransformer based on BAAI/bge-small-en-v1.5
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [BAAI/bge-small-en-v1.5](https://huggingface.co/BAAI/bge-small-en-v1.5). It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
## Model Details
### Model Description
- **Model Type:** Sentence Transformer
- **Base model:** [BAAI/bge-small-en-v1.5](https://huggingface.co/BAAI/bge-small-en-v1.5) <!-- at revision 5c38ec7c405ec4b44b94cc5a9bb96e735b38267a -->
- **Maximum Sequence Length:** 512 tokens
- **Output Dimensionality:** 384 tokens
- **Similarity Function:** Cosine Similarity
<!-- - **Training Dataset:** Unknown -->
<!-- - **Language:** Unknown -->
<!-- - **License:** Unknown -->
### Model Sources
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
### Full Model Architecture
```
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': True}) with Transformer model: BertModel
(1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
```
## Usage
### Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
```bash
pip install -U sentence-transformers
```
Then you can load this model and run inference.
```python
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("SamagraDataGov/embedding_finetuned")
# Run inference
sentences = [
'What is the credit guarantee cover for a project loan up to Rs. 1 crore?',
"'i. The credit guarantee cover per FPO will be limited to the project loan of Rs. 2 crore. In case of project loan up to Rs. 1 crore, credit guarantee cover will be 85% of bankable project loan with ceiling of Rs. 85 lakh; while in case of project loan above Rs.1 crore and up to Rs. 2 crore, credit guarantee cover will be 75% of bankable project loan with a maximum ceiling of Rs. 150 lakh. However, for project loan over Rs. 2 crore of bankable projet loan, credit guarantee cover will be limited maximum upto Rs.2.0 crore only. ii. ELI shall be eligible to seek Credit Guarantee Cover for a credit facility sanctioned in respect of a single FPO borrower for a maximum of 2 times over a period of 5 years. iii. In case of default, claims shall be settled up to 85% or 75 % of the amount in default subject to maximum cover as specified above. iv. Other charges such as penal interest, commitment charge, service charge, or any other levies/ expenses, or any costs whatsoever debited to the account of FPO by the ELI other than the contracted interest shall not qualify for Credit Guarantee Cover. v. The Cover shall only be granted after the ELI enters into an agreement with NABARD or NCDC, as the case may be, and shall be granted or delivered in accordance with the Terms and Conditions decided upon by NABARD or NCDC, as the case may be, from time to time.'",
"'7.2.1 The Scheme shall operate on the principle of \\'Area Approach\\' in the selected defined areas called Insurance Unit (IU). State Govt. /UT will notify crops and defined areas covered during the season in accordance with decision taken in the meeting of SLCCCI. State/UT Govt. should notify Village/Village Panchayat or any other equivalent unit as an insurance unit for major crops defined at District / Taluka or equivalent level. For **other crops** it may be a unit of size above the level of Village/village Panchayat. For defining a crop as a major crop for deciding the Insurance Unit level, the sown area of'",
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 384]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
```
<!--
### Direct Usage (Transformers)
<details><summary>Click to see the direct usage in Transformers</summary>
</details>
-->
<!--
### Downstream Usage (Sentence Transformers)
You can finetune this model on your own dataset.
<details><summary>Click to expand</summary>
</details>
-->
<!--
### Out-of-Scope Use
*List how the model may foreseeably be misused and address what users ought not to do with the model.*
-->
## Evaluation
### Metrics
#### Information Retrieval
* Dataset: `val_evaluator`
* Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator)
| Metric | Value |
|:--------------------|:-----------|
| cosine_accuracy@1 | 0.54 |
| cosine_accuracy@5 | 0.89 |
| cosine_accuracy@10 | 0.92 |
| cosine_precision@1 | 0.54 |
| cosine_precision@5 | 0.178 |
| cosine_precision@10 | 0.092 |
| cosine_recall@1 | 0.54 |
| cosine_recall@5 | 0.89 |
| cosine_recall@10 | 0.92 |
| cosine_ndcg@5 | 0.7328 |
| cosine_ndcg@10 | 0.742 |
| cosine_ndcg@100 | 0.7589 |
| cosine_mrr@5 | 0.68 |
| cosine_mrr@10 | 0.6835 |
| cosine_mrr@100 | 0.6868 |
| cosine_map@100 | 0.6868 |
| dot_accuracy@1 | 0.55 |
| dot_accuracy@5 | 0.89 |
| dot_accuracy@10 | 0.92 |
| dot_precision@1 | 0.55 |
| dot_precision@5 | 0.178 |
| dot_precision@10 | 0.092 |
| dot_recall@1 | 0.55 |
| dot_recall@5 | 0.89 |
| dot_recall@10 | 0.92 |
| dot_ndcg@5 | 0.7365 |
| dot_ndcg@10 | 0.7457 |
| dot_ndcg@100 | 0.7626 |
| dot_mrr@5 | 0.685 |
| dot_mrr@10 | 0.6885 |
| dot_mrr@100 | 0.6918 |
| **dot_map@100** | **0.6918** |
<!--
## Bias, Risks and Limitations
*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
-->
<!--
### Recommendations
*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
-->
## Training Details
### Training Hyperparameters
#### Non-Default Hyperparameters
- `eval_strategy`: steps
- `per_device_train_batch_size`: 32
- `per_device_eval_batch_size`: 32
- `learning_rate`: 1e-05
- `weight_decay`: 0.01
- `num_train_epochs`: 1.0
- `warmup_ratio`: 0.1
- `load_best_model_at_end`: True
#### All Hyperparameters
<details><summary>Click to expand</summary>
- `overwrite_output_dir`: False
- `do_predict`: False
- `eval_strategy`: steps
- `prediction_loss_only`: True
- `per_device_train_batch_size`: 32
- `per_device_eval_batch_size`: 32
- `per_gpu_train_batch_size`: None
- `per_gpu_eval_batch_size`: None
- `gradient_accumulation_steps`: 1
- `eval_accumulation_steps`: None
- `torch_empty_cache_steps`: None
- `learning_rate`: 1e-05
- `weight_decay`: 0.01
- `adam_beta1`: 0.9
- `adam_beta2`: 0.999
- `adam_epsilon`: 1e-08
- `max_grad_norm`: 1.0
- `num_train_epochs`: 1.0
- `max_steps`: -1
- `lr_scheduler_type`: linear
- `lr_scheduler_kwargs`: {}
- `warmup_ratio`: 0.1
- `warmup_steps`: 0
- `log_level`: passive
- `log_level_replica`: warning
- `log_on_each_node`: True
- `logging_nan_inf_filter`: True
- `save_safetensors`: True
- `save_on_each_node`: False
- `save_only_model`: False
- `restore_callback_states_from_checkpoint`: False
- `no_cuda`: False
- `use_cpu`: False
- `use_mps_device`: False
- `seed`: 42
- `data_seed`: None
- `jit_mode_eval`: False
- `use_ipex`: False
- `bf16`: False
- `fp16`: False
- `fp16_opt_level`: O1
- `half_precision_backend`: auto
- `bf16_full_eval`: False
- `fp16_full_eval`: False
- `tf32`: None
- `local_rank`: 0
- `ddp_backend`: None
- `tpu_num_cores`: None
- `tpu_metrics_debug`: False
- `debug`: []
- `dataloader_drop_last`: False
- `dataloader_num_workers`: 0
- `dataloader_prefetch_factor`: None
- `past_index`: -1
- `disable_tqdm`: False
- `remove_unused_columns`: True
- `label_names`: None
- `load_best_model_at_end`: True
- `ignore_data_skip`: False
- `fsdp`: []
- `fsdp_min_num_params`: 0
- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
- `fsdp_transformer_layer_cls_to_wrap`: None
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
- `deepspeed`: None
- `label_smoothing_factor`: 0.0
- `optim`: adamw_torch
- `optim_args`: None
- `adafactor`: False
- `group_by_length`: False
- `length_column_name`: length
- `ddp_find_unused_parameters`: None
- `ddp_bucket_cap_mb`: None
- `ddp_broadcast_buffers`: False
- `dataloader_pin_memory`: True
- `dataloader_persistent_workers`: False
- `skip_memory_metrics`: True
- `use_legacy_prediction_loop`: False
- `push_to_hub`: False
- `resume_from_checkpoint`: None
- `hub_model_id`: None
- `hub_strategy`: every_save
- `hub_private_repo`: False
- `hub_always_push`: False
- `gradient_checkpointing`: False
- `gradient_checkpointing_kwargs`: None
- `include_inputs_for_metrics`: False
- `eval_do_concat_batches`: True
- `fp16_backend`: auto
- `push_to_hub_model_id`: None
- `push_to_hub_organization`: None
- `mp_parameters`:
- `auto_find_batch_size`: False
- `full_determinism`: False
- `torchdynamo`: None
- `ray_scope`: last
- `ddp_timeout`: 1800
- `torch_compile`: False
- `torch_compile_backend`: None
- `torch_compile_mode`: None
- `dispatch_batches`: None
- `split_batches`: None
- `include_tokens_per_second`: False
- `include_num_input_tokens_seen`: False
- `neftune_noise_alpha`: None
- `optim_target_modules`: None
- `batch_eval_metrics`: False
- `eval_on_start`: False
- `eval_use_gather_object`: False
- `batch_sampler`: batch_sampler
- `multi_dataset_batch_sampler`: proportional
</details>
### Training Logs
| Epoch | Step | Training Loss | loss | val_evaluator_dot_map@100 |
|:----------:|:------:|:-------------:|:----------:|:-------------------------:|
| **0.5172** | **15** | **1.8109** | **1.2075** | **0.6918** |
| 1.0 | 29 | - | 1.2075 | 0.6918 |
* The bold row denotes the saved checkpoint.
### Framework Versions
- Python: 3.10.14
- Sentence Transformers: 3.0.1
- Transformers: 4.43.4
- PyTorch: 2.4.1+cu121
- Accelerate: 0.33.0
- Datasets: 2.21.0
- Tokenizers: 0.19.1
## Citation
### BibTeX
#### Sentence Transformers
```bibtex
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
```
#### GISTEmbedLoss
```bibtex
@misc{solatorio2024gistembed,
title={GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embedding Fine-tuning},
author={Aivin V. Solatorio},
year={2024},
eprint={2402.16829},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
```
<!--
## Glossary
*Clearly define terms in order to be accessible across audiences.*
-->
<!--
## Model Card Authors
*Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
-->
<!--
## Model Card Contact
*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
-->